Phonological Vocoding Using Artificial Neural Networks

Type of publication:	Conference paper
Citation:	Cernak_ICASSP15_2015
Publication status:	Published
Booktitle:	IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Year:	2015
Month:	April
Pages:	4844-4848
Publisher:	IEEE
Location:	Brisbane, Australia
Crossref:	Cernak_Idiap-RR-04-2015: Phonological vocoding using artificial neural networks, Cernak, Milos, Potard, Blaise and Garner, Philip N., Idiap-RR-04-2015
DOI:	10.1109/ICASSP.2015.7178891
Abstract:	We investigate a vocoder based on artificial neural networks using a phonological speech representation. Speech decomposition is based on phonological encoders, realised as neural network classifiers, that are trained for a particular language. Speech reconstruction uses a Deep Neural Network (DNN) to map phonological feature posteriors to speech parameters (line spectra and glottal signal parameters), followed by LPC resynthesis. This DNN is trained on a target voice without transcriptions, in a semi-supervised manner. Both encoder and decoder are based on neural networks, so vocoding is achieved with a simple, fast forward pass. We present an experiment with French vocoding and a target male voice trained on a 21-hour audiobook. We also show an application of the phonological vocoder to low bit-rate speech coding, in which the transmitted phonological posteriors are pruned and quantized. The vocoder with scalar quantization operates at 1 kbps, with potential for lower bit rates.
Keywords:
Projects:	Idiap, armasuisse
Authors:	Cernak, Milos; Potard, Blaise; Garner, Philip N.
Attachments
Cernak_ICASSP15_2015.pdf
Notes
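The low bit-rate application described in the abstract (pruning and scalar quantization of phonological posteriors) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the pruning threshold, the feature count, the bit depth and the frame rate are all hypothetical values, chosen only to show how a bit budget on the order of 1 kbps could arise.

```python
# Illustrative sketch only; every name and numeric setting here is an
# assumption and is not taken from the paper.

def prune(posteriors, threshold=0.5):
    """Zero out posteriors below a threshold (hypothetical pruning rule)."""
    return [p if p >= threshold else 0.0 for p in posteriors]


def quantize(posteriors, bits=1):
    """Uniform scalar quantization of values in [0, 1] to 2**bits levels."""
    levels = (1 << bits) - 1
    # int(x + 0.5) rounds half up, avoiding Python's round-half-to-even.
    return [int(p * levels + 0.5) for p in posteriors]


def dequantize(codes, bits=1):
    """Map integer codes back to values in [0, 1]."""
    levels = (1 << bits) - 1
    return [c / levels for c in codes]


def bit_rate(num_features, bits, frames_per_second):
    """Bits per second needed to send one code per feature per frame."""
    return num_features * bits * frames_per_second


# One frame of made-up posteriors for four hypothetical phonological features.
frame = [0.02, 0.91, 0.40, 0.75]
codes = quantize(prune(frame), bits=2)
restored = dequantize(codes, bits=2)
```

With these assumed settings, 10 features at 1 bit each and 100 frames per second would give 10 x 1 x 100 = 1000 bit/s. The configuration actually used in the paper may differ.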