logo Idiap Research Institute        
 [BibTeX] [Marc21]
Incremental Syllable-Context Phonetic Vocoding
Type of publication: Idiap-RR
Citation: Cernak_Idiap-RR-05-2015
Number: Idiap-RR-05-2015
Year: 2015
Month: 2
Institution: Idiap
Abstract: Current very low bit rate speech coders are, due to complexity limitations, designed to work off-line. This paper investigates incremental speech coding that operates real-time and incrementally (i.e., encoded speech depends only on already-uttered speech without the need of future speech information). Since human speech communication is asynchronous (i.e., different information flows being simultaneously processed), we hypothesised that such an incremental speech coder should also operate asynchronously. To accomplish this task, we describe speech coding that reflects the human cortical temporal sampling that packages information into units of different temporal granularity, such as phonemes and syllables, in parallel. More specifically, a phonetic vocoder — cascaded speech recognition and synthesis systems — extended with syllable-based information transmission mechanisms is investigated. There are two main aspects evaluated in this work, the synchronous and asynchronous coding. Synchronous coding refers to the case when the phonetic vocoder and speech generation process depend on the syllable boundaries during encoding and decoding respectively. On the other hand, asynchronous coding refers to the case when the phonetic encoding and speech generation processes are done independently of the syllable boundaries. Our experiments confirmed that the asynchronous incremental speech coding performs better, in terms of intelligibility and overall speech quality, mainly due to better alignment of the segmental and prosodic information. The proposed vocoding operates at an uncompressed bit rate of 213 bits/sec and achieves an average communication delay of 243 ms.
Keywords: parametric speech synthesis, Very low bit rate speech coding
Projects Idiap
Authors Cernak, Milos
Garner, Philip N.
Lazaridis, Alexandros
Motlicek, Petr
Na, Xingyu
Crossref by Cernak_TASLP_2015
Added by: [ADM]
Total mark: 0
  • Cernak_Idiap-RR-05-2015.pdf (MD5: 75876c4eb12db9255b0e4e8ee2612d57)