CONF
Cernak_INTERSPEECH_2013/IDIAP
Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture
Cernak, Milos
Na, Xingyu
Garner, Philip N.
pitch analysis
speech coding
speech synthesis
EXTERNAL
https://publications.idiap.ch/attachments/papers/2013/Cernak_INTERSPEECH_2013.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/Cernak_Idiap-RR-24-2013
Related documents
Proc. of Interspeech 2013
Lyon, France
2013
REPORT
Cernak_Idiap-RR-24-2013/IDIAP
Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture
Cernak, Milos
Na, Xingyu
Garner, Philip N.
pitch analysis
speech coding
speech synthesis
EXTERNAL
https://publications.idiap.ch/attachments/reports/2013/Cernak_Idiap-RR-24-2013.pdf
PUBLIC
Idiap-RR-24-2013
2013
Idiap
June 2013
Current HMM-based low bit rate speech coding systems work with
phonetic vocoders. Pitch contour coding (on frame or phoneme level)
is usually fairly orthogonal to other speech coding parameters. We
make an assumption in our work that the speech signal contains
supra-segmental cues. Hence, we present encoding of the pitch on the
syllable level, used in the framework of a recognition/synthesis
speech coder with phonetic vocoder. The results imply that high
accuracy pitch contour reconstruction with negligible speech quality
degradation is possible. The proposed pitch encoding technique
operates on 30 - 35 bits per second.