CONF
motlicek:tsd:2006/IDIAP
Speech Coding based on Spectral Dynamics
Motlicek, Petr
Hermansky, Hynek
Garudadri, Harinath
Srinivasamurthy, Naveen
EXTERNAL
https://publications.idiap.ch/attachments/papers/2006/motlicek-tsd-2006.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/motlicek:rr06-05
Related documents
Ninth International Conference on TEXT, SPEECH and DIALOGUE (TSD)
2006
IDIAP-RR 06-05
In this paper we present first experimental results with a novel audio coding technique based on approximating Hilbert envelopes of relatively long segments of audio signal in critical-band-sized sub-bands by autoregressive model. We exploit the generalized autocorrelation linear predictive technique that allows for a better control of fitting the peaks and troughs of the envelope in the sub-band. Despite introducing longer algorithmic delay, improved coding efficiency is achieved. Since the described technique does not directly model short-term spectral envelopes of the signal, it is suitable not only for coding speech but also for coding of other audio signals.
REPORT
motlicek:rr06-05/IDIAP
Speech Coding based on Spectral Dynamics
Motlicek, Petr
Hermansky, Hynek
Garudadri, Harinath
Srinivasamurthy, Naveen
EXTERNAL
https://publications.idiap.ch/attachments/reports/2006/motlicek-idiap-rr-06-05.pdf
PUBLIC
Idiap-RR-05-2006
2006
IDIAP
Submitted for publication
In this paper we present first experimental results with a novel audio coding technique based on approximating Hilbert envelopes of relatively long segments of audio signal in critical-band-sized sub-bands by autoregressive model. We exploit the generalized autocorrelation linear predictive technique that allows for a better control of fitting the peaks and troughs of the envelope in the sub-band. Despite introducing longer algorithmic delay, improved coding efficiency is achieved. Since the described technique does not directly model short-term spectral envelopes of the signal, it is suitable not only for coding speech but also for coding of other audio signals.