Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes

Type of publication:	Conference paper
Citation:	motlicek:TSD:2007
Booktitle:	Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD)
Year:	2007
Note:	IDIAP-RR 06-30
Crossref:	motlicek:rr06-30: Audio Coding Based on Long Temporal Contexts, Motlicek, Petr, Hermansky, Hynek, Garudadri, Harinath and Srinivasamurthy, Naveen, Idiap-RR-30-2006
Abstract:	Unlike classical state-of-the-art coders that are based on short-term spectra, our approach uses relatively long temporal segments of the audio signal in critical-band-sized sub-bands. We apply an auto-regressive model to approximate the Hilbert envelopes in the frequency sub-bands. The residual signals (Hilbert carriers) are demodulated, and thresholding functions are applied in the spectral domain. The Hilbert envelopes and carriers are quantized and transmitted to the decoder. Our experiments focused on designing a speech/audio coder that provides broadcast-radio-like audio quality at around 15-25 kbps. Objective quality measures, obtained on standard speech recordings, were compared with those of the state-of-the-art 3GPP-AMR speech coding system.
Userfields:	ipdmembership={speech},
Keywords:
Projects:	Idiap
Authors:	Motlicek, Petr; Hermansky, Hynek; Ganapathy, Sriram; Garudadri, Harinath
Attachments
motlicek-TSD-2007.pdf motlicek-TSD-2007.ps.gz
Notes
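The abstract describes splitting each sub-band signal into a Hilbert envelope and a Hilbert carrier, and approximating the envelope with an auto-regressive model. The following is a minimal sketch of that idea, not the authors' implementation: the abstract does not give the estimation procedure, so the function names, the model order, and the choice to fit the all-pole model by treating the squared envelope as if it were a power spectrum are all assumptions made for illustration. Sub-band decomposition, carrier thresholding, and quantization are left out.

```python
import numpy as np


def analytic_signal(x):
    """Analytic signal of a real sequence, built by zeroing negative frequencies."""
    n = len(x)
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(np.fft.fft(x) * weights)


def hilbert_envelope_and_carrier(x):
    """Split a (sub-band) signal into Hilbert envelope and unit-amplitude carrier."""
    z = analytic_signal(x)
    envelope = np.abs(z)
    carrier = np.cos(np.angle(z))  # envelope * carrier equals the input signal
    return envelope, carrier


def ar_envelope(envelope, order=20):
    """Smooth all-pole (AR) approximation of a temporal envelope.

    Assumption for this sketch: the squared envelope plays the role that a
    power spectrum plays in ordinary linear prediction, so its inverse FFT
    gives the autocorrelation-like sequence for the Yule-Walker equations.
    """
    power = envelope ** 2
    # Mirror to get an even, periodic sequence (so the inverse FFT is real).
    sym = np.concatenate([power, power[-2:0:-1]])
    r = np.fft.ifft(sym).real[:order + 1]

    # Yule-Walker normal equations: R a = r[1:], with R Toeplitz in r.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, r[1:order + 1])
    gain = r[0] - a @ r[1:order + 1]

    # Evaluate gain / |A|^2 on the same grid and undo the squaring.
    coeffs = np.concatenate([[1.0], -a])
    model_power = gain / np.abs(np.fft.fft(coeffs, len(sym))) ** 2
    return np.sqrt(model_power[:len(envelope)])
```

In this sketch the envelope of a whole segment is summarized by `order` coefficients plus a gain, which is the kind of compact description that could then be quantized. The order of 20 is an arbitrary illustrative value, not a figure taken from the paper.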