Perceptually motivated Sub-band Decomposition for FDLP Audio Coding

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Conference paper
Citation:	Motlicek_TSD2008_2008
Booktitle:	Text, Speech and Dialogue
Series:	Series of Lecture Notes in Artificial Intelligence (LNAI)
Volume:	5246
Year:	2008
Month:	9
Publisher:	Springer-Verlag Berlin, Heidelberg
Location:	Brno, Czech Republic
ISBN:	978-3-540-87390-7
Abstract:	This paper describes employment of non-uniform QMF decomposition to increase the efficiency of a generic wide-band audio coding system based on Frequency Domain Linear Prediction (FDLP). The base line FDLP codec, operating at high bit-rates (~136 kbps,',','), exploits a uniform QMF decomposition into 64 sub-bands followed by sub-band processing based on FDLP. Here, we propose a non-uniform QMF decomposition into 32 frequency sub-bands obtained by merging 64 uni- form QMF bands. The merging operation is performed in such a way that bandwidths of the resulting critically sampled sub-bands emulate the characteristics of the critical band filters in the human auditory system. Such frequency decomposition, when employed in the FDLP audio codec, results in a bit-rate reduction of 40% over the base line. We also describe the complete audio codec, which provides high-fidelity audio compression at ~66 kbps. In subjective listening tests, the FDLP codec outperforms MPEG-1 Layer 3 (MP3) and achieves similar qualities as MPEG-4 HE-AAC codec.
Keywords:	Audio Coding, Frequency Domain Linear Prediction (FDLP), speech coding
Projects	Idiap
Authors	Motlicek, Petr Ganapathy, Sriram Hermansky, Hynek Garudadri, Harinath Athineos, Marios
Added by:	[UNK]
Total mark:	0
Attachments
Motlicek_TSD2008_2008.pdf
Notes

processing time: 0.0003 seconds.