logo Idiap Research Institute        
 [BibTeX] [Marc21]
Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding
Type of publication: Conference paper
Citation: motlicek:mlmi:2007
Booktitle: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI)
Year: 2007
Note: IDIAP-RR 07-16
Crossref: motlicek:rr07-16:
Abstract: This paper proposes an analysis technique for wide-band audio applications based on the predictability of the temporal evolution of Quadrature Mirror Filter (QMF) sub-band signals. The input audio signal is first decomposed into 64 sub-band signals using QMF decomposition. The temporal envelopes in critically sampled QMF sub-bands are approximated using frequency domain linear prediction applied over relatively long time segments (e.g. 1000 ms). Line Spectral Frequency parameters related to autoregressive models are computed and quantized in each frequency sub-band. The sub-band residuals are quantized in the frequency domain using a combination of split Vector Quantization (VQ) (for magnitudes) and uniform scalar quantization (for phases). In the decoder, the sub-band signal is reconstructed using the quantized residual and the corresponding quantized envelope. Finally, application of inverse QMF reconstructs the audio signal. Even with simple quantization techniques and without any sophisticated modules, the proposed audio coder provides encouraging results in objective quality tests. Also, the proposed coder is easily scalable across a wide range of bit-rates.
Userfields: ipdmembership={speech},
Keywords:
Projects Idiap
Authors Motlicek, Petr
Hermansky, Hynek
Ganapathy, Sriram
Garudadri, Harinath
Added by: [UNK]
Total mark: 0
Attachments
  • motlicek-mlmi-2007.pdf
  • motlicek-mlmi-2007.ps.gz
Notes