CONF Motlicek_INTERSPEECH2009-3_2009/IDIAP Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec Motlicek, Petr Ganapathy, Sriram Hermansky, Hynek Arithmetic Coding Audio Coding Entropy Coding Frequency Domain Linear Prediction (FDLP) Huffman Coding EXTERNAL http://publications.idiap.ch/attachments/papers/2009/Motlicek_INTERSPEECH2009-3_2009.pdf PUBLIC ISCA - 10th Annual Conference of the International Speech Communication Association Brighton, England 2009 ISCA 2009 September 2009 2591-2594 1990-9772 A speech/audio codec based on Frequency Domain Linear Prediction (FDLP) exploits auto-regressive modeling to approximate instantaneous energy in critical frequency sub-bands of relatively long input segments. The current version of the FDLP codec operating at 66 kbps has been shown to provide comparable subjective listening quality results to state-of-the-art codecs on similar bit-rates even without employing standard blocks such as entropy coding or simultaneous masking. This paper describes an experimental work to increase compression efficiency of the FDLP codec by employing entropy coding. Unlike conventional Huffman coding employed in current speech/audio coding systems, we describe an efficient way to exploit arithmetic coding to entropy compress quantized spectral magnitudes of the sub-band FDLP residuals. Such an approach provides 11% (âˆ¼ 3 kbps) bit-rate reduction compared to the Huffman coding algorithm (âˆ¼ 1 kbps).

</datafield>

<subfield code="a">Motlicek_INTERSPEECH2009-3_2009/IDIAP</subfield>

</datafield>

<subfield code="a">Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec</subfield>

</datafield>

<subfield code="a">Motlicek, Petr</subfield>

</datafield>

<subfield code="a">Ganapathy, Sriram</subfield>

</datafield>

<subfield code="a">Hermansky, Hynek</subfield>

</datafield>

<subfield code="a">Arithmetic Coding</subfield>

</datafield>

<subfield code="a">Audio Coding</subfield>

</datafield>

<subfield code="a">Entropy Coding</subfield>

</datafield>

<subfield code="a">Frequency Domain Linear Prediction (FDLP)</subfield>

</datafield>

<subfield code="a">Huffman Coding</subfield>

</datafield>

<subfield code="i">EXTERNAL</subfield>

<subfield code="u">http://publications.idiap.ch/attachments/papers/2009/Motlicek_INTERSPEECH2009-3_2009.pdf</subfield>

<subfield code="x">PUBLIC</subfield>

</datafield>

<subfield code="a">ISCA - 10th Annual Conference of the International Speech Communication Association</subfield>

<subfield code="c">Brighton, England</subfield>

</datafield>

</datafield>

<subfield code="d">September 2009</subfield>

</datafield>

</datafield>

<subfield code="a">A speech/audio codec based on Frequency Domain Linear Prediction (FDLP) exploits auto-regressive modeling to approximate instantaneous energy in critical frequency sub-bands of relatively long input segments. The current version of the FDLP codec operating at 66 kbps has been shown to provide comparable subjective listening quality results to state-of-the-art codecs on similar bit-rates even without employing standard blocks such as entropy coding or simultaneous masking. This paper describes an experimental work to increase compression efficiency of the FDLP codec by employing entropy coding. Unlike conventional Huffman coding employed in current speech/audio coding systems, we describe an efficient way to exploit arithmetic coding to entropy compress quantized spectral magnitudes of the sub-band FDLP residuals. Such an approach provides 11% (âˆ¼ 3 kbps) bit-rate reduction compared to the Huffman coding algorithm (âˆ¼ 1 kbps).</subfield>

</datafield>

</record>

</collection>