ARTICLE
moeller-specom02/IDIAP
Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model
Moeller, S.
Bourlard, Hervé
EXTERNAL
https://publications.idiap.ch/attachments/reports/2001/rr01-17.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/moeller-01-17
Related documents
Speech Communication
2002
IDIAP-RR 01-17
This paper addresses the impact of telephone transmission channels on automatic speech recognition (ASR) performance. A real-time simulation model is described and implemented, which allows impairments that are encountered in traditional as well as modern (mobile, IP-based) networks to be flexibly and efficiently generated. The model is based on input parameters which are known to telephone network planners; thus, it can be applied without measuring specific network characteristics. It can be used for an analytic assessment of the impact of channel impairments on ASR performance, for producing training material with defined transmission characteristics, or for testing spoken dialogue systems in realistic network environments. In the present paper, we present an investigation of the first point. Two speech recognizers which are integrated into a spoken dialogue system for information retrieval are assessed in relation to controlled amounts of transmission degradations. The measured ASR performance degradation is compared to speech quality degradation in human-human communication. It turns out that different behavior can be expected for some impairments. This fact has to be taken into account in both telephone network planning as well as in speech and language technology development.
REPORT
moeller-01-17/IDIAP
Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model
Moeller, S.
Bourlard, Hervé
EXTERNAL
https://publications.idiap.ch/attachments/reports/2001/rr01-17.pdf
PUBLIC
Idiap-RR-17-2001
2001
IDIAP
To be published in Speech Communication
This paper addresses the impact of telephone transmission channels on automatic speech recognition (ASR) performance. A real-time simulation model is described and implemented, which allows impairments that are encountered in traditional as well as modern (mobile, IP-based) networks to be flexibly and efficiently generated. The model is based on input parameters which are known to telephone network planners; thus, it can be applied without measuring specific network characteristics. It can be used for an analytic assessment of the impact of channel impairments on ASR performance, for producing training material with defined transmission characteristics, or for testing spoken dialogue systems in realistic network environments. In the present paper, we present an investigation of the first point. Two speech recognizers which are integrated into a spoken dialogue system for information retrieval are assessed in relation to controlled amounts of transmission degradations. The measured ASR performance degradation is compared to speech quality degradation in human-human communication. It turns out that different behavior can be expected for some impairments. This fact has to be taken into account in both telephone network planning as well as in speech and language technology development.