CONF
valente:Interspeech2:2008/IDIAP
On the Combination of Auditory and Modulation Frequency Channels for ASR applications
Valente, Fabio
Hermansky, Hynek
EXTERNAL
https://publications.idiap.ch/attachments/papers/2008/valente-Interspeech2-2008.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/valente:rr08-12
Related documents
Interspeech 2008
2008
IDIAP-RR 08-12
This paper investigates the combination of evidence coming from different frequency channels obtained filtering the speech signal at different auditory and modulation frequencies. In our previous work \cite{icassp2008}
REPORT
valente:rr08-12/IDIAP
On the Combination of Auditory and Modulation Frequency Channels for ASR applications
Valente, Fabio
Hermansky, Hynek
EXTERNAL
https://publications.idiap.ch/attachments/reports/2008/valente-idiap-rr-08-12.pdf
PUBLIC
Idiap-RR-12-2008
2008
IDIAP
Published in Interspeech 2008
This paper investigates the combination of evidence coming from different frequency channels obtained filtering the speech signal at different auditory and modulation frequencies. In our previous work \cite{icassp2008}, we showed that combination of classifiers trained on different ranges of {\it modulation} frequencies is more effective if performed in sequential (hierarchical) fashion. In this work we verify that combination of classifiers trained on different ranges of {\it auditory} frequencies is more effective if performed in parallel fashion. Furthermore we propose an architecture based on neural networks for combining evidence coming from different auditory-modulation frequency sub-bands that takes advantages of previous findings. This reduces the final WER by 6.2\% (from 45.8\% to 39.6\%) w.r.t the single classifier approach in a LVCSR task.