On the Combination of Auditory and Modulation Frequency Channels for ASR applications

Type of publication:	Idiap-RR
Citation:	valente:rr08-12
Number:	Idiap-RR-12-2008
Year:	2008
Institution:	IDIAP
Note:	Published in Interspeech 2008
Abstract:	This paper investigates the combination of evidence from different frequency channels obtained by filtering the speech signal at different auditory and modulation frequencies. In our previous work \cite{icassp2008}, we showed that combining classifiers trained on different ranges of modulation frequencies is more effective when performed in a sequential (hierarchical) fashion. In this work we verify that combining classifiers trained on different ranges of auditory frequencies is more effective when performed in a parallel fashion. Furthermore, we propose a neural-network-based architecture for combining evidence from different auditory-modulation frequency sub-bands that takes advantage of these findings. This reduces the final WER by 6.2% absolute (from 45.8% to 39.6%) compared with the single-classifier approach on an LVCSR task.
Userfields:	ipdmembership={speech},
Keywords:
Projects	Idiap
Authors	Valente, Fabio; Hermansky, Hynek
Crossref by	valente:Interspeech2:2008
Attachments
valente-idiap-rr-08-12.pdf valente-idiap-rr-08-12.ps.gz