SNR Features for Automatic Speech Recognition

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Idiap-RR
Citation:	Garner_Idiap-RR-25-2009
Number:	Idiap-RR-25-2009
Year:	2009
Month:	9
Institution:	Idiap
Abstract:	When combined with cepstral normalisation techniques, the features normally used in Automatic Speech Recognition are based on Signal to Noise Ratio (SNR). We show that calculating SNR from the outset, rather than relying on cepstral normalisation to produce it, gives features with a number of practical and mathematical advantages over power-spectral based ones. In a detailed analysis, we derive Maximum Likelihood and Maximum a-Posteriori estimates for SNR based features, and show that they can outperform more conventional ones, especially when subsequently combined with cepstral variance normalisation. We further show anecdotal evidence that SNR based features lend themselves well to noise estimates based on low-energy envelope tracking.
Keywords:
Projects	IM2
Authors	Garner, Philip N.
Crossref by	Garner_ASRU_2009
Added by:	[ADM]
Total mark:	0
Attachments
Garner_Idiap-RR-25-2009.pdf (MD5: c3de4c7bb1450e7748fa4fe467d73b73)
Notes

processing time: 0.0003 seconds.