Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition.
Type of publication: | Idiap-RR |
Citation: | Garner_Idiap-RR-15-2011 |
Number: | Idiap-RR-15-2011 |
Year: | 2011 |
Month: | 5 |
Institution: | Idiap |
Abstract: | Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. It is argued that such normalisation leads naturally to a speech feature based on signal to noise ratio rather than absolute energy (or power). Explicit calculation of this {\em SNR-cepstrum} by means of a noise estimate is shown to have theoretical and practical advantages over the usual (energy based) cepstrum. The SNR-cepstrum is shown to be almost identical to the articulation index known in psycho-acoustics. Combination of the SNR-cepstrum with the well known perceptual linear prediction method is shown to be beneficial in noisy environments. |
Keywords: | aurora, Automatic Speech Recognition, cepstral normalisation, Noise Robustness |
Projects |
IM2 |
Authors | |
Crossref by |
Garner_SPECOM_2011 |
Added by: | [ADM] |
Total mark: | 0 |
Attachments
|
|
Notes
|
|
|