ARTICLE Garner_SPECOM_2011/IDIAP Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition Garner, Philip N. EXTERNAL http://publications.idiap.ch/attachments/papers/2011/Garner_SPECOM_2011.pdf PUBLIC http://publications.idiap.ch/index.php/publications/showcite/Garner_Idiap-RR-15-2011 Related documents Speech Communication 53 8 991--1001 2011 http://dx.doi.org/10.1016/j.specom.2011.05.007 doi Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. In this paper, it is argued that such normalisation leads naturally to a speech feature based on signal to noise ratio rather than absolute energy (or power). Explicit calculation of this SNR-cepstrum by means of a noise estimate is shown to have theoretical and practical advantages over the usual (energy based) cepstrum. The relationship between the SNR-cepstrum and the articulation index, known in psycho-acoustics, is discussed. Experiments are presented suggesting that the combination of the SNR-cepstrum with the well known perceptual linear prediction method can be beneficial in noisy environments. REPORT Garner_Idiap-RR-15-2011/IDIAP Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition. Garner, Philip N. aurora Automatic Speech Recognition cepstral normalisation Noise Robustness EXTERNAL http://publications.idiap.ch/attachments/reports/2010/Garner_Idiap-RR-15-2011.pdf PUBLIC Idiap-RR-15-2011 2011 Idiap May 2011 Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. It is argued that such normalisation leads naturally to a speech feature based on signal to noise ratio rather than absolute energy (or power). Explicit calculation of this {\em SNR-cepstrum} by means of a noise estimate is shown to have theoretical and practical advantages over the usual (energy based) cepstrum. The SNR-cepstrum is shown to be almost identical to the articulation index known in psycho-acoustics. Combination of the SNR-cepstrum with the well known perceptual linear prediction method is shown to be beneficial in noisy environments.