ARTICLE
Garner_SPECOM_2011/IDIAP
Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition
Garner, Philip N.
EXTERNAL
https://publications.idiap.ch/attachments/papers/2011/Garner_SPECOM_2011.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/Garner_Idiap-RR-15-2011
Related documents
Speech Communication
53
8
991--1001
2011
http://dx.doi.org/10.1016/j.specom.2011.05.007
doi
Cepstral normalisation in automatic speech recognition is
investigated in the context of robustness to additive noise. In
this paper, it is argued that such normalisation leads naturally to
a speech feature based on signal to noise ratio rather than absolute
energy (or power). Explicit calculation of this SNR-cepstrum
by means of a noise estimate is shown to have theoretical and
practical advantages over the usual (energy based) cepstrum. The
relationship between the SNR-cepstrum and the articulation index,
known in psycho-acoustics, is discussed. Experiments are presented
suggesting that the combination of the SNR-cepstrum with the well
known perceptual linear prediction method can be beneficial in noisy
environments.
REPORT
Garner_Idiap-RR-15-2011/IDIAP
Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition.
Garner, Philip N.
aurora
Automatic Speech Recognition
cepstral normalisation
Noise Robustness
EXTERNAL
https://publications.idiap.ch/attachments/reports/2010/Garner_Idiap-RR-15-2011.pdf
PUBLIC
Idiap-RR-15-2011
2011
Idiap
May 2011
Cepstral normalisation in automatic speech recognition is
investigated in the context of robustness to additive noise. It is
argued that such normalisation leads naturally to a speech feature
based on signal to noise ratio rather than absolute energy (or
power). Explicit calculation of this {\em SNR-cepstrum} by means of a
noise estimate is shown to have theoretical and practical advantages
over the usual (energy based) cepstrum. The SNR-cepstrum is shown
to be almost identical to the articulation index known in
psycho-acoustics. Combination of the SNR-cepstrum with the well
known perceptual linear prediction method is shown to be beneficial
in noisy environments.