Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition.
| Type of publication: | Idiap-RR |
| Citation: | Garner_Idiap-RR-15-2011 |
| Number: | Idiap-RR-15-2011 |
| Year: | 2011 |
| Month: | 5 |
| Institution: | Idiap |
| Abstract: | Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. It is argued that such normalisation leads naturally to a speech feature based on signal to noise ratio rather than absolute energy (or power). Explicit calculation of this {\em SNR-cepstrum} by means of a noise estimate is shown to have theoretical and practical advantages over the usual (energy based) cepstrum. The SNR-cepstrum is shown to be almost identical to the articulation index known in psycho-acoustics. Combination of the SNR-cepstrum with the well known perceptual linear prediction method is shown to be beneficial in noisy environments. |
| Keywords: | aurora, Automatic Speech Recognition, cepstral normalisation, Noise Robustness |
| Projects: |
IM2 |
| Authors: | |
| Crossref by |
Garner_SPECOM_2011 |
| Added by: | [ADM] |
| Total mark: | 0 |
|
Attachments
|
|
|
Notes
|
|
|
|
|