Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition
Type of publication: | Journal paper |
Citation: | Garner_SPECOM_2011 |
Publication status: | Published |
Journal: | Speech Communication |
Volume: | 53 |
Number: | 8 |
Year: | 2011 |
Month: | October |
Pages: | 991--1001 |
Crossref: | Garner_Idiap-RR-15-2011 |
DOI: | http://dx.doi.org/10.1016/j.specom.2011.05.007 |
Abstract: | Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. In this paper, it is argued that such normalisation leads naturally to a speech feature based on signal to noise ratio rather than absolute energy (or power). Explicit calculation of this SNR-cepstrum by means of a noise estimate is shown to have theoretical and practical advantages over the usual (energy based) cepstrum. The relationship between the SNR-cepstrum and the articulation index, known in psycho-acoustics, is discussed. Experiments are presented suggesting that the combination of the SNR-cepstrum with the well known perceptual linear prediction method can be beneficial in noisy environments. |
Projects: | IM2 |
Authors | |