Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition
| Type of publication: | Journal paper |
| Citation: | Garner_SPECOM_2011 |
| Publication status: | Published |
| Journal: | Speech Communication |
| Volume: | 53 |
| Number: | 8 |
| Year: | 2011 |
| Month: | October |
| Pages: | 991--1001 |
| Crossref: | Garner_Idiap-RR-15-2011: |
| DOI: | http://dx.doi.org/10.1016/j.specom.2011.05.007 |
| Abstract: | Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. In this paper, it is argued that such normalisation leads naturally to a speech feature based on signal to noise ratio rather than absolute energy (or power). Explicit calculation of this SNR-cepstrum by means of a noise estimate is shown to have theoretical and practical advantages over the usual (energy based) cepstrum. The relationship between the SNR-cepstrum and the articulation index, known in psycho-acoustics, is discussed. Experiments are presented suggesting that the combination of the SNR-cepstrum with the well known perceptual linear prediction method can be beneficial in noisy environments. |
| Keywords: | |
| Projects: |
IM2 |
| Authors: | |
| Added by: | [UNK] |
| Total mark: | 0 |
|
Attachments
|
|
|
Notes
|
|
|
|
|