Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition
Type of publication: | Conference paper |
Citation: | sgarimel:is:2008 |
Booktitle: | Interspeech 2008 |
Year: | 2008 |
Note: | IDIAP-RR 08-25 |
Crossref: | sgarimel:rr08-25: |
Abstract: | We propose a new auditory inspired feature extraction technique for automatic speech recognition (ASR). Features are extracted by filtering the temporal trajectory of spectral energies in each critical band of speech by a bank of finite impulse response (FIR) filters. Impulse responses of these filters are derived from a modified Gabor envelope in order to emulate asymmetries of the temporal receptive field (TRF) profiles observed in higher level auditory neurons. We obtain $11.4\% $ relative improvement in word error rate on OGI-Digits database and, $3.2\%$ relative improvement in phoneme error rate on TIMIT database over the MRASTA technique. |
Userfields: | ipdmembership={speech}, |
Keywords: | |
Projects |
Idiap |
Authors | |
Added by: | [UNK] |
Total mark: | 0 |
Attachments
|
|
Notes
|
|
|