On Factorizing Spectral Dynamics for Robust Speech Recognition
Type of publication: | Idiap-RR |
Citation: | vivek-rr-03-32 |
Number: | Idiap-RR-32-2003 |
Year: | 2003 |
Institution: | IDIAP |
Note: | in proceedings of Eurospeech 2003 |
Abstract: | In this paper, we introduce new dynamic speech features based on the modulation spectrum. These features, termed Mel-cepstrum Modulation Spectrum (MCMS,',','), map the time trajectories of the spectral dynamics into a series of slow and fast moving orthogonal components, providing a more general and discriminative range of dynamic features than traditional delta and acceleration features. The features can be seen as the outputs of an array of band-pass filters spread over the cepstral modulation frequency range of interest. In experiments, it is shown that, as well as providing a slight improvement in clean conditions, these new dynamic features yield a significant increase in speech recognition performance in various noise conditions when compared directly to the standard temporal derivative features and RASTA-PLP features. |
Userfields: | ipdmembership={speech}, |
Keywords: | |
Projects |
Idiap |
Authors | |
Crossref by |
vivek-rr-03-32b |
Added by: | [UNK] |
Total mark: | 0 |
Attachments
|
|
Notes
|
|
|