logo Idiap Research Institute        
 [BibTeX] [Marc21]
HMM/ANN Based Spectral Peak Location Estimation for Noise Robust Speech Recognition
Type of publication: Idiap-RR
Citation: ikbal-rr-04-50
Number: Idiap-RR-50-2004
Year: 2004
Institution: IDIAP
Address: Martigny, Switzerland
Note: Submitted for publication
Abstract: In this paper, we present a HMM/ANN based algorithm to estimate the spectral peak locations. This algorithm makes use of distinct time-frequency (TF) patterns in the spectrogram for estimating the peak locations. Such an use of TF patterns is expected to impose temporal constraints during the peak estimation task, thereby yielding a smoother estimate of the peaks over time. Additionally, the algorithm use an ergodic topology for the HMM/ANN, thus allowing an estimation of a varying number of peak locations over time. The usefulness of the proposed algorithm is evaluated in the framework of a recently introduced noise robust feature called spectro-temporal activity pattern (STAP) feature. Interestingly, recently introduced, phase autocorrelation (PAC) spectrum, with enhanced spectral peaks and smoothed spectral valleys, turns out to be more appropriate for this algorithm than the regular spectrum.
Userfields: ipdinar={2004}, ipdmembership={speech}, language={English},
Projects Idiap
Authors Ikbal, Shajith
Bourlard, Hervé
Magimai.-Doss, Mathew
Added by: [UNK]
Total mark: 0
  • rr04-50.pdf
  • rr04-50.ps.gz