logo Idiap Research Institute        
 [BibTeX] [Marc21]
Multistream Speaker Diarization beyond Two Acoustic Feature Streams
Type of publication: Conference paper
Citation: Vijayasenan_ICASSP2010_2010
Booktitle: International Conference on Acoustics, Speech, and Signal Processing
Year: 2010
Crossref: diarmulti4feat
Abstract: Speaker diarization for meetings data are recently converging towards multistream systems. The most common complementary features used in combination with MFCC are Time Delay of Arrival (TDOA). Also other features have been proposed although, there are no reported improvements on top of MFCC+TDOA systems. In this work we investigate the combination of other feature sets along with MFCC+TDOA. We discuss issues and problems related to the weighting of four different streams proposing a solution based on a smoothed version of the speaker error. Experiments are presented on NIST RT06 meeting diarization evaluation. Results reveal that the combination of four acoustic feature streams results in a 30% relative improvement with respect to the MFCC+TDOA feature combination. To the authors’ best knowledge, this is the first successful attempt to improve the MFCC+TDOA baseline including other feature streams.
Keywords: Speaker Diarization
Projects Idiap
AMIDA
IM2
Authors Vijayasenan, Deepu
Valente, Fabio
Bourlard, Hervé
Added by: [UNK]
Total mark: 0
Attachments
  • Vijayasenan_ICASSP2010_2010.pdf
Notes