logo Idiap Research Institute        
 [BibTeX] [Marc21]
A Probabilistic Framework for Multiple Speaker Localization
Type of publication: Idiap-RR
Citation: Oualil_Idiap-RR-37-2012
Number: Idiap-RR-37-2012
Year: 2012
Month: 12
Institution: Idiap
Note: Submitted to ICASSP'13
Abstract: This paper presents a novel probabilistic framework for localizing multiple speakers with a microphone array. In this framework, the generalized cross correlation function (GCC) of each microphone pair is interpreted as a probability distribution of the time difference of arrival (TDOA) and subsequently approximated as a Gaussian mixture. The distribution parameters are estimated with a weighted expectation maximization algorithm. Then, the joint distribution of the TDOA Gaussian mixtures is mapped to a multimodal distribution in the location space, where each mode represents a potential source location. The approach taken here performs the localization by 1) reducing the search space to some regions that are likely to contain a source and then 2) extracting the actual speaker locations with a numerical optimization algorithm. The effectiveness of the proposed approach is shown using the AV16.3 corpus.
Keywords: Gaussian mixture, localization, microphone arrays, multiple speakers, Steered response power
Projects FP 7
Authors Oualil, Youssef
Magimai.-Doss, Mathew
Faubel, Friedrich
Klakow, Dietrich
Editors Oualil, Youssef
Magimai.-Doss, Mathew
Crossref by Oualil_ICASSP2013_2013
Added by: [ADM]
Total mark: 0
  • Oualil_Idiap-RR-37-2012.pdf