CONF
Aran_ICPR2010_2010/IDIAP
Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations
Aran, Oya
Gatica-Perez, Daniel
EXTERNAL
https://publications.idiap.ch/attachments/papers/2010/Aran_ICPR2010_2010.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/Aran_Idiap-RR-17-2010
Related documents
20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010
Istanbul, Turkey
2010
August 2010
This paper addresses the multimodal nature of social dominance and presents multimodal fusion techniques to combine audio and visual nonverbal cues for dominance estimation in small group conversations. We combine the two modalities both at the feature extraction level and at the classifier level via score and rank level fusion. The classification is done by a simple rule-based estimator. We perform experiments on a new 10-hour dataset derived from the popular AMI meeting corpus. We objectively evaluate the performance of each modality and each cue alone and in combination. Our results show that the combination of audio and visual cues is necessary to achieve the best performance.
REPORT
Aran_Idiap-RR-17-2010/IDIAP
Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations
Aran, Oya
Gatica-Perez, Daniel
EXTERNAL
https://publications.idiap.ch/attachments/reports/2010/Aran_Idiap-RR-17-2010.pdf
PUBLIC
Idiap-RR-17-2010
2010
Idiap
July 2010
This paper addresses the multimodal nature of social dominance and presents multimodal fusion techniques to combine audio and visual nonverbal cues for dominance estimation in small group conversations. We combine the two modalities both at the feature extraction level and at the classifier level via score and rank level fusion. The classification is done by a simple rule-based estimator. We perform experiments on a new 10-hour dataset derived from the popular AMI meeting corpus. We objectively evaluate the performance of each modality and each cue alone and in combination. Our results show that the combination of audio and visual cues is necessary to achieve the best performance.