Idiap Research Institute
Integrating audio and vision for robust automatic gender recognition
Type of publication: Idiap-RR
Citation: Pronobis_Idiap-RR-73-2008
Number: Idiap-RR-73-2008
Year: 2008
Month: 11
Institution: Idiap
Abstract: We propose a multi-modal Automatic Gender Recognition (AGR) system based on audio-visual cues and present a thorough evaluation of it in realistic scenarios. First, we analyze the robustness of different audio and visual features under varying conditions and create two uni-modal AGR systems. Then, we build an integrated audio-visual system by fusing information from each modality at the classifier level. Our extensive studies on the BANCA corpus, which comprises datasets of varying complexity, show that: (a) the audio-based system is more robust than the vision-based system; (b) integration of audio-visual cues yields a resilient system and improves performance in noisy conditions.
Projects: Idiap
Authors: Pronobis, Marianna; Magimai.-Doss, Mathew
  • Pronobis_Idiap-RR-73-2008.pdf (MD5: 69ea98fb2c1c3834a57b4891e147c52b)