CONF Duffner_ICSAP_2011/IDIAP
The TA2 Database - A Multi-Modal Database from Home Entertainment
Duffner, Stefan; Motlicek, Petr; Korchagin, Danil
EXTERNAL https://publications.idiap.ch/attachments/papers/2010/Duffner_ICSAP_2011.pdf
PUBLIC https://publications.idiap.ch/index.php/publications/showcite/Duffner_Idiap-RR-37-2010 Related documents
International Conference on Signal Acquisition and Processing
Singapore
2011
February 2011
This paper presents a new database containing high-definition audio and video recordings in a rather unconstrained video-conferencing-like environment. The database consists of recordings of people sitting around a table in two separate rooms communicating and playing online games with each other. Extensive annotation of head positions, voice activity and word transcription has been performed on the dataset, making it especially useful for evaluating automatic speech-recognition, voice activity detection, speaker localisation, multi-face detection and tracking, and other audio-visual analysis algorithms.

REPORT Duffner_Idiap-RR-37-2010/IDIAP
The TA2 Database - A Multi-Modal Database from Home Entertainment
Duffner, Stefan; Motlicek, Petr; Korchagin, Danil
EXTERNAL https://publications.idiap.ch/attachments/reports/2011/Duffner_Idiap-RR-37-2010.pdf
PUBLIC
Idiap-RR-37-2010
2010
Idiap
Rue Marconi 19, CH-1920 Martigny
October 2010
This paper presents a new database containing high-definition audio and video recordings in a rather unconstrained video-conferencing-like environment. The database consists of recordings of people sitting around a table in two separate rooms communicating and playing online games with each other. Extensive annotation of head positions, voice activity and word transcription has been performed on the dataset, making it especially useful for evaluating automatic speech-recognition, voice activity detection, speaker localisation, multi-face detection and tracking, and other audio-visual analysis algorithms.