logo Idiap Research Institute        
All publications

2013
A Probabilistic Framework for Multiple Speaker Localization, Youssef Oualil, Mathew Magimai.-Doss, Friedrich Faubel and Dietrich Klakow, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013
attachment
Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target, Vasil Khalidov, Florence Forbes and Radu Horaud, in: Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013
attachment
3D Head Pose and Gaze Tracking and Their Application to Diverse Multimodal Tasks, Kenneth Alberto Funes Mora, in: Doctoral consortium of the 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, 2013
[DOI]
Context Aware Addressee Estimation for Human Robot Interaction, Samira Sheikhi, Dinesh Babu Jayagopi, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of the 6th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Gaze in Multimodal Interaction, 2013
Leveraging the robot dialog state for visual focus of attention recognition, Samira Sheikhi, Vasil Khalidov, David Klotz, Britta Wrede and Jean-Marc Odobez, in: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, 2013
Evaluating Intra- and Crosslingual Adaptation for Non-native Speech Recognition in a Bilingual Environment, Gyorgy Szaszak and Philip N. Garner, in: Proceedings of the 4th IEEE International Conference on Cognitive Infocommunications, IEEE, Budapest, Hungary, pages 357-361, 2013
attachment
Using the Europarl corpus for cross-linguistic research, Bruno Cartoni, Sandrine Zufferey and Thomas Meyer, in: Belgian Journal of Linguistics(27):23 – 42, 2013
[URL]
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013
attachment
ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, Petr Motlicek, Philip N. Garner, Namhoon Kim and Jeongmi Cho, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7170-7174, 2013
attachment
[DOI]
FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, Petr Motlicek, Daniel Povey and Martin Karafiat, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7604-7608, 2013
attachment
[DOI]
Manifold Sparse Beamforming, Baran Gözcü, Afsaneh Asaei and Volkan Cevher, in: IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Saint Martin, France, pages 113-116, IEEE, 2013
attachment
[DOI]
Convexity in source separation: Models, geometry, and algorithms, Michael McCoy, Volkan Cevher, Quoc Tran Dinh, Afsaneh Asaei and Luca Baldassarre, in: IEEE Signal Processing Magazine, Special Issue on Source Separation and Applications, 2013
attachment
Inferring Mood in Ubiquitous Conversational Video, Dairazalia Sanchez-Cortes, Joan-Isaac Biel, Shiro Kumano, Junji Yamato, Kazuhiro Otsuka and Daniel Gatica-Perez, in: 12th International Conference on Mobile and Ubiquitous Multimedia, Luleå, Sweden, ACM Press, 2013
attachment
One of a Kind: Inferring Personality Impressions in Meetings, Oya Aran and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013
attachment
Cross-Domain Personality Prediction: From Video Blogs to Small Group Meetings, Oya Aran and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013
attachment
Multiclass Latent Locally Linear Support Vector Machines, Marco Fornoni, Barbara Caputo and Francesco Orabona, in: JMLR W&CP, Volume 29: Asian Conference on Machine Learning, Canberra, Australia, pages 229-244, 2013
attachment
[URL]
A Semi-Automated System for Accurate Gaze Coding in Natural Dyadic Interactions, Kenneth Alberto Funes Mora, Laurent Son Nguyen, Daniel Gatica-Perez and Jean-Marc Odobez, in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013
attachment
[DOI]
Unsupervised methods for activity analysis and detection of abnormal events, Remi Emonet and Jean-Marc Odobez, in: Intelligent Video Surveillance Systems (ISTE), Wiley, 2013
attachment
[DOI]
Idiap at MediaEval 2013: Search and Hyperlinking Task, Chidansh A. Bhatt, Nikolaos Pappas, Maryam Habibi and Andrei Popescu-Belis, in: MediaEval 2013 Workshop, Barcelona, Spain, CEUR-WS.org, 2013
attachment
Probabilistic Lexical Modeling and Unsupervised Training for Zero-Resourced ASR, Ramya Rasipuram, Marzieh Razavi and Mathew Magimai.-Doss, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013
attachment
Reservoir Boosting : Between Online and Offline Ensemble Learning, Leonidas Lefakis and Francois Fleuret, in: Proceedings of the international conference on Neural Information Processing Systems, 2013
attachment
Multi-Commodity Network Flow for Tracking Multiple People, Horesh Ben Shitrit, Jerome Berclaz, Francois Fleuret and Pascal Fua, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013
Interactive Multimodal Information Management: Shaping the Vision, Andrei Popescu-Belis and Hervé Bourlard, in: Interactive Multimodal Information Management, pages 1-17, EPFL Press, 2013
attachment
Automatic Staging of Audio with Emotions, Lakshmi Saheer and Milos Cernak, in: International Conference on Affective Computing and Intelligent Interaction, 2013
Understanding Factors in Emotion Perception, Lakshmi Saheer and Blaise Potard, in: ISCA Speech Synthesis Workshop, 2013
attachment