Idiap Research Institute
All publications

2010
A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010
Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, Radu-Andrei Negoescu, Alexander Loui and Daniel Gatica-Perez, in: Proc. of the 18th Intl. Conf. on Multimedia, Firenze, Italy, 2010
Fast Bounding Box Estimation based Face Detection, Venkatesh Bala Subburaman and Sébastien Marcel, in: ECCV, Workshop on Face Detection: Where we are, and what next?, 2010
Recognizing conversational context in group interaction using privacy-sensitive mobile sensors, Dinesh Babu Jayagopi, Taemie Kim, Alex Pentland and Daniel Gatica-Perez, in: Proceedings of International Conference on Mobile and Ubiquitous Multimedia, Limassol, Cyprus, 2010
Learning from Candidate Labeling Sets, Jie Luo and Francesco Orabona, in: Advances in Neural Information Processing Systems 23 (NIPS10), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2010
By their apps you shall understand them: mining large-scale patterns of mobile phone usage, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: The 9th International Conference on Mobile and Ubiquitous Multimedia, 2010
Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling, Katayoun Farrahi and Daniel Gatica-Perez, in: 2010 IEEE Second International Conference on Social Computing, SIN Symposium, Minneapolis, Minnesota, USA, 2010
Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, Alessandro Vinciarelli and Gelareh Mohammadi, in: "Affective Computing and Interaction: Psychological, Cognitive and Neuroscientific Perspectives" by D. Gokcay & G. Yildirim (eds.), IGI Global, 2010
More than Words: Inference of Socially Relevant Information from Nonverbal Vocal Cues in Speech, Hugues Salamin, Gelareh Mohammadi, Khiet Truong and Alessandro Vinciarelli, in: Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces: Theoretical and Practical Issues, A. Esposito (ed.), LNCS, Springer, 2010
Measuring the gap between HMM-based ASR and TTS, John Dines, Junichi Yamagishi and Simon King, in: IEEE Journal of Selected Topics in Signal Processing, in print, 2010
Tuning-Robust Initialization Methods for Speaker Diarization, David Imseng and Gerald Friedland, in: IEEE Transactions on Audio, Speech, and Language Processing, 18(8):2028-2037, 2010
Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game, Gokul Chittaranjan and Hayley Hung, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2010
Voices of Vlogging, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC, 2010
Towards rich mobile phone datasets: Lausanne data collection campaign, N. Kiukkonen, J. Blom, O. Dousse, Daniel Gatica-Perez and J. K. Laurila, in: Proc. ACM Int. Conf. on Pervasive Services (ICPS), Berlin, 2010
Visual Attention, Speaking Activity, and Group Conversational Analysis in Multi-Sensor Environments, Daniel Gatica-Perez and Jean-Marc Odobez, in: H. Nakashima, J. Augusto, H. Aghajan (Eds.), Handbook of Ambient Intelligence and Smart Environments, Springer, 2010
Inferring competitive role patterns in reality TV show through nonverbal analysis, Bogdan Raducanu and Daniel Gatica-Perez, in: Multimedia Tools and Applications, Special issue on Social Media, 2010
Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010
Hands Free Audio Analysis from Home Entertainment, Danil Korchagin, Philip N. Garner and Petr Motlicek, in: Proceedings of Interspeech, Makuhari, Japan, 2010
A Random Walk Framework to Compute Textual Semantic Similarity: a Unified Model for Three Benchmark Tasks, Majid Yazdani and Andrei Popescu-Belis, in: Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010), Carnegie Mellon University, Pittsburgh, PA, USA, 2010
The ACLD: Speech-based Just-in-Time Retrieval of Meeting Transcripts, Documents and Websites, Andrei Popescu-Belis, Jonathan Kilgour, Alexandre Nanchen and Peter Poller, in: ACM Multimedia Workshop on Searching Spontaneous Conversational Speech, Florence, Italy, 2010
English Spoken Term Detection in Multilingual Recordings, Petr Motlicek, Fabio Valente and Philip N. Garner, in: Proceedings of Interspeech, ISCA, Makuhari, Japan, 2010
Implementation of VTLN for Statistical Speech Synthesis, Lakshmi Saheer, John Dines, Philip N. Garner and Hui Liang, in: Proceedings of ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010
Floor Holder Detection and End of Speaker Turn Prediction in Meetings, Alfred Dielmann, Giulia Garau and Hervé Bourlard, in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, ISCA, 2010
Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, Oya Aran and Daniel Gatica-Perez, in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010
Towards mixed language speech recognition systems, David Imseng, Hervé Bourlard and Mathew Magimai.-Doss, in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010