Publication list - Idiap Publications

Update cookies preferences

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 | 87 |

Human Behavior Understanding, Alessandro Vinciarelli, Springer Verlag, 2010

View-Based Appearance Model Online Learning for 3D Deformable Face Tracking, Stéphanie Lefèvre and Jean-Marc Odobez, in: Proc. Int. Conf. on Computer Vision Theory and Applications, Angers, 2010

attachment

Identifying Emergent Leadership in Small Groups using Nonverbal Communicative Cues, Dairazalia Sanchez-Cortes, Oya Aran, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proc. ICMI-MLMI '10 International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction, Beijing, ACM New York, NY, USA ©2010, 2010

attachment

[DOI]

Speech Enhancement using an Improved MMSE Estimator with Laplacian Prior, Mehdi Rashidinejad, Hamid Reza Abutalebi and Ali Akbar Tadaion, in: Proceedings of 5th International Symposium on Telecommunications, 2010

attachment

Single Channel Speech Separation with a Frame-based Pitch Range Estimation Method in Modulation Frequency, Azar Mahmoodzadeh, Hamid Reza Abutalebi, Hamid Soltanianzadeh and Hamid Sheikhzadeh, in: Proceedings of 5th International Symposium on Telecommunications, 2010

attachment

Determination of Pitch Range Based on Onset and Offset Analysis in Modulation Frequency Domain, Azar Mahmoodzadeh, Hamid Reza Abutalebi, Hamid Soltanianzadeh and Hamid Sheikhzadeh, in: Proceedings of 5th International Symposium on Telecommunications, 2010

attachment

Social Network Analysis for Automatic Role Recognition, Sarah Favre, Ecole Polytechnique Fédérale de Lausanne, 2010

attachment

Discovering Human Places of Interest from Multimodal Mobile Phone Data, Raul. Montoliu and Daniel Gatica-Perez, in: Proceedings of 9th International Conference on on Mobile and Ubiquitous Multimedia (MUM,',','), Limassol, Cyprus, 2010

attachment

Feature distribution modelling techniques for 3D face recognition, Chris McCool, Jordi Sanchez-Riera and Sébastien Marcel, in: Pattern Recognition Letters, 31:1324-1330, 2010

attachment

An Information Theoretic Approach to Speaker Diarization of Meeting Recordings, Deepu Vijayasenan, Ecole polytechnique fédérale de Lausanne, 2010

attachment

Hierarchical and Parallel Processing of Auditory and Modulation Frequencies for Automatic Speech Recognition, Fabio Valente, in: Speech Communication, 52(10):790-800, 2010

[DOI]

Multi-Stream Speech Recognition based on Dempster-Shafer Combination Rule, Fabio Valente, in: Speech Communication, 52(3):213-222, 2010

[DOI]

VARIATIONAL BAYESIAN SPEAKER DIARIZATION OF MEETING RECORDINGS, Fabio Valente, Petr Motlicek and Deepu Vijayasenan, in: Proceedings of ICASSP, 2010

attachment

A Comparative Study of MLP Front-ends for Mandarin ASR, Fabio Valente, Mathew Magimai-Doss, Christian Plahl, Ravuri Suman and Wang Wen, in: Proceedings of Interspeech, Japan, 2010

attachment

Improving Speech Processing trough Social Signals: Automatic Speaker Segmentation of Political Debates using Role based Turn-Taking Patterns., Fabio Valente and Alessandro Vinciarelli, in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010

attachment

Social Signal Processing: Understanding Nonverbal Communication in Social Interactions, Alessandro Vinciarelli and Fabio Valente, in: Proceedings of Measuring Behavior 2010, Eindhoven (The Netherlands), 2010

attachment

Hierarchical Tandem Features for ASR in Mandarin, Joel Praveen Pinto, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-39-2010

attachment

Automatic Time Skew Detection and Correction, Danil Korchagin, Idiap-RR-42-2010

attachment

A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010

attachment

Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, Radu-Andrei Negoescu, Alexander Loui and Daniel Gatica-Perez, in: Proc. of the 18th Intl. Conf. on Multimedia, Firenze, Italy, 2010

Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Proceedings of Interspeech, 2010

Fast Bounding Box Estimation based Face Detection, Venkatesh Bala Subburaman and Sébastien Marcel, in: ECCV, Workshop on Face Detection: Where we are, and what next?, 2010

attachment

[URL]

The TA2 Database - A Multi-Modal Database from Home Entertainment, Stefan Duffner, Petr Motlicek and Danil Korchagin, Idiap-RR-37-2010

attachment

Recognizing conversational context in group interaction using privacy-sensitive mobile sensors, Dinesh Babu Jayagopi, Taemie Kim, Alex Pentland and Daniel Gatica-Perez, in: Proceedings of International Conference on Mobile and Ubiquitous Multimedia, Limassol, Cyprus, 2010

attachment

Learning from Candidate Labeling Sets, Jie Luo and Francesco Orabona, in: Advances in Neural Information Processing Systems 23 (NIPS10), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2010

attachment

By their apps you shall understand them: mining large-scale patterns of mobile phone usage, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: The 9th International Conference on Mobile and Ubiquitous Multimedia, 2010

attachment

Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling, Katayoun Farrahi and Daniel Gatica-Perez, in: 2010 IEEE Second International Conference on Social Computing, SIN Symposium, Minneapolis, Minnesota, USA, 2010

attachment

A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, Idiap-RR-36-2010

attachment

Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, Alessandro Vinciarelli and Gelareh Mohammadi, in: "Affective Computing and Interaction: Psychological, Cognitive and Neuroscientific Perspectives" by D. Gokcay & G. Yildirim (eds.), igi-global, 2010

attachment

The Voice of Personality: Mapping Nonverbal Vocal Behavior into Trait Attributions, Gelareh Mohammadi, Alessandro Vinciarelli and Marcello Mortillaro, in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010

attachment

More than Words: Inference of Socially Relevant Information from Nonverbal Vocal Cues in Speech, Hugues Salamin, Gelareh Mohammadi, Khiet Truong and Alessandro Vinciarelli, in: Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces: Theoretical and Practical Issues, A.Esposito (ed.), LNCS,Springer, 2010

attachment

Automatic Role Recognition Based on Conversational and Prosodic Behaviour, Hugues Salamin, Khiet Truong, Gelareh Mohammadi and Alessandro Vinciarelli, in: Proceedings of the ACM International Conference on Multimedia, 2010

attachment

Measuring the gap between HMM-based ASR and TTS, John Dines, Junichi Yamagishi and Simon King, Idiap-RR-34-2010

attachment

Measuring the gap between HMM-based ASR and TTS, John Dines, Junichi Yamagishi and Simon King, in: IEEE Journal of Selected Topics in Signal Processing, in print, 2010

attachment

Personalising speech-to-speech translation in the EMIME project, Mikko Kurimo, William Byrne, John Dines, Philip N. Garner, Matthew Gibson, Yong Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Mirjam Wester, Yi-Jian Wu and Junichi Yamagishi, in: Proceedings of the ACL 2010 System Demonstrations, Association for Computational Linguistics, Uppsala, Sweden, 2010

[URL]

Tuning-Robust Initialization Methods for Speaker Diarization, David Imseng and Gerald Friedland, in: IEEE Transactions on Audio, Speech, and Language Processing, 18(8):2028-2037, 2010

attachment

[DOI]

A Multi Cue Discriminative Approach to Semantic Place Classification, Marco Fornoni, Jesus Martinez-Gomez and Barbara Caputo, in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010

attachment

The Wolf Corpus: Exploring group behaviour in a competitive role-playing game, Hayley Hung and Gokul Chittaranjan, in: ACM Multimedia, 2010

attachment

Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game, Gokul Chittaranjan and Hayley Hung, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

attachment

Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: BMVC 2010, Aberystwyth University, Aberystwyth, BMVA Press, 2010

attachment

Voices of Vlogging, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC, 2010

attachment

Towards rich mobile phone datasets: Lausanne data collection campaign, N. Kiukkonen, Blom J., O. Dousse, Daniel Gatica-Perez and J. K. Laurila, in: Proc. ACM Int. Conf. on Pervasive Services (ICPS,',','), Berlin., 2010

attachment

Visual Attention, Speaking Activity, and Group Conversational Analysis in Multi-Sensor Environments, Daniel Gatica-Perez and Jean-Marc Odobez, in: In H. Nakashima, J. Augusto, H. Aghajan (Eds.,',','), Handbook of Ambient Intelligence and Smart Environments, Springer, 2010

Inferring competitive role patterns in reality TV show through nonverbal analysis, Raducanu Bogdan and Daniel Gatica-Perez, in: Multimedia Tools and Applications, Special issue on Social Media, 2010

attachment

Mining group nonverbal conversational patterns using probabilistic topic models, Dinesh Babu Jayagopi and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 2010

attachment

Modeling and Understanding Flickr Communities through Topic-based Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 12(5), 2010

[DOI]

Introducing Crossmodal Biometrics:Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010

attachment

Hands Free Audio Analysis from Home Entertainment, Danil Korchagin, Philip N. Garner and Petr Motlicek, in: Proceedings of Interspeech, Makuhari, Japan, 2010

attachment

A Random Walk Framework to Compute Textual Semantic Similarity: a Unified Model for Three Benchmark Tasks, Majid Yazdani and Andrei Popescu-Belis, in: Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010 ), Carnegie Mellon University, Pittsburgh, PA, USA, 2010

attachment

Towards a standard for dialogue act annotation, Harry Bunt, Jan Alexandersson, Jean Carletta, Jae-Woong Choe, Alex Fang, Koiti Hasida, Kiyong Lee, Volha Petukhova, Andrei Popescu-Belis, Laurent Romary, Claudia Soria and Traum. David, in: 7th International Conference on Language Resources and Evaluation, Malta, 2010

attachment

[URL]

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 | 87 |

processing time: 0.0004 seconds.