All publications
2010
A Comparative Study of MLP Front-ends for Mandarin ASR, , , , and , in: Proceedings of Interspeech, Japan, 2010 |
Improving Speech Processing through Social Signals: Automatic Speaker Segmentation of Political Debates using Role based Turn-Taking Patterns, and , in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010 |
Social Signal Processing: Understanding Nonverbal Communication in Social Interactions, and , in: Proceedings of Measuring Behavior 2010, Eindhoven (The Netherlands), 2010 |
Hierarchical Tandem Features for ASR in Mandarin, , and , Idiap-RR-39-2010 |
Automatic Time Skew Detection and Correction, , Idiap-RR-42-2010 |
A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, , and , in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010 |
Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, , and , in: Proc. of the 18th Intl. Conf. on Multimedia, Firenze, Italy, 2010 |
Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, , and , in: Proceedings of Interspeech, 2010 |
Fast Bounding Box Estimation based Face Detection, and , in: ECCV, Workshop on Face Detection: Where we are, and what next?, 2010 |
The TA2 Database - A Multi-Modal Database from Home Entertainment, , and , Idiap-RR-37-2010 |
Recognizing conversational context in group interaction using privacy-sensitive mobile sensors, , , and , in: Proceedings of International Conference on Mobile and Ubiquitous Multimedia, Limassol, Cyprus, 2010 |
Learning from Candidate Labeling Sets, and , in: Advances in Neural Information Processing Systems 23 (NIPS10), NIPS Foundation, Vancouver, B.C., Canada, MIT Press, 2010 |
By their apps you shall understand them: mining large-scale patterns of mobile phone usage, and , in: The 9th International Conference on Mobile and Ubiquitous Multimedia, 2010 |
Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling, and , in: 2010 IEEE Second International Conference on Social Computing, SIN Symposium, Minneapolis, Minnesota, USA, 2010 |
A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, , and , Idiap-RR-36-2010 |
Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, and , in: "Affective Computing and Interaction: Psychological, Cognitive and Neuroscientific Perspectives" by D. Gokcay & G. Yildirim (eds.), igi-global, 2010 |
The Voice of Personality: Mapping Nonverbal Vocal Behavior into Trait Attributions, , and , in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010 |
More than Words: Inference of Socially Relevant Information from Nonverbal Vocal Cues in Speech, , , and , in: Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces: Theoretical and Practical Issues, A. Esposito (ed.), LNCS, Springer, 2010 |
Automatic Role Recognition Based on Conversational and Prosodic Behaviour, , , and , in: Proceedings of the ACM International Conference on Multimedia, 2010 |
Measuring the gap between HMM-based ASR and TTS, , and , Idiap-RR-34-2010 |
Measuring the gap between HMM-based ASR and TTS, , and , in: IEEE Journal of Selected Topics in Signal Processing, in print, 2010 |
Personalising speech-to-speech translation in the EMIME project, , , , , , , , , , , , , , , , , , and , in: Proceedings of the ACL 2010 System Demonstrations, Association for Computational Linguistics, Uppsala, Sweden, 2010 |
Tuning-Robust Initialization Methods for Speaker Diarization, and , in: IEEE Transactions on Audio, Speech, and Language Processing, 18(8):2028-2037, 2010 |
A Multi Cue Discriminative Approach to Semantic Place Classification, , and , in: CLEF 2010 Notebook Papers/LABs/Workshops, 2010 |
The Wolf Corpus: Exploring group behaviour in a competitive role-playing game, and , in: ACM Multimedia, 2010 |
Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game, and , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2010 |
Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes, , and , in: BMVC 2010, Aberystwyth University, Aberystwyth, BMVA Press, 2010 |
Voices of Vlogging, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC, 2010 |
Towards rich mobile phone datasets: Lausanne data collection campaign, , , , and , in: Proc. ACM Int. Conf. on Pervasive Services (ICPS), Berlin, 2010 |
Visual Attention, Speaking Activity, and Group Conversational Analysis in Multi-Sensor Environments, and , in: H. Nakashima, J. Augusto, H. Aghajan (Eds.), Handbook of Ambient Intelligence and Smart Environments, Springer, 2010 |
Inferring competitive role patterns in reality TV show through nonverbal analysis, and , in: Multimedia Tools and Applications, Special issue on Social Media, 2010 |
Mining group nonverbal conversational patterns using probabilistic topic models, and , in: IEEE Transactions on Multimedia, 2010 |
Modeling and Understanding Flickr Communities through Topic-based Analysis, and , in: IEEE Transactions on Multimedia, 12(5), 2010 |
Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, and , in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010 |
Hands Free Audio Analysis from Home Entertainment, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
A Random Walk Framework to Compute Textual Semantic Similarity: a Unified Model for Three Benchmark Tasks, and , in: Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010), Carnegie Mellon University, Pittsburgh, PA, USA, 2010 |
Towards a standard for dialogue act annotation, , , , , , , , , , , and , in: 7th International Conference on Language Resources and Evaluation, Malta, 2010 |
The ACLD: Speech-based Just-in-Time Retrieval of Meeting Transcripts, Documents and Websites, , , and , in: ACM Multimedia Workshop on Searching Spontaneous Conversational Speech, Florence, Italy, 2010 |
The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites, , , and , Idiap-RR-26-2010 |
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project, , , , , , , , , , , , , , , , , , and , in: Proceedings of the 7th ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010 |
The AMIDA 2009 Meeting Transcription System, , , , , , , , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
English Spoken Term Detection in Multilingual Recordings, , and , in: Proceedings of Interspeech, ISCA, Makuhari, Japan, 2010 |
Sparse Component Analysis for Speech Recognition in Multi-Speaker Environment, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
Study of Jacobian Normalization for VTLN, , and , Idiap-RR-25-2010 |
Implementation of VTLN for Statistical Speech Synthesis, , , and , in: Proceedings of ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010 |
Floor Holder Detection and End of Speaker Turn Prediction in Meetings, , and , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, ISCA, 2010 |
Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, , and , Idiap-RR-20-2010 |
A Multi-class Classification Strategy for Fisher Scores: Application to Signer Independent Sign Language Recognition, and , in: Pattern Recognition, 43(5), 2010 |
Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, and , in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010 |
Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, and , Idiap-RR-17-2010 |