All publications
| 1-50 | 51-100 | 101-150 | 151-200 | 201-250 | 251-300 | 301-350 | 351-400 | 401-450 | 451-500 | 501-550 | 551-600 | 601-650 | 651-700 | 701-750 | 751-800 | 801-850 | 851-900 | 901-950 | 951-1000 | 1001-1050 | 1051-1100 | 1101-1150 | 1151-1200 | 1201-1250 | 1251-1300 | 1301-1350 | 1351-1400 | 1401-1450 | 1451-1500 | 1501-1550 | 1551-1600 | 1601-1650 | 1651-1700 | 1701-1734 |
2010
| Voices of Vlogging, and , in: Proc. AAAI Int. Conf. on Weblogs and Social Media (ICWSM), Washington DC, 2010 |
|
| Towards rich mobile phone datasets: Lausanne data collection campaign, , , , and , in: Proc. ACM Int. Conf. on Pervasive Services (ICPS), Berlin., 2010 |
|
| Visual Attention, Speaking Activity, and Group Conversational Analysis in Multi-Sensor Environments, and , in: In H. Nakashima, J. Augusto, H. Aghajan (Eds.), Handbook of Ambient Intelligence and Smart Environments, Springer, 2010 |
| Inferring competitive role patterns in reality TV show through nonverbal analysis, and , in: Multimedia Tools and Applications, Special issue on Social Media, 2010 |
|
| Estimating Dominance in Multi-Party Meetings Using Speaker Diarization, , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 2010 |
|
| Mining group nonverbal conversational patterns using probabilistic topic models, and , in: IEEE Transactions on Multimedia, 2010 |
|
| Modeling and Understanding Flickr Communities through Topic-based Analysis, and , in: IEEE Transactions on Multimedia, 12(5):399-416, 2010 |
[DOI] |
| Introducing Crossmodal Biometrics:Person Identification from Distinct Audio & Visual Streams, and , in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010 |
|
| Hands Free Audio Analysis from Home Entertainment, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
| A Random Walk Framework to Compute Textual Semantic Similarity: a Unified Model for Three Benchmark Tasks, and , in: Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010), Carnegie Mellon University, Pittsburgh, PA, USA, 2010 |
|
| Towards a standard for dialogue act annotation, , , , , , , , , , , and , in: 7th International Conference on Language Resources and Evaluation, Malta, 2010 |
[URL] |
| The ACLD: Speech-based Just-in-Time Retrieval of Meeting Transcripts, Documents and Websites, , , and , in: ACM Multimedia Workshop on Searching Spontaneous Conversational Speech, Florence, Italy, 2010 |
|
| The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites, , , and , Idiap-RR-26-2010 |
|
| Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project, , , , , , , , , , , , , , , , , , and , in: Proceedings of the 7th ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010 |
|
| The AMIDA 2009 Meeting Transcription System, , , , , , , , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
| English Spoken Term Detection in Multilingual Recordings, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010, ISCA, Makuhari, Japan, 2010 |
|
| Sparse Component Analysis for Speech Recognition in Multi-Speaker Environment, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
| Study of Jacobian Normalization for VTLN, , and , Idiap-RR-25-2010 |
|
| Implementation of VTLN for Statistical Speech Synthesis, , , and , in: Proceedings of ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010 |
|
| Floor Holder Detection and End of Speaker Turn Prediction in Meetings, , and , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, ISCA, 2010 |
|
| Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, , and , Idiap-RR-20-2010 |
|
| A Multi-class Classification Strategy for Fisher Scores: Application to Signer Independent Sign Language Recognition, and , in: Pattern Recognition, 43(5):1776-1788, 2010 |
[DOI] |
| Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, and , in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010, Istanbul, Turkey, 2010 |
|
| Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, and , Idiap-RR-17-2010 |
|
| An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
| An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, and , Idiap-RR-16-2010 |
|
| Towards mixed language speech recognition systems, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
| Hierarchical Multilayer Perceptron based Language Identification, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
| Tracter: A Lightweight Dataflow Framework, and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
| Audio–Visual Synchronisation for Speaker Diarisation, , and , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, 2010 |
|
| An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, , and , Idiap-RR-22-2010 |
|
| Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, and , in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, International Association for Pattern Recognition (IAPR), Istanbul, Turkey, 2010 |
|
| Flickr Groups: Multimedia Communities for Multimedia Analysis, and , Idiap-RR-18-2010 |
|
| Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, , and , Idiap-RR-23-2010 |
|
| Estimating Cohesion in Small Groups using Audio-Visual Nonverbal Behavior, and , Idiap-RR-12-2010 |
|
| Mining Human Location-Routines using a Multi-Level Topic Model, and , Idiap-RR-28-2010 |
|
| Probabilistic Mining of Socio-Geographic Routines from Mobile Phone Data, and , in: IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 4(4):746-755, 2010 |
|
| Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, and , Idiap-RR-29-2010 |
|
| English Spoken Term Detection in Multilingual Recordings, , and , Idiap-RR-21-2010 |
|
| On the Results of the First Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation, , , , , and , Idiap-RR-30-2010 |
|
| MOBIO: Mobile Biometric Face and Speaker Authentication, , , , , , , , and , Idiap-RR-31-2010 |
|
| Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation, , , , and , Idiap-RR-09-2010 |
|
| Hierarchical Multilayer Perceptron based Language Identification, , and , Idiap-RR-14-2010 |
|
| Towards mixed language speech recognition systems, , and , Idiap-RR-15-2010 |
|
| Tracter: A Lightweight Dataflow Framework, and , Idiap-RR-10-2010 |
|
| Hands Free Audio Analysis from Home Entertainment, , and , Idiap-RR-27-2010 |
|
| The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, , and , Idiap-RR-08-2010 |
|
| Online-Batch Strongly Convex Multi Kernel Learning, , and , Idiap-RR-07-2010 |
|
| OM-2: An Online Multi-class Multi-kernel Learning Algorithm, , , , and , Idiap-RR-06-2010 |
|
| Implementation of VTLN for Statistical Speech Synthesis, , , and , Idiap-RR-32-2010 |
|
| 1-50 | 51-100 | 101-150 | 151-200 | 201-250 | 251-300 | 301-350 | 351-400 | 401-450 | 451-500 | 501-550 | 551-600 | 601-650 | 651-700 | 701-750 | 751-800 | 801-850 | 851-900 | 901-950 | 951-1000 | 1001-1050 | 1051-1100 | 1101-1150 | 1151-1200 | 1201-1250 | 1251-1300 | 1301-1350 | 1351-1400 | 1401-1450 | 1451-1500 | 1501-1550 | 1551-1600 | 1601-1650 | 1651-1700 | 1701-1734 |
