All publications sorted by recency
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 |
| Multimodal Signal Processing for Meetings: an Introduction, and , in: Multimodal Signal Processing: Human Interactions in Meetings, pages 1-11, Cambridge University Press, 2012 |
|
| BROADBAND BEAMPATTERN FOR MULTI-CHANNEL SPEECH ACQUISITION AND DISTANT SPEECH RECOGNITION, , and , Idiap-RR-39-2011 |
|
| Finding Audio-Visual Events in Informal Social Gatherings, , , and , in: IEEE/ACM 13th International Conference on Multimodal Interaction, 2011 |
|
| Engagement-based Multi-party Dialog with a Humanoid Robot, , , , , , and , in: Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 341-343, 2011 |
|
| Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, and , Idiap-RR-38-2011 |
|
| OM-2: An Online Multi-class Multi-kernel Learning Algorithm, , , , and , in: In Proceeding of CVPR 2010, Online Learning for Computer Vision Workshop, 2010 |
|
| Hand Gesture Analysis, , and , in: Computer Analysis of Human Behavior,, pages 125-149, Springer London, 2011 |
| Analysis of Group Conversations: Modeling Social Verticality, and , in: Computer Analysis of Human Behavior, pages 293-322, Springer London, 2011 |
| A Nonverbal Behavior Approach to Identify Emergent Leaders in Small Groups, , , and , in: IEEE Transactions on Multimedia, 14(3-2):816-832, 2012 |
[DOI] |
| Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis, , , , , , , , , , , , and , in: Computer Speech and Language, 2011 |
[DOI] [URL] |
| Environment - Application - Adaptation: a Community Architecture for Ambient Intelligence, , in: International Conference on Ambient Computing, Applications, Services and Technologies, 2011 |
| Speaker Diarization, and , in: Multimodal Signal Processing: Human Interactions in Meetings, Cambridge University Press, 2012 |
[URL] |
| Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus, and , in: Proceedings of Interspeech, 2011 |
|
| Analysis and Comparison of Recent MLP Features for LVCSR Systems, , and , in: Proceedings of Interspeech 2011, 2011 |
|
| Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features, , and , in: Speech Communication, 54(1), 2012 |
[DOI] |
| Transcribing Mandarin Broadcast Speech Using Multi-Layer Perceptron Acoustic Features, , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 19(8), 2011 |
[DOI] |
| Data-driven extraction of spectral-dynamics based posteriors, , in: Handbook of Natural Language Processing and Machine Translation Handbook of Natural Language Processing and Machine Translation, Springer, 2011 |
[URL] |
| Speaker Diarization of Meetings based on Speaker Role N-gram Models, , and , in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011 |
|
| MULTISTREAM SPEAKER DIARIZATION THROUGH INFORMATION BOTTLENECK SYSTEM OUTPUTS COMBINATION, , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2011 |
|
| Current trends in multilingual speech processing, , , , , , , , and , in: Sadhana, 36(5):885–915, 2011 |
[DOI] [URL] |
| Transcribing meetings with the AMIDA systems, , , , , , , , , and , in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):486--498, 2012 |
[DOI] [URL] |
| Overview of the CLEF 2009 medical image annotation track, , , , and , in: Workshop of the Cross-Language Evaluation Forum, Corfu, Greece, pages 85-93, Springer Berlin Heidelberg, 2009 |
[DOI] |
| Object Recognition using Visuo-Affordance Maps, , , and , in: International Conference on Intelligent Robots and Systems, Taipei, pages 1572-1578, IEEE, 2010 |
[DOI] |
| Towards a quantitative measure of rareness, and , in: DIRAC Workshop at the European Conference on Machine Learning, pages 129-136, Springer Berlin Heidelberg, 2010 |
[DOI] |
| Transferring Activities: Updating Human Behavior Analysis, , , , and , in: Visual Surveillance Workshop at ICCV, 2011 |
|
| Safety in Numbers: Learning Categories from Few Examples with Multi Model Knowledge Transfer, , and , in: Proceedings of IEEE Computer Vision and Pattern Recognition Conference, San Francisco, CA, pages 3081-3088, IEEE, 2010 |
[DOI] |
| Using Modality Replacement to Facilitate Communication between Visually and Hearing-Impaired People, , , , and , in: IEEE Multimedia, 18(2):26-37, 2011 |
[DOI] |
| Domain-specific language model adaptation: a case study, , and , Idiap-Com-01-2013 |
|
| VTLN-Based Rapid Cross-Lingual Adaptation for Statistical Parametric Speech Synthesis, , , and , Idiap-RR-12-2012 |
|
| Combining Vocal Tract Length Normalization with Linear Transformations in a Bayesian Framework, , , and , Idiap-RR-11-2012 |
|
| Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, , , and , in: International Joint Conference on Biometrics, 2011 |
| An Audio Visual Corpus for Emergent Leader Analysis, , and , in: Multimodal Corpora for Machine Learning: Taking Stock and Road mapping the Future, 2011 |
| Robustness of Group Delay Representations for Noisy Speech Signals, , and , Idiap-RR-36-2011 |
|
| Robustness of Group Delay Representations for Noisy Speech Signals, , and , in: IJST (Springer), 14(4), 2011 |
|
| Privacy-Sensitive Audio Features for Conversational Speech Processing, , Ecole Polytechnique Fédérale de Lausanne, 2011 |
|
| Human Interaction Discovery in Smartphone Proximity Networks, and , in: Personal and Ubiquitous Computing, 2012 |
|
| Joint Optimization of Hidden Conditional Random Fields and Non Linear Feature Extraction, , and , in: Proceedings of International Conference on Document Analysis and Recognition, 2011 |
| Mining Large-Scale Smartphone Data for Personality Studies, , and , in: Personal and Ubiquitous Computing, 2012 |
|
| Boosting Localized Features for Speaker and Speech Recognition, , Ecole Polytechnique Federale de Lausanne (EPFL), 2011 |
|
| Multi-camera Open Space Human Activity Discovery for Anomaly Detection, , and , in: 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011 |
|
| Information Bottleneck Features for HMM/GMM Speaker Diarization of Meetings Recordings, and , in: Interspeech, Florence, Italy, pages 953-956, 2011 |
|
| Continuous Speech Recognition using Boosted Binary Features, , and , Idiap-RR-35-2011 |
|
| Multimodal Cue Detection Engine for Orchestrated Entertainment, , , and , in: Proceedings International Conference on MultiMedia Modeling, Klagenfurt, Austria, 2012 |
|
| Multimodal Cue Detection Engine for Orchestrated Entertainment, , , and , Idiap-RR-34-2011 |
|
| Estimating Cohesion in Small Groups Using Audio-Visual Nonverbal Behavior, and , in: IEEE Trans. on Multimedia, Special Issue on Multimodal Affective Interaction, 12(6):563 - 575, 2010 |
|
| IMPROVING MICROPHONE ARRAY SPEECH RECOGNITION WITH COCHLEAR IMPLANT-LIKE SPECTRALLY REDUCED SPEECH, , and , Idiap-RR-40-2011 |
|
| Competition on Counter Measures to 2-D Facial Spoofing Attacks, , , , , , , , , , , , , , , , , , , , , , , and , in: Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA, 2011 |
|
| Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Hawaii, USA, pages 348-353, 2011 |
|
| Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , Idiap-RR-01-2012 |
|
| Comparing machines and humans on a visual categorization test, , , , , and , in: Proceedings of the National Academy of Sciences, 2011 |
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 |