All publications
2013
Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target, , and , in: Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013 |
Proceedings of the ACL Workshop on Discourse in Machine Translation (DiscoMT 2013), , , and , Association for Computational Linguistics, 2013 |
On Modeling Context-dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, , and , Idiap-RR-43-2013 |
3D Head Pose and Gaze Tracking and Their Application to Diverse Multimodal Tasks, , in: Doctoral consortium of the 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, 2013 |
Context Aware Addressee Estimation for Human Robot Interaction, , , and , in: Proceedings of the 6th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Gaze in Multimodal Interaction, 2013 |
Leveraging the robot dialog state for visual focus of attention recognition, , , , and , in: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, 2013 |
Multimodal Analysis of Body Communication Cues in Employment Interviews, , , and , in: 15th ACM International Conference on Multimodal Interaction Proceedings, 2013 |
Evaluating Intra- and Crosslingual Adaptation for Non-native Speech Recognition in a Bilingual Environment, and , in: Proceedings of the 4th IEEE International Conference on Cognitive Infocommunications, IEEE, Budapest, Hungary, pages 357-361, 2013 |
Is Deep Learning Really Necessary for Word Embeddings?, , and , Idiap-RR-44-2013 |
Introduction to the Special Issue on Learning Semantics, , , , , and , in: Machine Learning, 2013 |
Recurrent Convolutional Neural Networks for Scene Labeling, and , Idiap-RR-41-2013 |
End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks, , and , Idiap-RR-40-2013 |
Using the Europarl corpus for cross-linguistic research, , and , in: Belgian Journal of Linguistics, 27:23-42, 2013 |
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), Lyon, France, pages 510-514, ISCA, 2013 |
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , Idiap-RR-39-2013 |
Accent Adaptation Using Subspace Gaussian Mixture Models, , , and , in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7170-7174, 2013 |
Accent Adaptation Using Subspace Gaussian Mixture Models, , , and , Idiap-RR-38-2013 |
Feature and Score Level Combination of Subspace Gaussians in LVCSR Task, , and , in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7604-7608, 2013 |
Feature and Score Level Combination of Subspace Gaussians in LVCSR Task, , and , Idiap-RR-37-2013 |
Manifold Sparse Beamforming, , and , in: IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Saint Martin, France, pages 113-116, IEEE, 2013 |
Convexity in source separation: Models, geometry, and algorithms, , , , and , in: IEEE Signal Processing Magazine, Special Issue on Source Separation and Applications, 2013 |
Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, , and , Idiap-RR-31-2013 |
An Open-source State-of-the-art Toolbox for Broadcast News Diarization, , , , , and , Idiap-RR-33-2013 |
Gesture control interface for immersive panoramic displays, , , , , , and , in: Multimedia Tools and Applications, 1380-7501:1-27, 2013 |
Creative Applications of Human Behavior Understanding, , , and , in: Human Behavior Understanding, pages 1-14, 2013 |
Inferring Mood in Ubiquitous Conversational Video, , , , , and , in: 12th International Conference on Mobile and Ubiquitous Multimedia, Luleå, Sweden, ACM Press, 2013 |
One of a Kind: Inferring Personality Impressions in Meetings, and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
Cross-Domain Personality Prediction: From Video Blogs to Small Group Meetings, and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
Multiclass Latent Locally Linear Support Vector Machines, , and , in: JMLR W&CP, Volume 29: Asian Conference on Machine Learning, Canberra, Australia, pages 229-244, 2013 |
A Semi-Automated System for Accurate Gaze Coding in Natural Dyadic Interactions, , , and , in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013 |
Unsupervised methods for activity analysis and detection of abnormal events, and , in: Intelligent Video Surveillance Systems (ISTE), Wiley, 2013 |
Observation of Vehicle Axles Through Pass-by Noise: A Strategy of Microphone Array Design, , , , and , in: IEEE Trans. on Intelligent Transportation Systems, 2013 |
On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, , , and , in: Biometric Technologies in Forensic Science, Nijmegen, The Netherlands, 2013 |
Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project, , , , , , , , and , in: Workshop on Speech, Language and Audio in Multimedia, 2013 |
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, , and , in: Proceedings of Interspeech, 2013 |
Idiap at MediaEval 2013: Search and Hyperlinking Task, , , and , in: MediaEval 2013 Workshop, Barcelona, Spain, CEUR-WS.org, 2013 |
Probabilistic Lexical Modeling and Unsupervised Training for Zero-Resourced ASR, , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013 |
Reservoir Boosting : Between Online and Offline Ensemble Learning, and , in: Proceedings of the international conference on Neural Information Processing Systems, 2013 |
Multi-Commodity Network Flow for Tracking Multiple People, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013 |
Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition, , , and , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013 |
Interactive Multimodal Information Management, and , EPFL Press, 2013 |
Interactive Multimodal Information Management: Shaping the Vision, and , in: Interactive Multimodal Information Management, pages 1-17, EPFL Press, 2013 |
Real-Time Audio-Visual Analysis for Multiperson Videoconferencing, , , , , , , , and , in: Advances in Multimedia, 2013:21, 2013 |
Automatic Staging of Audio with Emotions, and , in: International Conference on Affective Computing and Intelligent Interaction, 2013 |
Understanding Factors in Emotion Perception, and , Idiap-RR-28-2013 |
Understanding Factors in Emotion Perception, and , in: ISCA Speech Synthesis Workshop, 2013 |
Inferring social activities with mobile sensor networks, , , , and , in: 15th ACM International Conference on Multimodal Interaction, 2013 |
From Big Smartphone Data to Worldwide Research: The Mobile Data Challenge, , , , , , , and , in: Pervasive and Mobile Computing, 9(6):752–771, 2013 |
Multi-factor Segmentation for Topic Visualization and Recommendation: the MUST-VIS System, , , , , , , and , in: Proceedings of the 21st ACM International Conference on Multimedia, Barcelona, Spain, pages 365-368, ACM, 2013 |
Revisiting the Generality of the Rank-based Human Mobility Model, and , in: Proceedings of the 2013 ACM Conference on Pervasive and Ubiquitous Computing Adjunct Publication, Zurich, Switzerland, pages 1209-1218, ACM, 2013 |