All publications
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |
2012
Speaker Diarization of Meetings based on large TDOA feature vectors, and , in: Proceedings of International Conference on Acoustic, Speech and Signal Processing, 2012 |
|
Translating English Discourse Connectives into Arabic: a Corpus-based Analysis and an Evaluation Metric, and , in: Fourth Workshop on Computational Approaches to Arabic Script-based Languages at Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), 2012 |
|
An Agent-Based Focused Crawling Framework for Topic- and Genre-Related Web Document Discovery, , and , in: 24th IEEE International Conference on Tools with Artificial Intelligence, Athens, Greece, IEEE, 2012 |
[URL] |
We are not Contortionists: Coupled Adaptive Learning for Head and Body Orientation Estimation in Surveillance Video, and , in: IEEE International Conference on Computer Vision and Pattern Recognition, 2012 |
|
Linking Speaking and Looking Behavior Patterns with Group Composition, Perception, and Performance, , , , and , in: Proceedings of the International Conference on Multimodal Interaction (ICMI), Santa Monica, USA, 2012 |
|
Using Self-Context for Multimodal Detection of Head Nods in Face-to-Face Interactions, , and , in: Proceedings of the 14th ACM International Conference on Multimodal Interaction, 2012 |
|
Sequential Topic Models for Mining Recurrent Activities and their Relationships : Application to long term video recordings, , École Polytechnique Fédérale de Lausanne, 2012 |
|
The YouTube Lens: Crowdsourced Personality Impressions and Audiovisual Analysis of Vlogs, and , in: IEEE Transactions on Multimedia, 2012 |
|
Iterative Relevance Feedback with Adaptive Exploration/Exploitation Trade-off, and , in: Proceedings of the 21st ACM Conference on Information and Knowledge Management, pages 1323-1331, 2012 |
|
Machine Translation of Labeled Discourse Connectives, , , and , in: Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), pages 10, 2012 |
|
StressSense: Detecting Stress in Unconstrained Acoustic Environments using Smartphones, , , , , , , and , in: Ubicomp'12, Pittsburgh, 2012 |
|
Using Sparse Classification Outputs as Feature Observations for Noise Robust ASR, , , , , and , in: Proceedings of Interspeech, 2012 |
|
Combination of Sparse Classification and Multilayer Perceptron for Noise Robust ASR, , , , , and , in: Proceedings of Interspeech, 2012 |
|
Baseline System for Automatic Speech Recognition with French GlobalPhone Database, and , Idiap-RR-26-2012 |
|
Reading Companion: The Technical and Social Design of an Automated Reading Tutor, , , , , and , in: Workshop on Child, Computer and Interaction, Portland, Oregon, U.S.A., 2012 |
|
Discovering Places of Interest in Everyday Life from Smartphone Data, , and , in: Multimedia Tools and Applications, 2012 |
|
The Mobile Data Challenge: Big Data for Mobile Computing Research, , , , , , , , and , in: Pervasive Computing, Newcastle, 2012 |
|
The Vernissage Corpus: A Multimodal Human-Robot-Interaction Dataset, , , , , , , , , and , Idiap-RR-33-2012 |
|
Boosting localized binary features for speech recognition, , and , in: Symposium on Machine Learning in Speech and Language Processing (MLSLP), 2012 |
|
Indoor Scene Recognition using Task and Saliency-driven Feature Pooling, and , in: Proceedings of the British Machine Vision Conference, Guildford, UK, 2012 |
|
Leveraging over prior knowledge for online learning of visual categories, , , and , in: Proceedings of the British Machine Vision Conference, 2012 |
|
Contextual Conditional Models for Smartphone-based Human Mobility Prediction, and , in: Proceedings of the 14th ACM International Conference on Ubiquitous Computing, 2012 |
|
Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, and , in: RecSys, Recommendation Utility Evaluation (RUE 2012), Dublin, Ireland, pages 15-20, 2012 |
|
Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, , , and , Idiap-RR-23-2012 |
|
Application of Subspace Gaussian Mixture Models in Contrastive Acoustic Scenarios, , , and , Idiap-RR-20-2012 |
|
Building the NinaPro Database: a Resource for the Biorobotics Community, , , , , , , , and , in: Proceedings of the Fourth IEEE RAS/EMBS International Conference on Biomedical Robotics and Biomechatronics, 2012 |
|
Who Wants To Be A Millionaire?, , , and , Idiap-Com-03-2012 |
|
A Multiple Hypothesis Gaussian Mixture Filter for Acoustic Source Localization and Tracking, , and , in: 13th International Workshop on Acoustic Signal Enhancement, pages 233-236, 2012 |
|
Joint Detection and Localization of Multiple Speakers using a Probabilistic Interpretation of the Steered Response Power, , , and , in: Statistical and Perceptual Audition Workshop, 2012 |
|
A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, , , and , in: 20th European Signal Processing Conference, 2012 |
|
Integrating Posterior Features and Self-Organizing Maps for Isolated Word Recognition without Dynamic Programming, and , Idiap-RR-17-2012 |
|
The Good, the Bad, and the Angry: Analyzing Crowdsourced Impressions of Vloggers, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, 2012 |
|
From Speech to Personality: Mapping Voice Quality and Intonation into Personality Differences, , , and , in: in Proceedings of ACM Multimedia 2012, 2012 |
|
Microphone Array Beampattern Characterization for Hands-free Speech Applications, , and , in: IEEE 7th Sensor Array and Multichannel Signal Processing Workshop(SAM), Hoboken, NJ, USA, pages 473-476, 2012 |
|
Sparsity in Topic Models, , and , in: Practical Applications of Sparse Modeling: Biology, Signal Processing and Beyond, MIT Press, 2012 |
|
Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, , , , and , Idiap-RR-18-2012 |
|
Template-based ASR using Posterior features and synthetic references: comparing different TTS systems, , and , in: SAPA-SCALE Conference, International Speech Communication Association, 2012 |
|
Gaze Estimation From Multimodal Kinect Data, and , in: IEEE Conference in Computer Vision and Pattern Recognition, Workshop on Gesture Recognition, Providence, RI, USA, 2012 |
[DOI] |
On Speaker-Independent Personality Perception and Prediction from Speech, , , , , and , in: in Proceedings of INTERSPEECH 2012, 2012 |
|
Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, and , in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012 |
|
Supervised and unsupervised Web-based language model domain adaptation, , , and , in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012 |
|
Structured Sparse Coding for Microphone Array Location Calibration, , , and , in: SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition, 2012 |
|
Using self-context for multimodal detection of head nods in face-to-face interactions, , and , Idiap-RR-27-2012 |
|
On the Effectiveness of Local Binary Patterns in Face Anti-spoofing, , and , Idiap-RR-19-2012 |
|
Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, and , in: Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech), Portland, Oregon, 2012 |
|
Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, and , Idiap-RR-16-2012 |
|
Integrating Language Identification to improve Multilingual Speech Recognition, , Idiap-RR-24-2012 |
|
Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation, and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
|
Emergent leaders through looking and speaking: from audio-visual data to multimodal recognition, , , , and , in: Journal on Multimodal User Interfaces, 2012 |
|
Automatic detection of conflict escalation in spoken conversations, , and , in: INTERSPEECH, ISCA, Portland, Oregon, USA., 2012 |
|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |