All publications
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 |
2012
The Good, the Bad, and the Angry: Analyzing Crowdsourced Impressions of Vloggers, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, 2012 |
![]() |
From Speech to Personality: Mapping Voice Quality and Intonation into Personality Differences, , , and , in: in Proceedings of ACM Multimedia 2012, 2012 |
![]() |
Microphone Array Beampattern Characterization for Hands-free Speech Applications, , and , in: IEEE 7th Sensor Array and Multichannel Signal Processing Workshop(SAM), Hoboken, NJ, USA, pages 473-476, 2012 |
![]() |
Sparsity in Topic Models, , and , in: Practical Applications of Sparse Modeling: Biology, Signal Processing and Beyond, MIT Press, 2012 |
![]() |
Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, , , , and , Idiap-RR-18-2012 |
![]() |
Template-based ASR using Posterior features and synthetic references: comparing different TTS systems, , and , in: SAPA-SCALE Conference, International Speech Communication Association, 2012 |
![]() |
Gaze Estimation From Multimodal Kinect Data, and , in: IEEE Conference in Computer Vision and Pattern Recognition, Workshop on Gesture Recognition, Providence, RI, USA, 2012 |
![]() [DOI] |
On Speaker-Independent Personality Perception and Prediction from Speech, , , , , and , in: in Proceedings of INTERSPEECH 2012, 2012 |
![]() |
Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, and , in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012 |
![]() |
Supervised and unsupervised Web-based language model domain adaptation, , , and , in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012 |
![]() |
Structured Sparse Coding for Microphone Array Location Calibration, , , and , in: SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition, 2012 |
![]() |
Using self-context for multimodal detection of head nods in face-to-face interactions, , and , Idiap-RR-27-2012 |
![]() |
On the Effectiveness of Local Binary Patterns in Face Anti-spoofing, , and , Idiap-RR-19-2012 |
![]() |
Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, and , in: Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech), Portland, Oregon, 2012 |
![]() |
Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, and , Idiap-RR-16-2012 |
![]() |
Integrating Language Identification to improve Multilingual Speech Recognition, , Idiap-RR-24-2012 |
![]() |
Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation, and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
![]() |
Emergent leaders through looking and speaking: from audio-visual data to multimodal recognition, , , , and , in: Journal on Multimodal User Interfaces, 2012 |
![]() |
Automatic detection of conflict escalation in spoken conversations, , and , in: INTERSPEECH, ISCA, Portland, Oregon, USA., 2012 |
![]() |
Speaker diarization of overlapping speech based on silence distribution in meeting recordings, and , in: INTERSPEECH, Portland, Oregon, USA, 2012 |
![]() |
Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, and , Idiap-RR-14-2012 |
![]() |
Extracting Informative Textual Parts from Web Pages Containing User-Generated Content, , and , in: 12th International Conference on Knowledge Management and Knowledge Technologies, ACM ICPS, Graz, Austria, pages 4:1--4:8, ACM, 2012 |
![]() [URL] |
Robust triphone mapping for acoustic modeling, , and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
![]() |
Synthetic References for Template-based ASR using Posterior Features, , and , in: Proceedings of Interspeech, Portland, Oregon, USA, 2012 |
![]() |
Comparing different acoustic modeling techniques for multilingual boosting, , , , and , in: Proceedings of Interspeech, Portland, Oregon, 2012 |
![]() |
Phase AutoCorrelation (PAC) features for noise robust speech recognition, , , and , in: Speech Communication, 54(7):867–880, 2012 |
[DOI] |
A Survey on Language Modeling using Neural Networks, and , Idiap-RR-32-2012 |
![]() |
Bob: a free signal processing and machine learning toolbox for researchers, , , , , and , Idiap-RR-25-2012 |
![]() |
Assessing Sparse Coding Methods for Contextual Shape Indexing of Maya Hieroglyphs, , and , in: Journal of Multimedia, 7(2):179--192, 2012 |
![]() |
ScoreToolKit Documentation, and , Idiap-Com-02-2012 |
![]() |
Bridging the Past, Present and Future: Modeling Scene Activities From Event Relationships and Global Rules, , and , in: IEEE Conference on Computer Vision and Pattern Recognition, 2012, Providence, Rhode Island, USA, 2012 |
Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, and , Idiap-RR-21-2012 |
![]() |
Supervised and unsupervised Web-based language model domain adaptation, , , and , Idiap-RR-22-2012 |
![]() |
Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, , , and , in: Actes de la conference conjointe JEP-TALN-RECITAL 2012, Grenoble, France, pages 193-200, ATALA/AFCP, 2012 |
![]() |
Audiovisual Diarization Of People In Video Content, , and , in: Multimedia Tools and Applications, 2012 |
![]() |
Combining transcription-based and acoustic-based speaker identifications for broadcast news, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012 |
![]() |
Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, , , , , , , , , , , , , and , in: IEEE ICME Workshop on Hot Topics in Mobile Multimedia, 2012 |
![]() |
Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, , , , , , , , , , , , , and , Idiap-RR-13-2012 |
![]() |
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns, , , , and , in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), pages 5, 2012 |
![]() |
Vocal Tract Length Normalization for Statistical Parametric Speech Synthesis, , and , in: IEEE Transactions on Audio, Speech and Language Processing, 2012 |
![]() |
COMBINING VOCAL TRACT LENGTH NORMALIZATION WITH HIERARCHIAL LINEAR TRANSFORMATIONS, , , and , in: Proceedings in International conference on Speech and Signal processing, Kyoto, Japan, pages 4493-4496, IEEE SPS (ICASSP), 2012 |
![]() |
On the Challenge of Classifying 52 Hand Movements from Surface Electromyography, , and , in: 34th Annual Conference of the IEEE Engineering in Medicine & Biology Society, 2012 |
![]() |
Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, , and , Idiap-RR-15-2012 |
![]() |
Alternative search techniques for face detection using location estimation and binary features, , ECOLE POLYTECHNIQUE FEDERALE DE LAUSANNE, 2012 |
![]() |
Automatic Speaker Role Labeling in AMI Meetings: Recognition of Formal and Social Roles, and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012, 2012 |
![]() |
A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, , , and , Idiap-RR-10-2012 |
![]() |
Extracting Mobile Behavioral Patterns with the Distant N-Gram Topic Model, and , in: Proceedings of the IEEE International Symposium on Wearable Computers, Newcastle, 2012 |
![]() |
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies, and , in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), Istanbul, TR, pages 6, 2012 |
![]() |
Using Sense-labeled Discourse Connectives for Statistical Machine Translation, and , in: Proceedings of the EACL2012 Workshop on Hybrid Approaches to Machine Translation (HyTra), Avignon, FR, pages 129--138, 2012 |
![]() |
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 |