All publications
2017
Sparse Pronunciation Codes for Perceptual Phonetic Information Assessment, , , and , in: Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017 |
2D Face Recognition: An Experimental and Reproducible Research Survey, , and , Idiap-RR-13-2017 |
Comparative Study on Sentence Boundary Prediction for German and English Broadcast News, , , , and , Idiap-RR-18-2017 |
Topic and Sentiment in Phrase-Based Statistical Machine Translation, , and , Idiap-RR-10-2017 |
Analyzing and Visualizing Ancient Maya Hieroglyphics Using Shape: from Computer Vision to Digital Humanities, , , and , in: Digital Scholarship in the Humanities, 32:179-194, 2017 |
Robust and Accurate 3D Head Pose Estimation through 3DMM and Online Head Model Reconstruction, , and , Idiap-RR-09-2017 |
Robust and Accurate 3D Head Pose Estimation through 3DMM and Online Head Model Reconstruction, , and , in: Proceedings of the 12th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2017), 2017 |
A Posterior-Based Multi-Stream Formulation for G2P Conversion, and , in: IEEE Signal Processing Letters, 2017 |
Object Detection with Active Sample Harvesting, , École Polytechnique Fédérale de Lausanne, 2017 |
Large-Scale Image Segmentation with Convolutional Networks, , Sciences et Techniques de l’Ingénieur (STI), 2017 |
Using Coreference Links to Improve Spanish-to-English Machine Translation, and , in: Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), Valencia, Spain, pages 30-40, Association for Computational Linguistics (ACL), 2017 |
Using Coreference Links to Improve Spanish-to-English Machine Translation, and , Idiap-RR-07-2017 |
Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, and , in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017 |
Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, and , Idiap-RR-06-2017 |
Multilingual Hierarchical Attention Networks for Document Classification, and , Idiap-RR-17-2017 |
Explicit Document Modeling through Weighted Multiple-Instance Learning, and , in: Journal of Artificial Intelligence Research (JAIR), 58:591-626, 2017 |
Multilingual Visual Sentiment Concept Clustering and Analysis, , , , , , and , in: International Journal of Multimedia Information Retrieval, 2017 |
Real-time Multiple Head Tracking Using Texture and Colour Cues, and , Idiap-RR-02-2017 |
Intonation Modelling for Speech Synthesis and Emphasis Preservation, , École Polytechnique Fédérale de Lausanne, 2017 |
The SIWIS French Speech Synthesis Database – Design and recording of a high quality French database for speech synthesis, , , and , Idiap-RR-03-2017 |
On the Impact of Non-modal Phonation On Phonological Features, , , , , , , , , , , , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
Multi-view Representation Learning Via GCCA for Multimodal Analysis of Parkinson's Disease, , , , , , , , , , , , , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
Towards Weakly Supervised Acoustic Subword Unit Discovery and Lexicon Development Using Hidden Markov Models, , and , Idiap-RR-15-2017 |
Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, , and , in: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, Association for Computational Linguistics, 2017 |
Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, , and , Idiap-RR-08-2017 |
Exploiting Sequence Information for Text-Dependent Speaker Verification, , , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, New Orleans, pages 5370-5374, 2017 |
Content Normalization for Text-Independent Speaker Verification, , , and , Idiap-RR-31-2017 |
Exploiting Sequence Information for Text-Dependent Speaker Verification, , , and , Idiap-RR-04-2017 |
Template-matching for Text-dependent Speaker Verification, , , and , Idiap-RR-32-2017 |
Intra-Class Covariance Adaptation in PLDA Back-Ends for Speaker Verification, , , and , Idiap-RR-05-2017 |
Low-rank and Sparse Soft Targets to Learn Better DNN Acoustic Models, , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
Long Term Spectral Statistics for Voice Presentation Attack Detection, , , and , Idiap-RR-11-2017 |
Maya Codical Glyph Segmentation: A Crowdsourcing Approach, , and , Idiap-RR-01-2017 |
Extracting Maya Glyphs from Degraded Ancient Documents via Image Segmentation, , and , in: Journal on Computing and Cultural Heritage, 10, 2017 |
SenseCityVity: Mobile Crowdsourcing, Urban Awareness, and Collective Action in Mexico, , , , , , , , , and , in: IEEE Pervasive Computing, Special Issue on Smart Cities, 16(2):44-53, 2017 |
BEAT: An Open-Source Web-Based Open-Science Platform, , and , Idiap-RR-14-2017 |
Rapport with Virtual Agents: What do Human Social Cues and Personality Explain?, , and , in: IEEE Transactions on Affective Computing, 8(3):382-395, 2017 |
A Sub-Quadratic Exact Medoid Algorithm, and , Idiap-RR-19-2017 |
Machine learning-based tools to model and to remove the off-target effect, , , and , in: Pattern Analysis and Applications, 20(1):87-100, 2017 |
Analysis of Small Groups, , and , in: Social Signal Processing, pages 349-367, Cambridge University Press. Editors J. Burgoon, N. Magnenat-Thalmann, M. Pantic, and A. Vinciarelli, 2017 |
From Research to Reality: Evaluation of a Single-Computer Real-Time LVCSR System for Speech-Based Retrieval, , , and , Idiap-RR-12-2017 |
2016
A MultiPath Network for Object Detection, , , , , , and , in: Proceedings of the British Machine Vision Conference, BMVA Press, 2016 |
Learning to Refine Object Segments, , , and , in: Computer Vision - ECCV 2016, Amsterdam, pages 75-91, Springer, 2016 |
Unsupervised Interpretable Pattern Discovery in Time Series Using Autoencoders, , , and , in: IAPR Int. Workshops on Structural and Syntactic Pattern Recognition (SSPR), 2016 |
CRF-Based Context Modeling for Person Identification in Broadcast Videos, , , and , in: Frontiers in ICT: Computer Image Analysis, 3, 2016 |
Manual and automatic labeling of discourse connectives for machine translation (Keynote paper), , in: TextLink: Structuring Discourse in Multilingual Europe (Handbook of the Second Action Conference), Budapest, Hungary, pages 16-20, 2016 |
Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction, , , , , , , , and , in: Proceedings of WMT 2016 (First Conference on Machine Translation), Association for Computational Linguistics, Berlin, Germany, pages 525–542, 2016 |
Comparing Two Strategies for Query Expansion in a News Monitoring System, and , in: Natural Language Processing and Information Systems: 21st International Conference on Applications of Natural Language to Information Systems, NLDB 2016, pages 267-275, Springer-Verlag, 2016 |
Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens, , , , , , and , in: Proceedings of the 18th ACM International Conference on Multimodal Interaction, Tokyo, Japan, pages 21-28, ACM, 2016 |
Speaker Diarization and Linking of Meeting Data, , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 24(11):1935-1945, 2016 |