All publications sorted by author
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 |
B
Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, and , Idiap-RR-11-1998 |
|
INtegrating SPEech acoustic and linguistic Constraints: Baseline System Development, , , and , Idiap-RR-21-1999 |
|
Reconnaissance de la parole dans le bruit après renforcement fondé sur l'harmonicité, and , in: Proceedings of JEP'2000, no IDIAP RR, see RESPITE www, 2000 |
A measure of speech and pitch reliability from voicing, and , in: Proc. Int. Joint Conf. on Artificial Intelligence (IJCAI), Scandinavian AI Society, 1999 |
A front-end using the harmonicity cue for speech enhancement in loud noise, , and , in: Int. Conf. on Spoken Language Processing (ICSLP), 2000 |
Interfacing of CASA and partial recognition based on a multistream technique, , , and , in: ICSLP'98, Sidney, 1998 |
|
A new SNR-feature mapping for robust multistream speech recognition, and , in: Proc. Int. Congress on Phonetic Sciences (ICPhS), 1999 |
Experimental evaluation of text-dependent speaker verification on laboratory and field test databases in the M2VTS project, , , and , in: Proceedings of the European Conference on Speech Communication and Technology, 1999 |
|
Reconnaissance de caractères manuscrits à l'aide de réseaux neuromimétiques, , Idiap-RR-18-1997 |
|
Optimisation de réseaux de neurones, , {EPFL}, Lausanne, Switzerland, 1995 |
Hierarchical Multi-task learning framework for Isometric-Speech Language Translation, , , and , in: ACL, 2022 |
|
DeepCon: An End-to-End Multilingual Toolkit for Automatic Minuting of Multi-Party Dialogues, , , and , in: Special Interest Group on Discourse and Dialogue (SIGDIAL 2022), 2022 |
|
An End-to-End Multilingual System for Automatic Minuting of Multi-Party Dialogues, , , and , in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36) , In proceedings of ACL Anthology, 2022 |
|
Multimodal Reranking of Content-based Recommendations for Hyperlinking Video Snippets, , , and , in: ACM International Conference on Multimedia Retrieval, 2014 |
|
Idiap at MediaEval 2013: Search and Hyperlinking Task, , , and , in: MediaEval 2013 Workshop, Barcelona, Spain, CEUR-WS.org, 2013 |
|
Topic-Level Extractive Summarization of Lectures and Meetings Using a Snippet Similarity Graph, and , Idiap-RR-09-2014 |
|
Audiovisual Summarization of Lectures and Meetings Using a Segment Similarity Graph, , and , in: Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR), ACM, New York, NY, ACM Press, 2016 |
Multi-factor Segmentation for Topic Visualization and Recommendation: the MUST-VIS System, , , , , , , and , in: Proceedings of the 21st ACM International Conference on Multimedia, Barcelona, Spain, pages 365-368, ACM, 2013 |
[DOI] [URL] |
CONTEXTUAL BIASING METHODS FOR IMPROVING RARE WORD DETECTION IN AUTOMATIC SPEECH RECOGNITION, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024 |
|
Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training, , , , , , and , in: Proc. 13th SESAR Innovation Days, Seville, Spain, 2023 |
[DOI] [URL] |
Vascular Biometrics Experiments on Candy -- A New Contactless Finger-Vein Dataset, , , , and , in: Proceedings of the International Conference on Pattern Recognition (ICPR), Calcutta (India), 2024 |
|
What you can't see can help you -- extended-range imaging for 3D-mask presentation attack detection, and , in: Proceedings of the 16th International Conference on Biometrics Special Interest Group., Darmstadt (Germany), Gesellschaft fuer Informatik e.V. (GI), 2017 |
|
Recent Advances in Face Presentation Attack Detection, , , and , in: Handbook of Biometric Anti-Spoofing, Springer, 2019 |
[URL] |
Spoofing Deep Face Recognition With Custom Silicone Masks, , and , in: Proceedings of BTAS2018, 2018 |
|
HMIST: Hierarchical Multilingual Isometric Speech Translation using Multi-Task Learning Framework for Automatic Dubbing, , , and , in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
Team Innovators at SemEval-2022 for Task 8: Multi-Task Training with Hyperpartisan and Semantic Relation for Multi-Lingual News Article Similarity, , , , and , in: Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), NAACL 2022, 2022 |
|
Mining Conversational Social Video, , EPFL, 2013 |
|
You Are Known by How You Vlog: Personality Impressions and Nonverbal Behavior in YouTube, , and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, Barcelona, 2011 |
|
Mining Crowdsourced First Impressions in Online Social Video, and , in: IEEE Transactions on Multimedia, 16(7), 2014 |
|
The Good, the Bad, and the Angry: Analyzing Crowdsourced Impressions of Vloggers, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, 2012 |
|
The YouTube Lens: Crowdsourced Personality Impressions and Audiovisual Analysis of Vlogs, and , in: IEEE Transactions on Multimedia, 2012 |
|
Call me Guru: user categories and large-scale behavior in YouTube, and , in: Social Media Computing, Springer, 2011 |
|
VlogSense: Conversational Behavior and Social Attention in YouTube, and , in: Transactions on Multimedia Computing, Communications and Applications, 2011 |
|
Voices of Vlogging, and , in: Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC, 2010 |
|
Vlogcast Yourself: Nonverbal Behavior and Attention in Social Media, and , in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2010 |
|
Wearing a YouTube hat: directors, comedians, gurus, and user aggregated behavior, and , in: Proceedings of the 17th ACM International Conference on Multimedia, ACM, 2009 |
|
Hi YouTube! Personality Impressions and Verbal Content in Social Video, , , and , in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013, 2013 |
|
Bites'n'Bits: Inferring Eating Behavior from Contextual Mobile Data, , , and , in: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (PACM IMWUT), 1(4):125-157, 2017 |
|
FaceTube: predicting personality from facial expressions of emotion in online conversational video, , and , in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2012 |
|
Whole-Body Ergodic Exploration with a Manipulator Using Diffusion, , and , in: IEEE Robotics and Automation Letters, 8(12):8581-8587, 2023 |
[DOI] [URL] |
Energy assessment of a district by integrating solar thermal in district heating network: a dynamic analysis approach, , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
From Zero Energy to Zero Power Buildings: a new paradigm for a sustainable transition of the building stock, , and , in: Sustainable Cities and Society, 2023 |
[DOI] [URL] |
Learning From Humans, , and , in: Handbook of Robotics, pages 1995-2014, Springer, 2016 |
[DOI] [URL] |
An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, , , , , , , , , , , and , in: 6th european conference on speech communication and technology --- eurospeech'99, 1999 |
An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, , , , , , , , , , , and , Idiap-RR-24-1999 |
Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, and , in: Eurospeech 97, 1997 |
|
Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, and , Idiap-RR-05-1997 |
|
An overview of the cave project research activities in speaker verification, , , , , and , in: Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998 |
Speaker Verification in the Telephone Network : Research Activities in the CAVE Project, , , , , and , in: Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH'97), 1997 |
|
On the choice of the low-dimensional domain for global optimization via random embeddings, , and , in: Journal of Global Optimization, 2019 |
[DOI] [URL] |
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 |