All publications sorted by author
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |
K
Hierarchical Multi-Stream Posterior Based Speech Recognition System, , and , Idiap-RR-25-2005 |
|
Identifying unexpected words using in-context and out-of-context phoneme posteriors, and , Idiap-RR-68-2006 |
|
Posterior Based Keyword Spotting with A Priori Thresholds, , , and , in: International Conference on Spoken Language Processing (ICSLP), 2006 |
|
Using more informative posterior probabilities for speech recognition, , , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006 |
|
Posterior Based Keyword Spotting with A Priori Thresholds, , , and , Idiap-RR-67-2006 |
|
Developing and Enhancing Posterior Based Speech Recognition Systems, , , and , in: Proceedings of Interspeech, 2005 |
|
Developing and Enhancing Posterior Based Speech Recognition Systems, , , and , Idiap-RR-23-2005 |
|
Using more informative posterior probabilities for speech recognition, , , and , Idiap-RR-91-2005 |
|
BLESS: Benchmarking Large Language Models on Sentence Simplification, , , , , , and , in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 2023 |
|
Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target, , and , in: Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013 |
|
Real-time Multiple Head Tracking Using Texture and Colour Cues, and , Idiap-RR-02-2017 |
|
An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain, , , , , , and , in: Aerospace, 10(10):876, 2023 |
[DOI] [URL] |
Kullback-Leibler Proximal Variational Inference, , , and , in: Proceedings of the international conference on Neural Information Processing Systems, pages 3402-3410, 2015 |
|
Bio-Medical Multi-label Scientific Literature Classification using LWAN and Dual-attention module, , , , and , in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022 |
|
IDIAP_TIET@LT-EDI-ACL2022 : Hope Speech Detection in Social Media using Contextualized BERT with Attention Mechanism, , and , in: ACL, 2022 |
|
ParsiNLU: A Suite of Language Understanding Challenges for Persian, , , , , , , , , , , , , , , , , , , , , and , in: TACL, 2021 |
|
Towards a breakthrough Speaker Identification approach for Law Enforcement Agencies: SIIP, , , , , , , , , and , in: European Intelligence and Security Informatics Conference (EISIC) 2017, Athenes, Greece, pages 32-39, IEEE Computer Society, 2017 |
[DOI] [URL] |
SIIP: An Innovative Speaker Identification Approach for Law Enforcement Agencies, , , , , , , , , and , in: Big Data and Artificial Intelligence for Military Decision Making, http://www.sto.nato.int/, pages PT-1 - 1: PT-1 - 14, STO, 2018 |
[DOI] [URL] |
Towards a breakthrough speaker identification approach for law enforcement agencies, , , , , , , , , and , Idiap-RR-29-2017 |
|
INCREMENTAL SEMI-SUPERVISED LEARNING FOR MULTI-GENRE SPEECH RECOGNITION, , , , , and , in: Proceedings of ICASSP 2020, 2020 |
|
INVESTIGATING TIME DELAY NEURAL NETWORK (TDNN) FOR LANGUAGE MODELING IN LOW RESOURCE AUTOMATIC SPEECH RECOGNITION, , , and , Idiap-RR-13-2019 |
|
STACKED NEURAL NETWORKS WITH PARAMETER SHARING FOR MULTILINGUAL LANGUAGE MODELING, , , , , and , Idiap-RR-12-2019 |
|
Modeling Dialectal Variation for Swiss German Automatic Speech Recognition, , and , in: Proceedings of Interspeech, 2021 |
[DOI] |
Learning to Translate Low-Resourced Swiss German Dialectal Speech into Standard German Text, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, Colombia, Cartagena, IEEE, 2021 |
|
An Evaluation Benchmark for Automatic Speech Recognition of German-English Code-Switching, , and , in: IEEE Automatic Speech Recognition and Understanding Workshop, 2021 |
|
Comparison of Subword Segmentation Methods for Open-vocabulary ASR using a Difficulty Metric, , , and |
|
COMPARISON OF SUBWORD SEGMENTATION METHODS FOR OPEN-VOCABULARYEND-TO-END SPEECH RECOGNITION, , , and , Idiap-RR-34-2020 |
|
Hierarchical speaker clustering methods for the NIST i-vector Challenge, , , and , in: Odyssey: The Speaker and Language Recognition Workshop, 2014 |
|
SPEAR: An open source toolbox for speaker recognition based on Bob, , and , in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 1655 - 1659, 2014 |
[DOI] [URL] |
The Idiap Speaker Recognition Evaluation System at NIST SRE 2012, , and , in: NIST Speaker Recognition Conference, NIST, Orlando, USA, 2012 |
|
Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, , , , and , in: Image and Vision Computing:1147-1160, 2014 |
[DOI] [URL] |
Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, , , , and , Idiap-RR-30-2013 |
|
Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, , and , in: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval, Dallas, Texas, USA, pages 97-104, ACM, 2013 |
|
Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, , and , Idiap-RR-31-2013 |
|
On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, , , and , Idiap-RR-35-2013 |
|
On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, , , and , in: Biometric Technologies in Forensic Science, Nijmegen, The Netherlands, 2013 |
|
Introducing I-Vectors for Joint Anti-spoofing and Speaker Verification, , , , and , in: The 15th Annual Conference of the International Speech Communication Association, 2014 |
|
Combining transcription-based and acoustic-based speaker identifications for broadcast news, , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012 |
|
ICB 2013 - Competition on speaker recognition in mobile environment using the MOBIO database: The Evaluation Plan, , and , Idiap-Com-04-2012 |
|
Audiovisual Diarization Of People In Video Content, , and , in: Multimedia Tools and Applications, 2012 |
|
The 2013 Speaker Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , Idiap-RR-32-2013 |
|
The 2013 Speaker Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: The 6th IAPR International Conference on Biometrics, 2013 |
|
Referencing in YouTube Knowledge Communication Videos, and , in: ACM International Conference on Interactive Media Experiences (IMX '23), June 2023, Nantes, France, 2023 |
|
Predicting the Conflict Level in Television Political Debates: an Approach Based on Crowdsourcing, Nonverbal Communication and Gaussian Processes, , , and , in: ACM Multimedia, 2012 |
Automatic detection of conflicts in spoken conversations: ratings and analysis of broadcast political debates, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012 |
|
Automatic detection of conflict escalation in spoken conversations, , and , in: INTERSPEECH, ISCA, Portland, Oregon, USA., 2012 |
|
Towards rich mobile phone datasets: Lausanne data collection campaign, , , , and , in: Proc. ACM Int. Conf. on Pervasive Services (ICPS,',','), Berlin., 2010 |
|
Building Blocks of Assistant Based Speech Recognition for Air Traffic Management Applications, , , , , , , and , in: Conference: SESAR Innovation Days 2018, European Union, Eurocontrol, Salzburg, Austria, SESARJU, 2018 |
[URL] |
Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning, , , , , , , and , in: 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC), San Antonio, TX, USA, pages 1-9, IEEE, 2021 |
[DOI] |
Semi-supervised Adaptation of Assistant Based Speech Recognition Models for different Approach Areas, , , , , , , , , , and , in: 37th AIAA/IEEE Digital Avionics Systems Conference, AIAA/IEEE, London, 2018 |
[URL] |
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |