Publication list - Idiap Publications

Hierarchical Integration of Phonetic and Lexical Knowledge in Phone Posterior Estimation, Hamed Ketabdar and Hervé Bourlard, in: ICASSP'08, 2008

In-Context Phone Posteriors as Complementary Features for Tandem ASR, Hamed Ketabdar and Hervé Bourlard, in: ICSLP'08, 2008

Hierarchical Multi-Stream Posterior Based Speech Recognition System, Hamed Ketabdar, Hervé Bourlard and Samy Bengio, in: Proceedings MLMI workshop, 2005

Hierarchical Multi-Stream Posterior Based Speech Recognition System, Hamed Ketabdar, Hervé Bourlard and Samy Bengio, Idiap-RR-25-2005

Identifying unexpected words using in-context and out-of-context phoneme posteriors, Hamed Ketabdar and Hynek Hermansky, Idiap-RR-68-2006

Posterior Based Keyword Spotting with A Priori Thresholds, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, in: International Conference on Spoken Language Processing (ICSLP), 2006

Using more informative posterior probabilities for speech recognition, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006

Posterior Based Keyword Spotting with A Priori Thresholds, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, Idiap-RR-67-2006

Developing and Enhancing Posterior Based Speech Recognition Systems, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, in: Proceedings of Interspeech, 2005

Developing and Enhancing Posterior Based Speech Recognition Systems, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, Idiap-RR-23-2005

Using more informative posterior probabilities for speech recognition, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, Idiap-RR-91-2005

BLESS: Benchmarking Large Language Models on Sentence Simplification, Tannon Kew, Alison Chi, Laura Vásquez-Rodríguez, Sweta Agrawal, Dennis Aumiller, Fernando Alva-Manchego and Matthew Shardlow, in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 2023

Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target, Vasil Khalidov, Florence Forbes and Radu Horaud, in: Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013

Real-time Multiple Head Tracking Using Texture and Colour Cues, Vasil Khalidov and Jean-Marc Odobez, Idiap-RR-02-2017

An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain, Driss Khalil, Amrutha Prasad, Petr Motlicek, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Srikanth Madikeri and Schüpbach Christof, in: Aerospace, 10(10):876, 2023

[DOI]
[URL]

Kullback-Leibler Proximal Variational Inference, Emtiyaz Khan, Pierre Baqué, Francois Fleuret and Pascal Fua, in: Proceedings of the international conference on Neural Information Processing Systems, pages 3402-3410, 2015

Bio-Medical Multi-label Scientific Literature Classification using LWAN and Dual-attention module, Deepanshu Khanna, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022

IDIAP_TIET@LT-EDI-ACL2022 : Hope Speech Detection in Social Media using Contextualized BERT with Attention Mechanism, Deepanshu Khanna, Muskaan Singh and Petr Motlicek, in: ACL, 2022

ParsiNLU: A Suite of Language Understanding Challenges for Persian, Daniel Khashabi, Arman Cohan, Siamak Shakeri, Pedram Hosseini, Pouya Pezeshkpour, Marzieh Bitaab, Faeze Brahman, Sarik Ghazarian, Arman Kabiri, Rabeeh Karimi Mahabadi, Omid Memarrast, Ahmadreza Mosallanezhad, Erfan Noury, Shahab Raji, Mohammad Sadegh Rasooli, Sepideh Sadeghi, Erfan Sadeqi Azer, Niloofar Safi Samghabadi, Mahsa Shafaei, Saber Sheybani, Ali Tazarv and Yadollah Yaghoobzadeh, in: TACL, 2021

Towards a breakthrough Speaker Identification approach for Law Enforcement Agencies: SIIP, Khaled Khelif, yann Mombrun, Gerhard Backfried, Farhan Sahito, Luca Scarpatto, Petr Motlicek, Damien Kelly, Gideon Hazzani, Emmanouil Chatzigavriil and Srikanth Madikeri, in: European Intelligence and Security Informatics Conference (EISIC) 2017, Athenes, Greece, pages 32-39, IEEE Computer Society, 2017

[DOI]
[URL]

SIIP: An Innovative Speaker Identification Approach for Law Enforcement Agencies, Khaled Khelif, yann Mombrun, Gideon Hazzani, Petr Motlicek, Srikanth Madikeri, Farhan Sahito, Damien Kelly, Luca Scarpatto, Emmanouil Chatzigavriil and Gerhard Backfried, in: Big Data and Artificial Intelligence for Military Decision Making, http://www.sto.nato.int/, pages PT-1 - 1: PT-1 - 14, STO, 2018

[DOI]
[URL]

Towards a breakthrough speaker identification approach for law enforcement agencies, Khaled Khelif, yann Mombrun, Petr Motlicek, Gerhard Backfried, Damien Kelly, Farhan Sahito, Gideon Hazzani, Luca Scarpatto, Emmanouil Chatzigavriil and Srikanth Madikeri, Idiap-RR-29-2017

INCREMENTAL SEMI-SUPERVISED LEARNING FOR MULTI-GENRE SPEECH RECOGNITION, Banriskhem Khonglah, Srikanth Madikeri, Subhadeep Dey, Hervé Bourlard, Petr Motlicek and Jayadev Billa, in: Proceedings of ICASSP 2020, 2020

INVESTIGATING TIME DELAY NEURAL NETWORK (TDNN) FOR LANGUAGE MODELING IN LOW RESOURCE AUTOMATIC SPEECH RECOGNITION, Banriskhem Khonglah, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, Idiap-RR-13-2019

STACKED NEURAL NETWORKS WITH PARAMETER SHARING FOR MULTILINGUAL LANGUAGE MODELING, Banriskhem Khonglah, Srikanth Madikeri, Navid Rekabsaz, Nikolaos Pappas, Petr Motlicek and Hervé Bourlard, Idiap-RR-12-2019

Modeling Dialectal Variation for Swiss German Automatic Speech Recognition, Abbas Khosravani, Philip N. Garner and Alexandros Lazaridis, in: Proceedings of Interspeech, 2021

[DOI]

Learning to Translate Low-Resourced Swiss German Dialectal Speech into Standard German Text, Abbas Khosravani, Philip N. Garner and Alexandros Lazaridis, in: IEEE Automatic Speech Recognition and Understanding Workshop, Colombia, Cartagena, IEEE, 2021

An Evaluation Benchmark for Automatic Speech Recognition of German-English Code-Switching, Abbas Khosravani, Philip N. Garner and Alexandros Lazaridis, in: IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Comparison of Subword Segmentation Methods for Open-vocabulary ASR using a Difficulty Metric, Abbas Khosravani, Claudiu Musat, Philip N. Garner and Alexandros Lazaridis

COMPARISON OF SUBWORD SEGMENTATION METHODS FOR OPEN-VOCABULARYEND-TO-END SPEECH RECOGNITION, Abbas Khosravani, Claudiu Musat, Philip N. Garner and Alexandros Lazaridis, Idiap-RR-34-2020

Hierarchical speaker clustering methods for the NIST i-vector Challenge, Elie Khoury, Laurent El Shafey, Marc Ferras and Sébastien Marcel, in: Odyssey: The Speaker and Language Recognition Workshop, 2014

SPEAR: An open source toolbox for speaker recognition based on Bob, Elie Khoury, Laurent El Shafey and Sébastien Marcel, in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 1655 - 1659, 2014

[DOI]
[URL]

The Idiap Speaker Recognition Evaluation System at NIST SRE 2012, Elie Khoury, Laurent El Shafey and Sébastien Marcel, in: NIST Speaker Recognition Conference, NIST, Orlando, USA, 2012

Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, Elie Khoury, Laurent El Shafey, Chris McCool, Manuel Günther and Sébastien Marcel, in: Image and Vision Computing:1147-1160, 2014

[DOI]
[URL]

Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, Elie Khoury, Laurent El Shafey, Chris McCool, Manuel Günther and Sébastien Marcel, Idiap-RR-30-2013

Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, Elie Khoury, Paul Gay and Jean-Marc Odobez, in: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval, Dallas, Texas, USA, pages 97-104, ACM, 2013

Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, Elie Khoury, Paul Gay and Jean-Marc Odobez, Idiap-RR-31-2013

On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, Elie Khoury, Manuel Günther, Laurent El Shafey and Sébastien Marcel, Idiap-RR-35-2013

On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, Elie Khoury, Manuel Günther, Laurent El Shafey and Sébastien Marcel, in: Biometric Technologies in Forensic Science, Nijmegen, The Netherlands, 2013

Introducing I-Vectors for Joint Anti-spoofing and Speaker Verification, Elie Khoury, Tomi Kinnunen, Aleksandr Sizov, Zhizheng Wu and Sébastien Marcel, in: The 15th Annual Conference of the International Speech Communication Association, 2014

Combining transcription-based and acoustic-based speaker identifications for broadcast news, Elie Khoury, Antoine Laurent, Sylvain Meignier and Simon Petitrenaud, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012

ICB 2013 - Competition on speaker recognition in mobile environment using the MOBIO database: The Evaluation Plan, Elie Khoury, Sébastien Marcel and Manuel Günther, Idiap-Com-04-2012

Audiovisual Diarization Of People In Video Content, Elie Khoury, Christine Sénac and Philippe Joly, in: Multimedia Tools and Applications, 2012

The 2013 Speaker Recognition Evaluation in Mobile Environment, Elie Khoury, Bostjan Vesnicer, Javier Franco-Pedroso, Ricardo Violato, Zenelabidine Boulkenafet, Luis-Miguel Mazaira Fernandez, Mireia Diez, Justina Kosmala, Houssemeddine Khemiri, Tomas Cipr, Rahim Saedi, Manuel Günther, Jerneja Zganec-Gros, Ruben Zazo Candil, Flávio Simões, Messaoud Bengherabi, Augustin Alvarez Marquina, Mikel Penagarikano, Alberto Abad, Mehdi Boulayemen, Petr Schwarz, David Van Leeuwen, Javier Gonzalez-Domınguez, Mário Uliani Neto, Elhocine Boutellaa, Pedro Gomez Vilda, Amparo Varona, Dijana Petrovska-Delacretaz, Pavel Matejka, Joaquin Gonzalez-Rodrıguez, Tiago de Freitas Pereira, Farid Harizi, Luis Javier Rodriguez-Fuentes, Laurent El Shafey, Marcus de Assis Angeloni, German Bordel, Gérard Chollet and Sébastien Marcel, Idiap-RR-32-2013

The 2013 Speaker Recognition Evaluation in Mobile Environment, Elie Khoury, Bostjan Vesnicer, Javier Franco-Pedroso, Ricardo Violato, Zenelabidine Boulkenafet, Luis-Miguel Mazaira Fernandez, Mireia Diez, Justina Kosmala, Houssemeddine Khemiri, Tomas Cipr, Rahim Saedi, Manuel Günther, Jerneja Zganec-Gros, Ruben Zazo Candil, Flávio Simões, Messaoud Bengherabi, Augustin Alvarez Marquina, Mikel Penagarikano, Alberto Abad, Mehdi Boulayemen, Petr Schwarz, David Van Leeuwen, Javier Gonzalez-Domınguez, Mário Uliani Neto, Elhocine Boutellaa, Pedro Gomez Vilda, Amparo Varona, Dijana Petrovska-Delacretaz, Pavel Matejka, Joaquin Gonzalez-Rodrıguez, Tiago de Freitas Pereira, Farid Harizi, Luis Javier Rodriguez-Fuentes, Laurent El Shafey, Marcus Angeloni, German Bordel, Gérard Chollet and Sébastien Marcel, in: The 6th IAPR International Conference on Biometrics, 2013

Referencing in YouTube Knowledge Communication Videos, Haeeun Kim and Daniel Gatica-Perez, in: ACM International Conference on Interactive Media Experiences (IMX '23), June 2023, Nantes, France, 2023

Predicting the Conflict Level in Television Political Debates: an Approach Based on Crowdsourcing, Nonverbal Communication and Gaussian Processes, Samuel Kim, Maurizio Filippone, Fabio Valente and Alessandro Vinciarelli, in: ACM Multimedia, 2012

Automatic detection of conflicts in spoken conversations: ratings and analysis of broadcast political debates, Samuel Kim, Fabio Valente and Alessandro Vinciarelli, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012

Automatic detection of conflict escalation in spoken conversations, Samuel Kim, Sree Harsha Yella and Fabio Valente, in: INTERSPEECH, ISCA, Portland, Oregon, USA., 2012

Towards rich mobile phone datasets: Lausanne data collection campaign, N. Kiukkonen, Blom J., O. Dousse, Daniel Gatica-Perez and J. K. Laurila, in: Proc. ACM Int. Conf. on Pervasive Services (ICPS,',','), Berlin., 2010