Srikanth Madikeri - Idiap Publications

Novel Architectures for Unsupervised Information Bottleneck based Speaker Diarization of Meetings, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, Idiap-RR-26-2020

Novel Architectures for Unsupervised Information Bottleneck based Speaker Diarization of Meetings, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020

Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition, Geoffroy Vanderreydt, Amrutha Prasad, Driss Khalil, Srikanth Madikeri, Kris Demuynck and Petr Motlicek, in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'23, 2023

[DOI]

Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification, Esaú Villatoro-Tello, Srikanth Madikeri, Bidisha Sharma, Driss Khalil, Shashi Kumar, Iuliia Nigmatulina, Petr Motlicek and Aravind Ganapathiraju, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Republic of Korea, pages 12617-12621, IEEE, 2024

[DOI]
[URL]

Reducing Prompt Sensitivity in LLM-based Speech Recognition Through Learnable Projection, Sergio Burdisso, Esaú Villatoro-Tello, Shashi Kumar, Srikanth Madikeri, Andrés Carofilis, Pradeep Rangappa, Manjunath K E, Kadri Hacioğlu, Petr Motlicek and Andreas Stolcke, in: ICASSP 2026, 2026

ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations, Petr Motlicek, Erinc Dikici, Srikanth Madikeri, Pradeep Rangappa, Miroslav Janosik, Gerhard Backfried, Dorothea Thomas-Aniola, Maximilian Schurz, Johan Rohdin, Petr Schwarz, Marek Kovac, Květoslav Malý, Dominik Boboš, Mathias Leibiger, Costas Kalogiros, Andreas Alexopoulos, Daniel Kudenko, Zahra Ahmadi, Hoang H. Nguyen, Aravind Krishnan, Dawei Zhu, Dietrich Klakow, Maria Jofre, Francesco Calderoni, Denis Marraud, Nikolaos Koutras, Nikos Nikolau, Christiana Apostiki, Panagiotis Douris, Konstantinos Gkountas, Eleni Sergidou, Wauter Bosma, Joshua Hughues and Hellenic Police Team, in: Odyssey 2024: The Speaker and Language Recognition Workshop, pages 17-24, 2024

[DOI]
[URL]

SARAL: A Low-Resource Cross-Lingual Domain-Focused Information Retrieval System for Effective Rapid Document Triage, Elizabeth Boschee, Joel Barry, Jayadev Billa, Marjorie Freedman, Thamme Gowda, Constantine Lignos, Chester Palen-Michel, Michael Pust, Banriskhem Khonglah, Srikanth Madikeri, Jonathan May and Scott Miller, in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 19-24, 2019

SIIP: An Innovative Speaker Identification Approach for Law Enforcement Agencies, Khaled Khelif, Yann Mombrun, Gideon Hazzani, Petr Motlicek, Srikanth Madikeri, Farhan Sahito, Damien Kelly, Luca Scarpatto, Emmanouil Chatzigavriil and Gerhard Backfried, in: Big Data and Artificial Intelligence for Military Decision Making, http://www.sto.nato.int/, pages PT-1 - 1: PT-1 - 14, STO, 2018

[DOI]
[URL]

Speaker Diarization and Linking of Meeting Data, Marc Ferras, Srikanth Madikeri and Hervé Bourlard, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 24(11):1935-1945, 2016

Speaker recognition on mono-channel telephony recordings, Yosef Solewicz, Noa Cohen, Johan Rohdin, Srikanth Madikeri and Honza Cernocky, in: The Speaker and Language Recognition Workshop, 2022

Speech Activity Detection Based on Multilingual Speech Recognition System, Seyyed Saeed Sarfjoo, Srikanth Madikeri and Petr Motlicek, in: Interspeech, 2021

Speech and Language Recognition with Low-rank Adaptation of Pretrained Models, Amrutha Prasad, Srikanth Madikeri, Driss Khalil, Petr Motlicek and Christof Schüpbach, in: Interspeech 2024, pages 2825-2829, 2024

[DOI]
[URL]

Speech Data Selection for Efficient ASR Fine-Tuning using Domain Classifier and Pseudo-Label Filtering, Pradeep Rangappa, Juan Zuluaga-Gomez, Srikanth Madikeri, Andrés Carofilis, Jeena Prakash, Sergio Burdisso, Shashi Kumar, Esaú Villatoro-Tello, Iuliia Nigmatulina, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), 2025

[DOI]
[URL]

Stacked Neural Networks with Parameter Sharing for Multilingual Language Modeling, Banriskhem Khonglah, Srikanth Madikeri, Navid Rekabsaz, Nikolaos Pappas, Petr Motlicek and Hervé Bourlard, Idiap-RR-12-2019

Supervised domain adaptation for text-independent speaker verification using limited data, Seyyed Saeed Sarfjoo, Srikanth Madikeri, Petr Motlicek and Sébastien Marcel, in: Interspeech, pages 3815-3819, 2020

[URL]

System Fusion and Speaker Linking for Longitudinal Diarization of TV Shows, Marc Ferras, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5495-5499, IEEE, 2016

Team Switzerland Submission to NIST SRE24 Speaker Recognition Evaluation, Amrutha Prasad, Hatef Otroshi Shahreza, Andrés Carofilis, Aref Farhadipour, Shiran Liu, Srikanth Madikeri, Anjith George, Petr Motlicek, Sébastien Marcel, Masoumeh Chapariniya, Valeriia Perepelytsia, Teodora Vukovic and Volker Dellwo, Idiap-RR-10-2025

Template-matching for Text-dependent Speaker Verification, Subhadeep Dey, Petr Motlicek, Srikanth Madikeri and Marc Ferras, Idiap-RR-32-2017

Template-matching for Text-dependent Speaker Verification, Subhadeep Dey, Petr Motlicek, Srikanth Madikeri and Marc Ferras, in: Speech Communication, 2017

Text-only adaptation in LLM-based ASR through text denoising, Sergio Burdisso, Esaú Villatoro-Tello, Andrés Carofilis, Shashi Kumar, Kadri Hacioğlu, Srikanth Madikeri, Pradeep Rangappa, Manjunath K E, Petr Motlicek, Shankar Venkatesan and Andreas Stolcke, in: ICASSP, 2026

TokenVerse++: Towards Flexible Multitask Learning with Dynamic Task Activation, Shashi Kumar, Srikanth Madikeri, Esaú Villatoro-Tello, Sergio Burdisso, Pradeep Rangappa, Andrés Carofilis, Petr Motlicek, Karthik Pandia D S, Shankar Venkatesan, Kadri Hacioğlu and Andreas Stolcke, in: 2025 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), IEEE, 2025

TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Iuliia Thorbecke, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 20988–20995, Association for Computational Linguistics (ACL), 2024

[DOI]
[URL]

TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Iuliia Nigmatulina, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, Idiap-RR-07-2024

[URL]

Towards a breakthrough speaker identification approach for law enforcement agencies, Khaled Khelif, Yann Mombrun, Petr Motlicek, Gerhard Backfried, Damien Kelly, Farhan Sahito, Gideon Hazzani, Luca Scarpatto, Emmanouil Chatzigavriil and Srikanth Madikeri, Idiap-RR-29-2017

Towards a breakthrough Speaker Identification approach for Law Enforcement Agencies: SIIP, Khaled Khelif, Yann Mombrun, Gerhard Backfried, Farhan Sahito, Luca Scarpatto, Petr Motlicek, Damien Kelly, Gideon Hazzani, Emmanouil Chatzigavriil and Srikanth Madikeri, in: European Intelligence and Security Informatics Conference (EISIC) 2017, Athens, Greece, pages 32-39, IEEE Computer Society, 2017

[DOI]
[URL]

Towards utterance-based neural network adaptation in acoustic modeling, Ivan Himawan, Petr Motlicek, Marc Ferras and Srikanth Madikeri, in: IEEE Automatic Speech Recognition and Understanding Workshop, pages 289-295, 2015

TRACY Canvas: A Criminal Network Visualization Tool, Alejandra Sanchez Lara, Petr Motlicek, Dairazalia Sanchez-Cortes, Pradeep Rangappa, Srikanth Madikeri and Driss Khalil, Idiap-RR-03-2025

Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, Idiap-RR-09-2018

Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, in: Proceedings of Interspeech 2016, pages 2199-2203, 2016

Unifying Global and Near-Context Biasing in a Single Trie Pass, Iuliia Thorbecke, Esaú Villatoro-Tello, Juan Zuluaga-Gomez, Shashi Kumar, Sergio Burdisso, Pradeep Rangappa, Andrés Carofilis, Srikanth Madikeri, Petr Motlicek, Karthik Pandia D S, Kadri Hacioğlu and Andreas Stolcke, in: Text, Speech, and Dialogue. TSD 2025. Lecture Notes in Computer Science, Springer, 2025

[DOI]
[URL]

Voice Presentation Attack Detection Using Convolutional Neural Networks, Ivan Himawan, Srikanth Madikeri, Petr Motlicek, Milos Cernak, Sridha Sridharan and Clinton Fookes, in: Handbook of Biometric Anti-Spoofing, pages 391-415, Springer, 2019

[URL]

XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Iuliia Nigmatulina, Petr Motlicek, Manjunath K E and Aravind Ganapathiraju, Idiap-RR-08-2024

[URL]

XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Iuliia Thorbecke, Petr Motlicek, Manjunath K E and Aravind Ganapathiraju, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

[DOI]
[URL]