Srikanth Madikeri - Idiap Publications

COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of ICASSP 2015, pages 4834-4837, 2015

Analysis of Posterior Estimation Approaches to I-vector Extraction for Speaker Recognition, Srikanth Madikeri, Petr Motlicek, Marc Ferras and Subhadeep Dey, Idiap-RR-15-2018

Autocrime - open multimodal platform for combating organized crime, Srikanth Madikeri, Petr Motlicek, Dairazalia Sanchez-Cortes, Pradeep Rangappa, Joshua Hughes, Jacob Tkaczuk, Alejandra Sanchez Lara, Driss Khalil, Johan Rohdin, Dawei Zhu, Aravind Krishnan, Dietrich Klakow, Zahra Ahmadi, Marek Kovac, Dominik Boboš, Costas Kalogiros, Andreas Alexopoulos and Denis Marraud, in: Forensic Science International: Digital Investigation, 54, 2025

[DOI]
[URL]

Idiap submission to the NIST SRE 2018 Speaker Recognition Evaluation, Srikanth Madikeri, Seyyed Saeed Sarfjoo, Petr Motlicek and Sébastien Marcel, Idiap-RR-17-2019

MODIFIED GROUP DELAY FEATURE BASED TOTAL VARIABILITY SPACE MODELLING FOR SPEAKER RECOGNITION, Srikanth Madikeri, Asha T and Hema A Murthy, in: International Journal of Speech Techonology, 18(1):17-23, 2014

[DOI]

EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, Petr Motlicek, Subhadeep Dey, Srikanth Madikeri and Lukas Burget, Idiap-RR-16-2015

EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, Petr Motlicek, Subhadeep Dey, Srikanth Madikeri and Lukas Burget, in: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Brisbane, Australia, pages 4445-4449, 2015

[URL]

ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations, Petr Motlicek, Erinc Dikici, Srikanth Madikeri, Pradeep Rangappa, Miroslav Janosik, Gerhard Backfried, Dorothea Thomas-Aniola, Maximilian Schurz, Johan Rohdin, Petr Schwarz, Marek Kovac, Květoslav Malý, Dominik Boboš, Mathias Leibiger, Costas Kalogiros, Andreas Alexopoulos, Daniel Kudenko, Zahra Ahmadi, Hoang H. Nguyen, Aravind Krishnan, Dawei Zhu, Dietrich Klakow, Maria Jofre, Francesco Calderoni, Denis Marraud, Nikolaos Koutras, Nikos Nikolau, Christiana Apostiki, Panagiotis Douris, Konstantinos Gkountas, Eleni Sergidou, Wauter Bosma, Joshua Hughues and Hellenic Police Team, in: Odyssey 2024: The Speaker and Language Recognition Workshop, pages 17-24, 2024

[DOI]
[URL]

AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, Petr Motlicek, Hynek Hermansky, Srikanth Madikeri, Amrutha Prasad and Sriram Ganapathy, Idiap-RR-01-2020

AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, Petr Motlicek, Hynek Hermansky, Srikanth Madikeri, Amrutha Prasad and Sriram Ganapathy, in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019

[URL]

Fine-tuning Self-Supervised Models For Language Identification Using Orthonormal Constraint, Amrutha Prasad, Andrés Carofilis, Geoffroy Vanderreydt, Driss Khalil, Srikanth Madikeri, Petr Motlicek and Schüpbach Christof, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), pages 11921-11925, 2024

[DOI]

IDIAP SUBMISSION TO NIST LRE22 LANGUAGE RECOGNITION EVALUATION, Amrutha Prasad, Driss Khalil, Srikanth Madikeri and Petr Motlicek, Idiap-RR-11-2025

Speech and Language Recognition with Low-rank Adaptation of Pretrained Models, Amrutha Prasad, Srikanth Madikeri, Driss Khalil, Petr Motlicek and Schüpbach Christof, in: Interspeech 2024, pages 2825--2829, 2024

[DOI]
[URL]

TEAM SWITZERLAND SUBMISSION TO NIST SRE24 SPEAKER RECOGNITION EVALUATION, Amrutha Prasad, Hatef Otroshi Shahreza, Andrés Carofilis, Aref Farhadipour, Shiran Liu, Srikanth Madikeri, Anjith George, Petr Motlicek, Sébastien Marcel, Masoumeh Chapariniya, Valeriia Perepelytsia, Teodora Vukovic and Volker Dellwo, Idiap-RR-10-2025

Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering, Pradeep Rangappa, Andrés Carofilis, Jeena Prakash, Shashi Kumar, Sergio Burdisso, Srikanth Madikeri, Esaú Villatoro-Tello, Bidisha Sharma, Petr Motlicek, Kadri Hacioğlu, Shankar Venkatesan, Saurabh Vyas and Andreas Stolcke, in: Proc. Interspeech, 2025

Enhancing Speaker Diarization using Correlation-Based Clustering Initialization, Pradeep Rangappa, Amrutha Prasad, Srikanth Madikeri and Petr Motlicek, Idiap-RR-09-2025

Speech Data Selection for Efficient ASR Fine-Tuning using Domain Classifier and Pseudo-Label Filtering, Pradeep Rangappa, Juan Zuluaga-Gomez, Srikanth Madikeri, Andrés Carofilis, Jeena Prakash, Sergio Burdisso, Shashi Kumar, Esaú Villatoro-Tello, Nigmatulina Iuliia, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), 2025

[DOI]
[URL]

Idiap submission to the NIST SRE 2019 Speaker Recognition Evaluation, Seyyed Saeed Sarfjoo, Srikanth Madikeri, Mahdi Hajibabaei, Petr Motlicek and Sébastien Marcel, Idiap-RR-15-2019

Speech Activity Detection Based on Multilingual Speech Recognition System, Seyyed Saeed Sarfjoo, Srikanth Madikeri and Petr Motlicek, in: Interspeech, 2021

Supervised domain adaptation for text-independent speaker verification using limited data, Seyyed Saeed Sarfjoo, Srikanth Madikeri, Petr Motlicek and Sébastien Marcel, in: Interspeech, pages 3815-3819, 2020

[URL]

Speaker recognition on mono-channel telephony recordings, Yosef Solewicz, Noa Cohen, Johan Rohdin, Srikanth Madikeri and Honza Cernocky, in: The Speaker and Language Recognition Workshop, 2022

Feature Switching in the i-vector Framework for Speaker Verification, Asha T, Saranya M S, Karthik Pandia D S, Srikanth Madikeri and Hema A Murthy, in: Proc. of Interspeech 2014, pages 5, 2014

Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition, Geoffroy Vanderreydt, Amrutha Prasad, Driss Khalil, Srikanth Madikeri, Kris Demuynck and Petr Motlicek, in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'23, 2023

[DOI]

Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, Esaú Villatoro-Tello, Srikanth Madikeri, Petr Motlicek, Aravind Ganapathiraju and Alexei V. Ivanov, Idiap-RR-06-2022

Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, Esaú Villatoro-Tello, Srikanth Madikeri, Petr Motlicek, Aravind Ganapathiraju and Alexei V. Ivanov, in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2022

[DOI]

Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification, Esaú Villatoro-Tello, Srikanth Madikeri, Bidisha Sharma, Driss Khalil, Shashi Kumar, Nigmatulina Iuliia, Petr Motlicek and Aravind Ganapathiraju, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Republic of Korea, pages 12617-12621, IEEE, 2024

[DOI]
[URL]

Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks, Esaú Villatoro-Tello, Srikanth Madikeri, Juan Zuluaga-Gomez, Bidisha Sharma, Seyyed Saeed Sarfjoo, Nigmatulina Iuliia, Petr Motlicek, Alexei V. Ivanov and Aravind Ganapathiraju, in: Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, Idiap-RR-04-2021

Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, in: Proceedings of Interspeech, 2021

[URL]

LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2021

[URL]

LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, Idiap-RR-40-2020

[URL]

Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Amrutha Prasad, Petr Motlicek, Driss Khalil, Srikanth Madikeri, Allan Tart, Igor Szoke, Vincent Lenders, Mickael Rigault and Khalid Choukri, in: Aerospace, 10(10):898, 2023

[DOI]
[URL]