Srikanth Madikeri - Idiap Publications

XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Iuliia Thorbecke, Petr Motlicek, Manjunath K E and Aravind Ganapathiraju, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

[DOI]
[URL]

CONTEXTUAL BIASING METHODS FOR IMPROVING RARE WORD DETECTION IN AUTOMATIC SPEECH RECOGNITION, Mrinmoy Bhattacharjee, Nigmatulina Iuliia, Amrutha Prasad, Pradeep Rangappa, Srikanth Madikeri, Petr Motlicek, Hartmut Helmke and Matthias Kleinert, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024

Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction, Sergio Burdisso, Srikanth Madikeri and Petr Motlicek, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Miami, Florida, USA, pages 5421–5440, Association for Computational Linguistics, 2024

[URL]

Entity Matching Across Small Networks Using Node Attributes, Zahra Ahmadi, Zijian Zhang, Hoang H. Nguyen, Sergio Burdisso, Srikanth Madikeri, Petr Motlicek, Erinc Dikici, Gerhard Backfried, Marek Kovac and Daniel Kudenko, in: ECAI 2024 - 27th European Conference on Artificial Intelligence, October 19-24, 2024, Santiago de Compostela, Spain - Including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024), Proceedings, 2024

[DOI]

Fine-tuning Self-Supervised Models For Language Identification Using Orthonormal Constraint, Amrutha Prasad, Andrés Carofilis, Geoffroy Vanderreydt, Driss Khalil, Srikanth Madikeri, Petr Motlicek and Schüpbach Christof, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), pages 11921-11925, 2024

[DOI]

Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers, Shashi Kumar, Srikanth Madikeri, Nigmatulina Iuliia, Esaú Villatoro-Tello, Petr Motlicek, Karthik Pandia D S, S. Pavankumar Dubagunta and Aravind Ganapathiraju, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Republic of Korea, pages 12592-12596, IEEE, 2024

[DOI]
[URL]

Normalizing Flows for Speaker and Language Recognition Backend, Aleix Espuña, Amrutha Prasad, Petr Motlicek, Srikanth Madikeri and Schüpbach Christof, in: Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification, Esaú Villatoro-Tello, Srikanth Madikeri, Bidisha Sharma, Driss Khalil, Shashi Kumar, Nigmatulina Iuliia, Petr Motlicek and Aravind Ganapathiraju, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Republic of Korea, pages 12617-12621, IEEE, 2024

[DOI]
[URL]

ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations, Petr Motlicek, Erinc Dikici, Srikanth Madikeri, Pradeep Rangappa, Miroslav Janosik, Gerhard Backfried, Dorothea Thomas-Aniola, Maximilian Schurz, Johan Rohdin, Petr Schwarz, Marek Kovac, Květoslav Malý, Dominik Boboš, Mathias Leibiger, Costas Kalogiros, Andreas Alexopoulos, Daniel Kudenko, Zahra Ahmadi, Hoang H. Nguyen, Aravind Krishnan, Dawei Zhu, Dietrich Klakow, Maria Jofre, Francesco Calderoni, Denis Marraud, Nikolaos Koutras, Nikos Nikolau, Christiana Apostiki, Panagiotis Douris, Konstantinos Gkountas, Eleni Sergidou, Wauter Bosma, Joshua Hughues and Hellenic Police Team, in: Odyssey 2024: The Speaker and Language Recognition Workshop, pages 17-24, 2024

[DOI]
[URL]

Speech and Language Recognition with Low-rank Adaptation of Pretrained Models, Amrutha Prasad, Srikanth Madikeri, Driss Khalil, Petr Motlicek and Schüpbach Christof, in: Interspeech 2024, pages 2825--2829, 2024

[DOI]
[URL]

TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Iuliia Thorbecke, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 20988–20995, Association for Computational Linguistics (ACL), 2024

[DOI]
[URL]

Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks, Esaú Villatoro-Tello, Srikanth Madikeri, Juan Zuluaga-Gomez, Bidisha Sharma, Seyyed Saeed Sarfjoo, Nigmatulina Iuliia, Petr Motlicek, Alexei V. Ivanov and Aravind Ganapathiraju, in: Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Implementing contextual biasing in GPU decoder for online ASR, Nigmatulina Iuliia, Srikanth Madikeri, Esaú Villatoro-Tello, Petr Motlicek, Juan Zuluaga-Gomez, Karthik Pandia D S and Aravind Ganapathiraju, in: Proc. Interspeech 2023, pages 4494--4498, 2023

[DOI]
[URL]

Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, Sergio Burdisso, Esaú Villatoro-Tello, Srikanth Madikeri and Petr Motlicek, in: Proceedings of Interspeech, 2023

Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition, Geoffroy Vanderreydt, Amrutha Prasad, Driss Khalil, Srikanth Madikeri, Kris Demuynck and Petr Motlicek, in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'23, 2023

[DOI]

Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, Esaú Villatoro-Tello, Srikanth Madikeri, Petr Motlicek, Aravind Ganapathiraju and Alexei V. Ivanov, in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2022

[DOI]

Speaker recognition on mono-channel telephony recordings, Yosef Solewicz, Noa Cohen, Johan Rohdin, Srikanth Madikeri and Honza Cernocky, in: The Speaker and Language Recognition Workshop, 2022

A COMPARISON OF METHODS FOR OOV-WORD RECOGNITION ON A NEW PUBLIC DATASET, Rudolf Braun, Srikanth Madikeri and Petr Motlicek, in: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Toronto, Ontario, Canada, 2021

Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, in: Proceedings of Interspeech, 2021

[URL]

Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, Mael Fabien, Seyyed Saeed Sarfjoo, Srikanth Madikeri and Petr Motlicek, in: 1st ISCA Symposium on Security and Privacy in Speech Communication, pages 10--13, 2021

[DOI]

LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2021

[URL]

Multitask adaptation with Lattice-Free MMI for multi-genre speech recognition of low resource languages, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of Interspeech 2021, 2021

Speech Activity Detection Based on Multilingual Speech Recognition System, Seyyed Saeed Sarfjoo, Srikanth Madikeri and Petr Motlicek, in: Interspeech, 2021

INCREMENTAL SEMI-SUPERVISED LEARNING FOR MULTI-GENRE SPEECH RECOGNITION, Banriskhem Khonglah, Srikanth Madikeri, Subhadeep Dey, Hervé Bourlard, Petr Motlicek and Jayadev Billa, in: Proceedings of ICASSP 2020, 2020

Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition System, Srikanth Madikeri, Banriskhem Khonglah, Sibo Tong, Petr Motlicek, Hervé Bourlard and Daniel Povey, in: In Proceedings of Interspeech 2020, pages 4746--4750, ISCA, 2020

Supervised domain adaptation for text-independent speaker verification using limited data, Seyyed Saeed Sarfjoo, Srikanth Madikeri, Petr Motlicek and Sébastien Marcel, in: Interspeech, pages 3815-3819, 2020

[URL]

A BAYESIAN APPROACH TO INTER-TASK FUSION FOR SPEAKER RECOGNITION, Srikanth Madikeri, Subhadeep Dey and Petr Motlicek, in: In Proceedings of ICASSP 2019, Brighton, ENGLAND, pages 5786-5790, 2019

AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, Petr Motlicek, Hynek Hermansky, Srikanth Madikeri, Amrutha Prasad and Sriram Ganapathy, in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019

[URL]

INCREMENTAL TRANSFER LEARNING IN TWO-PASS INFORMATION BOTTLENECK BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, Nauman Dawalatabad, Srikanth Madikeri, Hema A Murthy and C Chandra Sekhar, in: Proceedings of ICASSP 2019, pages 6291-6295, 2019

SARAL: A Low-Resource Cross-Lingual Domain-Focused Information Retrieval System for Effective Rapid Document Triage, Elizabeth Boschee, Joel Barry, Jayadev Billa, Marjorie Freedman, Thamme Gowda, Constantine Lignos, Chester Palen-Michel, Michael Pust, Banriskhem Khonglah, Srikanth Madikeri, Jonathan May and Scott Miller, in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations. 2019, pages 19-24, 2019

Analysis of Language Dependent Front-End for Speaker Recognition, Srikanth Madikeri, Subhadeep Dey and Petr Motlicek, in: Proceedings of Interspeech 2018, Hyderabad, INDIA, pages 1101-1105, 2018

[DOI]

DNN based speaker embedding using content information for text-dependent speaker verification, Subhadeep Dey, Takafumi Koshinaka, Petr Motlicek and Srikanth Madikeri, in: Proceedings of 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2018

End-to-end text-dependent speaker verification using novel distance measures, Subhadeep Dey, Srikanth Madikeri and Petr Motlicek, in: Proceedings of Interspeech 2018, Hyderabad, INDIA, Aug 02-Sep 06, 2018, pages 3598-3602, 2018

[DOI]

SIIP: An Innovative Speaker Identification Approach for Law Enforcement Agencies, Khaled Khelif, yann Mombrun, Gideon Hazzani, Petr Motlicek, Srikanth Madikeri, Farhan Sahito, Damien Kelly, Luca Scarpatto, Emmanouil Chatzigavriil and Gerhard Backfried, in: Big Data and Artificial Intelligence for Military Decision Making, http://www.sto.nato.int/, pages PT-1 - 1: PT-1 - 14, STO, 2018

[DOI]
[URL]

Content Normalization for Text-dependent Speaker Verification, Subhadeep Dey, Srikanth Madikeri, Petr Motlicek and Marc Ferras, in: Proc. of Interspeech, 2017

EXPLOITING SEQUENCE INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Petr Motlicek, Srikanth Madikeri and Marc Ferras, in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, New Orleans, pages 5370-5374, 2017

INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION, Srikanth Madikeri, Marc Ferras, Petr Motlicek and Subhadeep Dey, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, pages 5365-5369, 2017

[DOI]

Towards a breakthrough Speaker Identification approach for Law Enforcement Agencies: SIIP, Khaled Khelif, yann Mombrun, Gerhard Backfried, Farhan Sahito, Luca Scarpatto, Petr Motlicek, Damien Kelly, Gideon Hazzani, Emmanouil Chatzigavriil and Srikanth Madikeri, in: European Intelligence and Security Informatics Conference (EISIC) 2017, Athenes, Greece, pages 32-39, IEEE Computer Society, 2017

[DOI]
[URL]

DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Srikanth Madikeri, Marc Ferras and Petr Motlicek, in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5050-5054, IEEE, 2016

INFORMATION THEORETIC CLUSTERING FOR UNSUPERVISED DOMAIN-ADAPTATION, Subhadeep Dey, Srikanth Madikeri and Petr Motlicek, in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5580-5584, IEEE, 2016

Inter-task System Fusion for Speaker Recognition, Marc Ferras, Srikanth Madikeri, Subhadeep Dey, Petr Motlicek and Hervé Bourlard, in: Proceeedings of the INTERSPEECH, 2016

SYSTEM FUSION AND SPEAKER LINKING FOR LONGITUDINAL DIARIZATION OF TV SHOWS, Marc Ferras, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5495-5499, IEEE, 2016

Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, in: Proceedings of Interspeech 2016, pages 2199-2203, 2016

COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of ICASSP 2015, pages 4834-4837, 2015

EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, Petr Motlicek, Subhadeep Dey, Srikanth Madikeri and Lukas Burget, in: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Brisbane, Australia, pages 4445-4449, 2015

[URL]

Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, Srikanth Madikeri, Ivan Himawan, Petr Motlicek and Marc Ferras, in: Proceedings of Interspeech 2015, pages 3105-3109, 2015

KL-HMM BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, Srikanth Madikeri and Hervé Bourlard, in: Proceedings of ICASSP 2015, pages 4435-4439, 2015

Towards utterance-based neural network adaptation in acoustic modeling, Ivan Himawan, Petr Motlicek, Marc Ferras and Srikanth Madikeri, in: IEEE Automatic Speech Recognition and Understanding Workshop, pages 289-295, 2015

Feature Switching in the i-vector Framework for Speaker Verification, Asha T, Saranya M S, Karthik Pandia D S, Srikanth Madikeri and Hema A Murthy, in: Proc. of Interspeech 2014, pages 5, 2014