Amrutha Prasad - Idiap Publications

Enhancing Speaker Diarization using Correlation-Based Clustering Initialization, Pradeep Rangappa, Amrutha Prasad, Srikanth Madikeri and Petr Motlicek, Idiap-RR-09-2025

IDIAP SUBMISSION TO NIST LRE22 LANGUAGE RECOGNITION EVALUATION, Amrutha Prasad, Driss Khalil, Srikanth Madikeri and Petr Motlicek, Idiap-RR-11-2025

Improving ASR and Callsign Detection in Air Traffic Control Speech using Whisper Prompting, Jehan Joachim Daniel Piaget, Amrutha Prasad and Petr Motlicek, Idiap-RR-04-2025

Leveraging Untranscribed Data for End-to-End Speech and Callsign Recognition in Air-Traffic Communication, Petr Motlicek, Shashi Kumar, Driss Khalil, Amrutha Prasad and Schüpbach Christof, in: SESAR Innovation Days 2025 (https://www.sesarju.eu/SIDS2025), Eurocontrol, Bled, Slovenia, 2025

[URL]

TEAM SWITZERLAND SUBMISSION TO NIST SRE24 SPEAKER RECOGNITION EVALUATION, Amrutha Prasad, Hatef Otroshi Shahreza, Andrés Carofilis, Aref Farhadipour, Shiran Liu, Srikanth Madikeri, Anjith George, Petr Motlicek, Sébastien Marcel, Masoumeh Chapariniya, Valeriia Perepelytsia, Teodora Vukovic and Volker Dellwo, Idiap-RR-10-2025

CONTEXTUAL BIASING METHODS FOR IMPROVING RARE WORD DETECTION IN AUTOMATIC SPEECH RECOGNITION, Mrinmoy Bhattacharjee, Nigmatulina Iuliia, Amrutha Prasad, Pradeep Rangappa, Srikanth Madikeri, Petr Motlicek, Hartmut Helmke and Matthias Kleinert, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024

Fine-tuning Self-Supervised Models For Language Identification Using Orthonormal Constraint, Amrutha Prasad, Andrés Carofilis, Geoffroy Vanderreydt, Driss Khalil, Srikanth Madikeri, Petr Motlicek and Schüpbach Christof, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), pages 11921-11925, 2024

[DOI]

Normalizing Flows for Speaker and Language Recognition Backend, Aleix Espuña, Amrutha Prasad, Petr Motlicek, Srikanth Madikeri and Schüpbach Christof, in: Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Speech and Language Recognition with Low-rank Adaptation of Pretrained Models, Amrutha Prasad, Srikanth Madikeri, Driss Khalil, Petr Motlicek and Schüpbach Christof, in: Interspeech 2024, pages 2825--2829, 2024

[DOI]
[URL]

A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers, Juan Zuluaga-Gomez, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek and Matthias Kleinert, in: Aerospace, 10(5), 2023

[DOI]
[URL]

An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain, Driss Khalil, Amrutha Prasad, Petr Motlicek, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Srikanth Madikeri and Schüpbach Christof, in: Aerospace, 10(10):876, 2023

[DOI]
[URL]

Automatic Speech Analysis Framework for ATC Communication in HAAWAII, Petr Motlicek, Amrutha Prasad, Nigmatulina Iuliia, Hartmut Helmke, Oliver Ohneiser and Matthias Kleinert, in: 13th SESAR Innovation Days, 2023

Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers’ Workload, Hartmut Helmke, Matthias Kleinert, Nils Ahrenhold, heiko Ehr, Thorsten Mühlhausen, Oliver Ohneiser, Petr Motlicek, Amrutha Prasad, Juan Zuluaga-Gomez, Lucas Klamert, Jelena Dokic and Ella Pinska Chauvin, in: Fifteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2023), Eurocontrol (Europe), FAA (U.S.), Savannah, Georgia, USA, 2023

[URL]

BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications, Juan Zuluaga-Gomez, Seyyed Saeed Sarfjoo, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek, Karel Ondřej and Oliver Ohneiser, in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023

[URL]

How Does Pre-trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications, Juan Zuluaga-Gomez, Amrutha Prasad, Nigmatulina Iuliia, Seyyed Saeed Sarfjoo, Petr Motlicek, Matthias Kleinert, Hartmut Helmke, Oliver Ohneiser and Qingran Zhan, in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023

[URL]

Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Amrutha Prasad, Petr Motlicek, Driss Khalil, Srikanth Madikeri, Allan Tart, Igor Szoke, Vincent Lenders, Mickael Rigault and Khalid Choukri, in: Aerospace, 10(10):898, 2023

[DOI]
[URL]

Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition, Geoffroy Vanderreydt, Amrutha Prasad, Driss Khalil, Srikanth Madikeri, Kris Demuynck and Petr Motlicek, in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'23, 2023

[DOI]

A two-step approach to leverage contextual data: speech recognition in air-traffic communications, Nigmatulina Iuliia, Juan Zuluaga-Gomez, Amrutha Prasad, Seyyed Saeed Sarfjoo and Petr Motlicek, in: Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6282-6286, IEEE, 2022

[DOI]
[URL]

Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition, Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Seyyed Saeed Sarfjoo, Nigmatulina Iuliia, Oliver Ohneiser and Hartmut Helmke, in: 12th SESAR Innovation Days, 2022

Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator, Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Seyyed Saeed Sarfjoo, Nigmatulina Iuliia and Karel Vesely, in: 12th SESAR Innovation Days, 2022

Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning, Matthias Kleinert, Hartmut Helmke, Shruthi Shetty, Oliver Ohneiser, heiko Ehr, Amrutha Prasad, Petr Motlicek and Julia Harfmann, in: 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC), San Antonio, TX, USA, pages 1-9, IEEE, 2021

[DOI]

Automatic processing pipeline for collecting and annotating air-traffic voice communication data, Martin Kocour, Karel Vesely, Igor Szoke, Santosh Kesiraju, Juan Zuluaga-Gomez, Blatt Alexander, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek and et al., in: Proceedings of 9th OpenSky Symposium 2020, OpenSky Network, Brussels, Belgium, pages 1-9, MDPI, 2021

BERTraffic: A Robust BERT-Based Approach for Speaker Change Detection and Role Identification of Air-Traffic Communications, Juan Zuluaga-Gomez, Seyyed Saeed Sarfjoo, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek, Oliver Ohneiser and Hartmut Helmke, Idiap-RR-15-2021

Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Amrutha Prasad, Petr Motlicek, Karel Vesely, Martin Kocour and Igor Szoke, Idiap-RR-14-2021

[URL]

Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Amrutha Prasad, Petr Motlicek, Karel Vesely, Martin Kocour and Igor Szoke, in: Interspeech 2021, 2021

[URL]

Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR, Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Oliver Ohneiser, Hartmut Helmke, Seyyed Saeed Sarfjoo and Nigmatulina Iuliia, Idiap-RR-22-2021

Measuring Speech Recognition And Understanding Performance in Air Traffic Control Domain Beyond Word Error Rates, Hartmut Helmke, Shruthi Shetty, Matthias Kleinert, Oliver Ohneiser, heiko Ehr, Amrutha Prasad, Petr Motlicek, Cerna Aneta and Christian Windisch, in: 11th SESAR Innovation Days, 2021

Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety, Hartmut Helmke, Matthias Kleinert, Shruthi Shetty, Oliver Ohneiser, heiko Ehr, Hörður Arilíusson, Teodor S. Simiganoschi, Amrutha Prasad, Petr Motlicek, Karel Vesely, Karel Ondřej, Pavel Smrz, Julia Harfmann and Christian Windisch, in: Fourteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2021), The United States Federal Aviation Administration (FAA), EUROCONTROL, pages 10, 2021

[URL]

AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, Petr Motlicek, Hynek Hermansky, Srikanth Madikeri, Amrutha Prasad and Sriram Ganapathy, Idiap-RR-01-2020

Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications, Juan Zuluaga-Gomez, Karel Vesely, Blatt Alexander, Petr Motlicek, Dietrich Klakow, Allan Tart, Igor Szoke, Amrutha Prasad, Seyyed Saeed Sarfjoo, Pavel Kolcarek, Martin Kocour, Honza Cernocky, Claudia Cevenini, Khalid Choukri, Mickael Rigault and Fabian Landis, in: Proceedings of 8th OpenSky Symposium 2020, OpenSky Network, pages 1-10, MDPI, 2020

[DOI]
[URL]

Automatic Speech Recognition Engines Adapted for Embedded Platforms, Amrutha Prasad, Idiap-Com-01-2020

Language model domain adaptation for automatic speech recognition, Amrutha Prasad, Petr Motlicek and Alexandre Nanchen, Idiap-RR-05-2020

AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, Petr Motlicek, Hynek Hermansky, Srikanth Madikeri, Amrutha Prasad and Sriram Ganapathy, in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019

[URL]