Petr Motlicek - Idiap Publications

Entity Matching Across Small Networks Using Node Attributes, Zahra Ahmadi, Zijian Zhang, Hoang H. Nguyen, Sergio Burdisso, Srikanth Madikeri, Petr Motlicek, Erinc Dikici, Gerhard Backfried, Marek Kovac and Daniel Kudenko, in: ECAI 2024 - 27th European Conference on Artificial Intelligence, October 19-24, 2024, Santiago de Compostela, Spain - Including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024), Proceedings, 2024

[DOI]

Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, Iuliia Thorbecke, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Shashi Kumar, Pradeep Rangappa, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: Findings of the Association for Computational Linguistics: EMNLP 2024, pages 16747–16762, Association for Computational Linguistics (ACL), 2024

[DOI]
[URL]

Fine-tuning Self-Supervised Models For Language Identification Using Orthonormal Constraint, Amrutha Prasad, Andrés Carofilis, Geoffroy Vanderreydt, Driss Khalil, Srikanth Madikeri, Petr Motlicek and Schüpbach Christof, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), pages 11921-11925, 2024

[DOI]

Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions, Dairazalia Sanchez-Cortes, Sergio Burdisso, Esaú Villatoro-Tello and Petr Motlicek, in: Proceedings of the 15th International Conference of the CLEF Association: Experimental IR Meets Multilinguality, Multimodality, and Interaction, Grenoble, France, pages 127-138, Springer Nature Switzerland, 2024

[DOI]
[URL]

Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers, Shashi Kumar, Srikanth Madikeri, Nigmatulina Iuliia, Esaú Villatoro-Tello, Petr Motlicek, Karthik Pandia D S, S. Pavankumar Dubagunta and Aravind Ganapathiraju, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Republic of Korea, pages 12592-12596, IEEE, 2024

[DOI]
[URL]

Normalizing Flows for Speaker and Language Recognition Backend, Aleix Espuña, Amrutha Prasad, Petr Motlicek, Srikanth Madikeri and Schüpbach Christof, in: Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification, Esaú Villatoro-Tello, Srikanth Madikeri, Bidisha Sharma, Driss Khalil, Shashi Kumar, Nigmatulina Iuliia, Petr Motlicek and Aravind Ganapathiraju, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Republic of Korea, pages 12617-12621, IEEE, 2024

[DOI]
[URL]

Reliability Estimation of News Media Sources: Birds of a Feather Flock Together, Sergio Burdisso, Dairazalia Sanchez-Cortes, Esaú Villatoro-Tello and Petr Motlicek, in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Mexico City, Mexico, pages 6900–6918, Association for Computational Linguistics, 2024

[DOI]
[URL]

ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations, Petr Motlicek, Erinc Dikici, Srikanth Madikeri, Pradeep Rangappa, Miroslav Janosik, Gerhard Backfried, Dorothea Thomas-Aniola, Maximilian Schurz, Johan Rohdin, Petr Schwarz, Marek Kovac, Květoslav Malý, Dominik Boboš, Mathias Leibiger, Costas Kalogiros, Andreas Alexopoulos, Daniel Kudenko, Zahra Ahmadi, Hoang H. Nguyen, Aravind Krishnan, Dawei Zhu, Dietrich Klakow, Maria Jofre, Francesco Calderoni, Denis Marraud, Nikolaos Koutras, Nikos Nikolau, Christiana Apostiki, Panagiotis Douris, Konstantinos Gkountas, Eleni Sergidou, Wauter Bosma, Joshua Hughues and Hellenic Police Team, in: Odyssey 2024: The Speaker and Language Recognition Workshop, pages 17-24, 2024

[DOI]
[URL]

Speech and Language Recognition with Low-rank Adaptation of Pretrained Models, Amrutha Prasad, Srikanth Madikeri, Driss Khalil, Petr Motlicek and Schüpbach Christof, in: Interspeech 2024, pages 2825--2829, 2024

[DOI]
[URL]

TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Iuliia Thorbecke, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 20988–20995, Association for Computational Linguistics (ACL), 2024

[DOI]
[URL]

Automatic Speech Analysis Framework for ATC Communication in HAAWAII, Petr Motlicek, Amrutha Prasad, Nigmatulina Iuliia, Hartmut Helmke, Oliver Ohneiser and Matthias Kleinert, in: 13th SESAR Innovation Days, 2023

Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers’ Workload, Hartmut Helmke, Matthias Kleinert, Nils Ahrenhold, heiko Ehr, Thorsten Mühlhausen, Oliver Ohneiser, Petr Motlicek, Amrutha Prasad, Juan Zuluaga-Gomez, Lucas Klamert, Jelena Dokic and Ella Pinska Chauvin, in: Fifteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2023), Eurocontrol (Europe), FAA (U.S.), Savannah, Georgia, USA, 2023

[URL]

BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications, Juan Zuluaga-Gomez, Seyyed Saeed Sarfjoo, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek, Karel Ondřej and Oliver Ohneiser, in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023

[URL]

Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training, Mrinmoy Bhattacharjee, Petr Motlicek, Nigmatulina Iuliia, Hartmut Helmke, Oliver Ohneiser, Matthias Kleinert and heiko Ehr, in: Proc. 13th SESAR Innovation Days, Seville, Spain, 2023

[DOI]
[URL]

Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks, Esaú Villatoro-Tello, Srikanth Madikeri, Juan Zuluaga-Gomez, Bidisha Sharma, Seyyed Saeed Sarfjoo, Nigmatulina Iuliia, Petr Motlicek, Alexei V. Ivanov and Aravind Ganapathiraju, in: Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

How Does Pre-trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications, Juan Zuluaga-Gomez, Amrutha Prasad, Nigmatulina Iuliia, Seyyed Saeed Sarfjoo, Petr Motlicek, Matthias Kleinert, Hartmut Helmke, Oliver Ohneiser and Qingran Zhan, in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023

[URL]

HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition, Florian Mai, Juan Zuluaga-Gomez, Titouan Parcollet and Petr Motlicek, in: Proc. Interspeech 2023, Ireland, 2023

Implementing contextual biasing in GPU decoder for online ASR, Nigmatulina Iuliia, Srikanth Madikeri, Esaú Villatoro-Tello, Petr Motlicek, Juan Zuluaga-Gomez, Karthik Pandia D S and Aravind Ganapathiraju, in: Proc. Interspeech 2023, pages 4494--4498, 2023

[DOI]
[URL]

Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, Sergio Burdisso, Esaú Villatoro-Tello, Srikanth Madikeri and Petr Motlicek, in: Proceedings of Interspeech, 2023

Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition, Geoffroy Vanderreydt, Amrutha Prasad, Driss Khalil, Srikanth Madikeri, Kris Demuynck and Petr Motlicek, in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'23, 2023

[DOI]

A two-step approach to leverage contextual data: speech recognition in air-traffic communications, Nigmatulina Iuliia, Juan Zuluaga-Gomez, Amrutha Prasad, Seyyed Saeed Sarfjoo and Petr Motlicek, in: Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6282-6286, IEEE, 2022

[DOI]
[URL]

An Empirical Comparison of Semantic Similarity Methods for Analyzing down-streaming Automatic Minuting task, Aditya Upadhyay, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36), In ACL Anthology Proceedings, 2022

An End-to-End Multilingual System for Automatic Minuting of Multi-Party Dialogues, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: Pacific Asia Conference on Language, Information and Computation (PACLIC 36) , In proceedings of ACL Anthology, 2022

Automatic Summarization for Creative Writing: Denoising Auto-Encoder based Pipeline Method for Generating Summary of Movie Scripts, Aditya Upadhyay, Nidhir Bhavsar, Aakash Bhatnagar, Muskaan Singh and Petr Motlicek, in: Automatic Summarization for Creative Writing, International Conference on Computational Linguistics (COLING 2022), 2022

Bio-Medical Multi-label Scientific Literature Classification using LWAN and Dual-attention module, Deepanshu Khanna, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022

DeepCon: An End-to-End Multilingual Toolkit for Automatic Minuting of Multi-Party Dialogues, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: Special Interest Group on Discourse and Dialogue (SIGDIAL 2022), 2022

Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, Esaú Villatoro-Tello, Srikanth Madikeri, Petr Motlicek, Aravind Ganapathiraju and Alexei V. Ivanov, in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2022

[DOI]

Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition, Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Seyyed Saeed Sarfjoo, Nigmatulina Iuliia, Oliver Ohneiser and Hartmut Helmke, in: 12th SESAR Innovation Days, 2022

Hierarchical Multi-task learning framework for Isometric-Speech Language Translation, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: ACL, 2022

HMIST: Hierarchical Multilingual Isometric Speech Translation using Multi-Task Learning Framework for Automatic Dubbing, Nidhir Bhavsar, Aakash Bhatnagar, Muskaan Singh and Petr Motlicek, in: PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION (PACLIC 2022), In Proceedings of ACL Anthology, 2022

IDIAP Submission@LT-EDI-ACL2022 : Hope Speech Detection for Equality, Diversity and Inclusion, Muskaan Singh and Petr Motlicek, in: ACL, 2022

IDIAP Submission@LT-EDI-ACL2022: Detecting Signs of Depression from Social Media Text, Muskaan Singh and Petr Motlicek, in: ACL, 2022

IDIAP Submission@LT-EDI-ACL2022: Homophobia/Transphobia Detection in social media comments, Muskaan Singh and Petr Motlicek, in: ACL Proceedings, 2022

IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, Sergio Burdisso, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Martin Fajcik, Muskaan Singh, Pavel Smrz and Petr Motlicek, in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022

[URL]

IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, Martin Fajcik, Muskaan Singh, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek and Pavel Smrz, in: The 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE @ EMNLP 2022), 2022

[URL]

IDIAP_TIET@LT-EDI-ACL2022 : Hope Speech Detection in Social Media using Contextualized BERT with Attention Mechanism, Deepanshu Khanna, Muskaan Singh and Petr Motlicek, in: ACL, 2022

Innovators@SMM4H'22: An Ensembles Approach for self-reporting of COVID-19 Vaccination Status Tweets, Mohammad Zohair, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: International Conference on Computational Linguistics (COLING 2022), 2022

Innovators@SMM4H'22: An Ensembles Approach for Stance and Premise Classification of COVID-19 Health Mandates Tweets, Vatsal Savaliya, Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh and Petr Motlicek, in: International Conference on Computational Linguistics (COLING 2022), 2022

Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator, Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Seyyed Saeed Sarfjoo, Nigmatulina Iuliia and Karel Vesely, in: 12th SESAR Innovation Days, 2022

Team Innovators at SemEval-2022 for Task 8: Multi-Task Training with Hyperpartisan and Semantic Relation for Multi-Lingual News Article Similarity, Nidhir Bhavsar, Aakash Bhatnagar, Muskaan Singh, Petr Motlicek and Tirthankar Ghosal, in: Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), NAACL 2022, 2022

A COMPARISON OF METHODS FOR OOV-WORD RECOGNITION ON A NEW PUBLIC DATASET, Rudolf Braun, Srikanth Madikeri and Petr Motlicek, in: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Toronto, Ontario, Canada, 2021

Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning, Matthias Kleinert, Hartmut Helmke, Shruthi Shetty, Oliver Ohneiser, heiko Ehr, Amrutha Prasad, Petr Motlicek and Julia Harfmann, in: 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC), San Antonio, TX, USA, pages 1-9, IEEE, 2021

[DOI]

Automatic processing pipeline for collecting and annotating air-traffic voice communication data, Martin Kocour, Karel Vesely, Igor Szoke, Santosh Kesiraju, Juan Zuluaga-Gomez, Blatt Alexander, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek and et al., in: Proceedings of 9th OpenSky Symposium 2020, OpenSky Network, Brussels, Belgium, pages 1-9, MDPI, 2021

Boosting of contextual information in ASR for air-traffic call-sign recognition, Martin Kocour, Karel Vesely, Blatt Alexander, Juan Zuluaga-Gomez, Igor Szoke, Jan Cernocky, Dietrich Klakow and Petr Motlicek, in: Interspeech 2021, 2021

Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Amrutha Prasad, Petr Motlicek, Karel Vesely, Martin Kocour and Igor Szoke, in: Interspeech 2021, 2021

[URL]

Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, Mael Fabien, Seyyed Saeed Sarfjoo, Srikanth Madikeri and Petr Motlicek, in: 1st ISCA Symposium on Security and Privacy in Speech Communication, pages 10--13, 2021

[DOI]

Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, Esaú Villatoro-Tello, S. Pavankumar Dubagunta, Julian Fritsch, Gabriela Ramírez-de-la-Rosa, Petr Motlicek and Mathew Magimai-Doss, in: Proceedings of Interspeech 2021, ISCA-International Speech Communication Association 2021, 2021

Measuring Speech Recognition And Understanding Performance in Air Traffic Control Domain Beyond Word Error Rates, Hartmut Helmke, Shruthi Shetty, Matthias Kleinert, Oliver Ohneiser, heiko Ehr, Amrutha Prasad, Petr Motlicek, Cerna Aneta and Christian Windisch, in: 11th SESAR Innovation Days, 2021

Multi-task Neural Network for Robust Multiple Speaker Embedding Extraction, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: Proceedings of Interspeech 2021, 2021