Mathew Magimai-Doss - Idiap Publications

Update cookies preferences

First name(s):	Mathew
Last name(s):	Magimai-Doss
Email:	mathew@idiap.ch

| 1 | 2 | 3 | 4 | 5 | 6 | 7 |

Phase AutoCorrelation (PAC) features for noise robust speech recognition, Shajith Ikbal, Hemant Misra, Hynek Hermansky and Mathew Magimai-Doss, in: Speech Communication, 54(7):867–880, 2012

[DOI]

On Learning Grapheme-to-Phoneme Relationships through the Acoustic Speech Signal, Mathew Magimai-Doss and Ramya Rasipuram, in: The Phonetician, 109–110:6-23, 2014

attachment

On Detection of Depression in Parkinson's Disease Patients' Speech: Handcrafted Features vs. Speech Foundation Models, Tilak Purohit, Barbara Ruvolo, Juan Rafael Orozco-Arroyave and Mathew Magimai-Doss, in: Automatic Assessment of Parkinsonian Speech, Springer Nature Switzerland AG, 2025

attachment

[URL]

Speech Processing, Mathew Magimai-Doss, in: Interactive Multimodal Information Management, pages 221--245, EPFL Press, 2013

Exploratory analysis of yellow mongoose vocalization: detection from in-the-wild recordings and call classification, Sevada Hovsepyan, Imen Ben Mahmoud, Vanessa Rüegg, Marta Manser and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2026

attachment

Zero frequency resonator based extraction of R-peaks in ECG signals, RaviShankar Prasad, Gürkan Yilmaz and Mathew Magimai-Doss, in: Proceedings of EUSIPCO, 2026

attachment

Automatic Parkinson’s disease detection from speech: Layer selection vs adaptation of foundation models, Tilak Purohit, Barbara Ruvolo, Juan Rafael Orozco-Arroyave and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

attachment

Children's Voice Privacy: First Steps and Emerging Challenges, Ajinkya Kulkarni, Francisco Teixeira, Enno Hermann, Thomas Rolland, Isabel Trancoso and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2025

attachment

Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing, Eklavya Sarkar and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech and Signal Processing, 2025

attachment

Emotion information recovery potential of wav2vec2 network fine-tuned for speech recognition task, Tilak Purohit and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

attachment

Exploring the Complexity of Parkinson’s Patient Speech for Depression Detection task: A Qualitative Analysis, Barbara Ruvolo, Tilak Purohit, Bogdan Vlasenko, Juan Rafael Orozco-Arroyave and Mathew Magimai-Doss, in: Proceedings of Workshop on Speech Pathology Analysis and DEtection (SPADE), Hyderabad, India, IEEE, 2025

attachment

Idiap kNN-TTS System for the Blizzard Challenge 2025, Enno Hermann, Karl El Hajal, Ajinkya Kulkarni and Mathew Magimai-Doss, in: Blizzard Challenge Workshop, 2025

attachment

kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech, Karl El Hajal, Ajinkya Kulkarni, Enno Hermann and Mathew Magimai-Doss, in: Proceedings of the Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), Albuquerque, New Mexico, ACL, 2025

attachment

[URL]

Multimodal Prosody Modeling: A Use Case for Multilingual Sentence Mode Prediction, Bogdan Vlasenko and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2025

attachment

Towards Dynamic Skeleton-based Handshape Subunits for Sign Language Assessment, Sandrine Tornay and Mathew Magimai-Doss, in: Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025

attachment

Towards Leveraging Sequential Structure in Animal Vocalizations, Eklavya Sarkar and Mathew Magimai-Doss, in: Neural Information Processing Systems workshop: AI for Non-Human Animal Communication, 2025

attachment

Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR, Karl El Hajal, Enno Hermann, Ajinkya Kulkarni and Mathew Magimai-Doss, in: Proceedings of Workshop on Speech Pathology Analysis and DEtection (SPADE), Hyderabad, India, IEEE, 2025

attachment

[URL]

Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech, Karl El Hajal, Enno Hermann, Sevada Hovsepyan and Mathew Magimai-Doss, in: Proceedings of Interspeech, Rotterdam, Netherlands, ISCA, 2025

attachment

[URL]

Unveiling Audio Deepfake Origins: A Deep Metric learning And Conformer Network Approach With Ensemble Fusion, Ajinkya Kulkarni, Dowerah Sandipana, Mathew Magimai-Doss and Tanel alumae, in: Proceedings of Interspeech, 2025

attachment

CONTENT-BASED OBJECTIVE EVALUATION OF ARTIFICIALLY GENERATED SIGN LANGUAGE VIDEOS, Neha Tarigopula, Preyas Garg, Skanda Muralidhar, Sandrine Tornay, Dinesh Babu Jayagopi and Mathew Magimai-Doss, in: ICASSP, 2024

attachment

Cross-transfer Knowledge between Speech and Text Encoders to Evaluate Customer Satisfaction, Luis Felipe Parra-Gallego, Tilak Purohit, Bogdan Vlasenko, Juan Rafael Orozco-Arroyave and Mathew Magimai-Doss, in: Proceedings of Interspeech, Kos Island, Greece, ISCA, 2024

attachment

Exploring generalization to unseen audio data for spoofing: insights from SSL models, Atharva Kulkarni, Hoan My Tran, Ajinkya Kulkarni, Dowerah Sandipana, Damien Lolive and Mathew Magimai-Doss, in: ISCA Proceedings, Greece, 2024

[DOI]
[URL]

Feature Representations for Automatic Meerkat Vocalization Classification, Imen Ben Mahmoud, Eklavya Sarkar, Marta Manser and Mathew Magimai-Doss, in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024

attachment

Neurocomputational model of speech recognition for pathological speech detection: a case study on Parkinson’s disease speech detection, Sevada Hovsepyan and Mathew Magimai-Doss, in: Proceedings of Interspeech, Kos Island, Greece, pages 3590-3594, 2024

attachment

[DOI]
[URL]

On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis, Eklavya Sarkar and Mathew Magimai-Doss, in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024

attachment

Predicting Heart Activity from Speech using Data-driven and Knowledge-based features, Gasser Elbanna, Zohreh Mostaani and Mathew Magimai-Doss, in: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2024

attachment

SYLLABLE LEVEL FEATURES FOR PARKINSON'S DISEASE DETECTION FROM SPEECH, Sevada Hovsepyan and Mathew Magimai-Doss, in: ICASSP, 2024

attachment

Towards interfacing large language models with ASR systems using confidence measures and prompting, Maryam Naderi, Enno Hermann, Alexandre Nanchen, Sevada Hovsepyan and Mathew Magimai-Doss, in: Proceedings of Interspeech, pages 2980-2984, 2024

attachment

[DOI]

Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?, Eklavya Sarkar and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2023

attachment

Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation, Enno Hermann and Mathew Magimai-Doss, in: Proceedings of Interspeech, pages 156-160, 2023

attachment

[DOI]
[URL]

Implicit phonetic information modeling for speech emotion recognition, Tilak Purohit, Bogdan Vlasenko and Mathew Magimai-Doss, in: Proceedings of Interspeech, Dublin, Ireland, ISCA, 2023

attachment

Towards learning emotion information from short segments of speech, Tilak Purohit, Sarthak Yadav, Bogdan Vlasenko, S. Pavankumar Dubagunta and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes Island, Greece, IEEE, 2023

attachment

Using Commercial ASR Solutions to Assess Reading Skills in Children: A Case Report, Timothy Piton, Enno Hermann, Angela Pasqualotto, Marjolaine Cohen, Mathew Magimai-Doss and Daphné Bavelier, in: Proceedings of Interspeech, pages 4573-4577, 2023

attachment

[DOI]
[URL]

Comparing Biosignal and Acoustic feature Representation for Continuous Emotion Recognition, Sarthak Yadav, Tilak Purohit, Zohreh Mostaani, Bogdan Vlasenko and Mathew Magimai-Doss, in: International Multimodal Sentiment Analysis Workshop and Challenge, 2022

attachment

Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track, Tilak Purohit, Imen Ben Mahmoud, Bogdan Vlasenko and Mathew Magimai-Doss, in: Proceedings of the ICML Expressive Vocalizations Workshop held in conjunction with the 39th International Conference on Machine Learning, Maryland, USA, 2022

attachment

Modeling Of Pre-trained Neural Network Embeddings Learned From Raw Waveform For Covid-19 Infection Detection, Zohreh Mostaani, RaviShankar Prasad, Bogdan Vlasenko and Mathew Magimai-Doss, in: Proceedings of ICASSP, 2022

attachment

On Breathing Pattern Information in Synthetic Speech, Zohreh Mostaani and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2022

attachment

Towards Accessible Sign Language Learning and Assessment, Neha Tarigopula, Sandrine Tornay, Skanda Muralidhar and Mathew Magimai-Doss, in: ACM International Conference on Multimodal Interaction, Bangalore, INDIA, pages 626-631, 2022

attachment

[DOI]

Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, S. Pavankumar Dubagunta, Edoardo Moneta, Eleni Theocharopoulos and Mathew Magimai-Doss, in: ACM International Conference on Multimodal Interaction (ICMI Companion), 2022

attachment

[DOI]

Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering, Eklavya Sarkar, RaviShankar Prasad and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2022

attachment

An Objective Evaluation Framework for Pathological Speech Synthesis, Bence Halpern, Julian Fritsch, Enno Hermann, Rob Van Son, Odette Scharenborg and Mathew Magimai-Doss, in: Proceedings of ITG Conference on Speech Communication, 2021

attachment

Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, Esaú Villatoro-Tello, Gabriela Ramírez-de-la-Rosa, Daniel Gatica-Perez, Mathew Magimai-Doss and Héctor Jiménez-Salazar, in: Proceedings of the 2021 International Conference on Multimodal Interaction, ACM, 2021

attachment

[DOI]

Fusion of Acoustic and Linguistic Information Using Supervised Autoencoder for Improved Emotion Recognition, Bogdan Vlasenko, RaviShankar Prasad and Mathew Magimai-Doss, in: 2nd Multimodal Sentiment Analysis Challenge (MuSe '21), October 24, 2021, Virtual Event, China, 2021

attachment

[DOI]

Handling acoustic variation in dysarthric speech recognition systems through model combination, Enno Hermann and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2021

attachment

Identification of F1 and F2 in speech using modified zero frequency filtering, RaviShankar Prasad and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2021

attachment

Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, Esaú Villatoro-Tello, S. Pavankumar Dubagunta, Julian Fritsch, Gabriela Ramírez-de-la-Rosa, Petr Motlicek and Mathew Magimai-Doss, in: Proceedings of Interspeech 2021, ISCA-International Speech Communication Association 2021, 2021

attachment

On Modeling Glottal Source Information for Phonation Assessment in Parkinson’s Disease, Juan Camilo Vasquez-Correa, Julian Fritsch, Juan Rafael Orozco-Arroyave, Elmar Nöth and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2021

attachment

On The Relationship Between Speech-based Breathing Signal Prediction Evaluation Measures And Breathing Parameters Estimation, Zohreh Mostaani, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, in: Proc. of ICASSP, 2021

attachment

Phoneme based Respiratory Analysis of Read Speech, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, in: Proceedings of European Signal Processing Conference (EUSIPCO), 2021

attachment

A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition, Nicholas Cummins, Yilin Pan, Zhao Ren, Julian Fritsch, Venkata Srikanth Nallanthighal, Heidi Christensen, Daniel Blackburn, Björn Schuller, Mathew Magimai-Doss, Helmer Strik and Aki Härmä, in: Proceedings of Interspeech, pages 2182-2186, 2020

attachment

| 1 | 2 | 3 | 4 | 5 | 6 | 7 |

processing time: 0.9711 seconds.