Mathew Magimai-Doss - Idiap Publications

First name(s):	Mathew
Last name(s):	Magimai-Doss
Email:	mathew@idiap.ch

On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR, Mathew Magimai-Doss, Guillermo Aradilla and Hervé Bourlard, Idiap-RR-24-2009

Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition, Mathew Magimai-Doss, Samy Bengio and Hervé Bourlard, in: Proceedings of ICASSP, 2004

Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition, Mathew Magimai-Doss, Samy Bengio and Hervé Bourlard, Idiap-RR-52-2003

On the Adequacy of Baseform Pronunciations and Pronunciation Variants, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-27-2004

Pronunciation models and their evaluation using confidence measures, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-29-2001

Speech Processing, Mathew Magimai-Doss, in: Interactive Multimodal Information Management, pages 221-245, EPFL Press, 2013

Improving Continuous Speech Recognition System Performance with Grapheme Modelling, Mathew Magimai-Doss, John Dines, Hervé Bourlard and Hynek Hermansky, Idiap-RR-16-2005

Phoneme vs Grapheme Based Automatic Speech Recognition, Mathew Magimai-Doss, John Dines, Hervé Bourlard and Hynek Hermansky, Idiap-RR-48-2004

On Learning Grapheme-to-Phoneme Relationships through the Acoustic Speech Signal, Mathew Magimai-Doss and Ramya Rasipuram, in: The Phonetician, 109–110:6-23, 2014

Grapheme-based Automatic Speech Recognition using KL-HMM, Mathew Magimai-Doss, Ramya Rasipuram, Guillermo Aradilla and Hervé Bourlard, in: Proceedings of Interspeech, 2011

Using pitch frequency information in speech recognition, Mathew Magimai-Doss, Todd Andrew Stephenson and Hervé Bourlard, in: Proceedings of Eurospeech, 2003

Using pitch frequency information in speech recognition, Mathew Magimai-Doss, Todd Andrew Stephenson and Hervé Bourlard, Idiap-RR-23-2003

Modelling auxiliary information (pitch frequency) in hybrid HMM/ANN based ASR systems, Mathew Magimai-Doss, Todd Andrew Stephenson and Hervé Bourlard, Idiap-RR-62-2002

Phoneme-Grapheme Based Speech Recognition System, Mathew Magimai-Doss, Todd Andrew Stephenson, Hervé Bourlard and Samy Bengio, in: Proceedings of IEEE ASRU, 2003

Phoneme-Grapheme Based Speech Recognition System, Mathew Magimai-Doss, Todd Andrew Stephenson, Hervé Bourlard and Samy Bengio, Idiap-RR-37-2003

Modelling Auxiliary Features in Tandem Systems, Mathew Magimai-Doss, Todd Andrew Stephenson, Shajith Ikbal and Hervé Bourlard, in: Proceedings of ICSLP, 2004

Modelling Auxiliary Features in Tandem Systems, Mathew Magimai-Doss, Todd Andrew Stephenson, Shajith Ikbal and Hervé Bourlard, Idiap-RR-21-2004

Juicer: A Weighted Finite-State Transducer speech decoder, Darren Moore, John Dines, Mathew Magimai-Doss, Jithendra Vepa, Octavian Cheng and Thomas Hain, in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI'06), 2006

Juicer: A Weighted Finite-State Transducer speech decoder, Darren Moore, John Dines, Mathew Magimai-Doss, Jithendra Vepa, Octavian Cheng and Thomas Hain, Idiap-RR-21-2006

On Breathing Pattern Information in Synthetic Speech, Zohreh Mostaani and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2022

Estimating Breathing Pattern from Raw Speech Waveform and Short-term Speech Spectrum using Neural Networks, Zohreh Mostaani, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, Idiap-RR-12-2024

On The Relationship Between Speech-based Breathing Signal Prediction Evaluation Measures And Breathing Parameters Estimation, Zohreh Mostaani, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, in: Proceedings of ICASSP, 2021

Modeling Of Pre-trained Neural Network Embeddings Learned From Raw Waveform For Covid-19 Infection Detection, Zohreh Mostaani, RaviShankar Prasad, Bogdan Vlasenko and Mathew Magimai-Doss, in: Proceedings of ICASSP, 2022

Understanding and Visualizing Raw Waveform-based CNNs, Hannah Muckenhirn, Vinayak Abrol, Mathew Magimai-Doss and Sébastien Marcel, in: Proceedings of Interspeech, 2019

Gradient-based spectral visualization of CNNs using raw waveforms, Hannah Muckenhirn, Vinayak Abrol, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-11-2018

Long Term Spectral Statistics for Voice Presentation Attack Detection, Hannah Muckenhirn, Pavel Korshunov, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-11-2017

Long-Term Spectral Statistics for Voice Presentation Attack Detection, Hannah Muckenhirn, Pavel Korshunov, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, 25(11):2098-2111, 2017

Towards directly modeling raw speech signal for speaker verification using CNNs, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, Canada, pages 4884-4888, 2018

On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: Proceedings of Interspeech, Hyderabad, India, pages 1116-1120, 2018

End-to-End Convolutional Neural Network-based Voice Presentation Attack Detection, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: International Joint Conference on Biometrics, Denver, Colorado, USA, 2017

Towards directly modeling raw speech signal for speaker verification using CNNs, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-30-2017

Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: International Conference of the Biometrics Special Interest Group (BIOSIG), 2016

Towards interfacing large language models with ASR systems using confidence measures and prompting, Maryam Naderi, Enno Hermann, Alexandre Nanchen, Sevada Hovsepyan and Mathew Magimai-Doss, in: Proceedings of Interspeech, pages 2980-2984, 2024

Phoneme based Respiratory Analysis of Read Speech, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, in: Proceedings of European Signal Processing Conference (EUSIPCO), 2021

Deep learning architectures for estimating breathing signal and respiratory parameters from speech recordings, Venkata Srikanth Nallanthighal, Zohreh Mostaani, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, in: Neural Networks, 141:211-224, 2021

A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, Youssef Oualil, Friedrich Faubel, Mathew Magimai-Doss and Dietrich Klakow, Idiap-RR-10-2012

A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, Youssef Oualil, Friedrich Faubel, Mathew Magimai-Doss and Dietrich Klakow, in: 20th European Signal Processing Conference, 2012

A Probabilistic Framework for Multiple Speaker Localization, Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel and Dietrich Klakow, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013

Joint Detection and Localization of Multiple Speakers using a Probabilistic Interpretation of the Steered Response Power, Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel and Dietrich Klakow, in: Statistical and Perceptual Audition Workshop, 2012

A Probabilistic Framework for Multiple Speaker Localization, Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel and Dietrich Klakow, Idiap-RR-37-2012
