All publications sorted by author
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |
M
EEG pattern recognition through multi-stream evidence combination, , and , Idiap-RR-31-2001 |
|
Low cost duration modelling for noise robust speech recognition, , and , in: Proc. ICSLP, 2002 |
|
Low cost duration modelling for noise robust speech recognition, , and , Idiap-RR-08-2002 |
|
The High-Quality Wide Multi-Channel Attack (HQ-WMCA) database, , , , and , Idiap-RR-22-2020 |
|
On Breathing Pattern Information in Synthetic Speech, and , in: Proceedings of Interspeech, 2022 |
|
Estimating Breathing Pattern from Raw Speech Waveform and Short-term Speech Spectrum using Neural Networks, , , , and , Idiap-RR-12-2024 |
|
On The Relationship Between Speech-based Breathing Signal Prediction Evaluation Measures And Breathing Parameters Estimation, , , , and , in: Proc. of ICASSP, 2021 |
|
Modeling Of Pre-trained Neural Network Embeddings Learned From Raw Waveform For Covid-19 Infection Detection, , , and , in: Proceedings of ICASSP, 2022 |
|
Automatic Out-of-Language Detection based on Confidence Measures derived from LVCSR Word and Phone Lattices, , Idiap-RR-06-2009 |
|
Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices, , in: 10thAnnual Conference of the International Speech Communication Association, ISCA, Brighton, England, 2009 |
|
LP-TRAPs in all senses, , Idiap-RR-66-2007 |
|
EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, , , and , Idiap-RR-16-2015 |
|
EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, , , and , in: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Brisbane, Australia, pages 4445-4449, 2015 |
[URL] |
ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: Odyssey 2024: The Speaker and Language Recognition Workshop, pages 17-24, 2024 |
[DOI] [URL] |
Real-Time Audio-Visual Analysis for Multiperson Videoconferencing, , , , , , , , and , in: Advances in Multimedia, 2013:21, 2013 |
[DOI] [URL] |
Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, , , , and , Idiap-RR-18-2012 |
|
Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, , , , and , in: Proceedings of the 21st International Conference on Pattern Recognition, 2012 |
|
Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, , and , in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, ISCA 2009, 2009 |
|
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , and , Idiap-RR-32-2009 |
|
Entropy coding of Quantized Spectral Components in FDLP audio codec, , and , Idiap-RR-71-2008 |
|
Non-uniform QMF Decomposition for Wide-band Audio Coding based on Frequency Domain Linear Prediction, , , and , Idiap-RR-43-2007 |
|
Scalable Wide-band Audio Codec based on Frequency Domain Linear Prediction, , , and , Idiap-RR-16-2007 |
|
Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, , , , and , in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008 |
|
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , , and , in: EURASIP Journal on Audio Speech and Music Processing, 2010(856280), 2010 |
[DOI] [URL] |
AMIDA/Klewel Mini-Project, , , and , Idiap-RR-03-2010 |
|
Application of Subspace Gaussian Mixture Models in Contrastive Acoustic Scenarios, , , and , Idiap-RR-20-2012 |
|
ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, , , and , Idiap-RR-38-2013 |
|
ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, , , and , in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7170-7174, 2013 |
[DOI] |
Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding, , , and , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007 |
|
Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes, , , and , in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2007 |
|
Audio Coding Based on Long Temporal Contexts, , , and , Idiap-RR-30-2006 |
|
Speech Coding based on Spectral Dynamics, , , and , in: Ninth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2006 |
|
Speech Coding based on Spectral Dynamics, , , and , Idiap-RR-05-2006 |
|
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , Idiap-RR-01-2020 |
|
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019 |
[URL] |
Development of Bilingual ASR System for MediaParl Corpus, , , and , Idiap-RR-21-2014 |
|
Development of Bilingual ASR System for MediaParl Corpus, , , and , in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), Singapore, ISCA, 2014 |
|
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , Idiap-RR-39-2013 |
|
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013 |
|
Exploiting foreign resources for DNN-based ASR, , , , and , Idiap-RR-27-2015 |
|
Exploiting foreign resources for DNN-based ASR, , , , and , in: EURASIP Journal on Audio, Speech, and Music Processing(2015:17), 2015 |
[DOI] |
FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, , and , Idiap-RR-37-2013 |
|
FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, , and , in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7604-7608, 2013 |
[DOI] |
Automatic Speech Analysis Framework for ATC Communication in HAAWAII, , , , , and , in: 13th SESAR Innovation Days, 2023 |
|
Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, , and , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007 |
|
Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, , and , Idiap-RR-58-2006 |
|
Application of Out-Of-Language Detection To Spoken-Term Detection, and , Idiap-RR-04-2010 |
|
Application of Out-Of-Language Detection To Spoken-Term Detection, and , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
English Spoken Term Detection in Multilingual Recordings, , and , Idiap-RR-21-2010 |
|
English Spoken Term Detection in Multilingual Recordings, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010, ISCA, Makuhari, Japan, 2010 |
|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |