logo Idiap Research Institute        
All publications sorted by author
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 |


M

ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, Petr Motlicek, Philip N. Garner, Namhoon Kim and Jeongmi Cho, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7170-7174, 2013
attachment
[DOI]
Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding, Petr Motlicek, Hynek Hermansky, Sriram Ganapathy and Harinath Garudadri, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2007
attachment
Speech Coding based on Spectral Dynamics, Petr Motlicek, Hynek Hermansky, Harinath Garudadri and Naveen Srinivasamurthy, in: Ninth International Conference on TEXT, SPEECH and DIALOGUE (TSD), 2006
attachment
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, Petr Motlicek, Hynek Hermansky, Srikanth Madikeri, Amrutha Prasad and Sriram Ganapathy, in: 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Universita Degli Studi Firenze, Firenze, Italy, 2019
attachment
[URL]
Development of Bilingual ASR System for MediaParl Corpus, Petr Motlicek, David Imseng, Milos Cernak and Namhoon Kim, in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), Singapore, ISCA, 2014
attachment
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), ISCA - International Speech Communication Association, Lyon, France, pages 510-514, ISCA, 2013
attachment
Exploiting foreign resources for DNN-based ASR, Petr Motlicek, David Imseng, Blaise Potard, Philip N. Garner and Ivan Himawan, in: EURASIP Journal on Audio, Speech, and Music Processing(2015:17), 2015
attachment
[DOI]
FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, Petr Motlicek, Daniel Povey and Martin Karafiat, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7604-7608, 2013
attachment
[DOI]
Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, Petr Motlicek, Vijay Ullal and Hynek Hermansky, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007
attachment
Application of Out-Of-Language Detection To Spoken-Term Detection, Petr Motlicek and Fabio Valente, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010
attachment
English Spoken Term Detection in Multilingual Recordings, Petr Motlicek, Fabio Valente and Philip N. Garner, in: Proceedings of Interspeech, Makuhari, Japan, 2010, ISCA, Makuhari, Japan, 2010
attachment
IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, Petr Motlicek, Fabio Valente and Igor Szoke, in: Proceedings on IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, Japan, pages 4413-4416, 2012
Evolution of the Mental States Operating a Brain-Computer Interface, J. Mouriño, Silvia Chiappa, R. Jané and José del R. Millán, in: Proceedings of the International Federation for Medical and Biological Engineering, 2002
attachment
Long-Term Spectral Statistics for Voice Presentation Attack Detection, Hannah Muckenhirn, Pavel Korshunov, Mathew Magimai.-Doss and Sébastien Marcel, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, 25(11):2098-2111, 2017
attachment
Towards directly modeling raw speech signal for speaker verification using CNNs, Hannah Muckenhirn, Mathew Magimai.-Doss and Sébastien Marcel, in: IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, CANADA, pages 4884-4888, 2018
attachment
End-to-End Convolutional Neural Network-based Voice Presentation Attack Detection, Hannah Muckenhirn, Mathew Magimai.-Doss and Sébastien Marcel, in: International Joint Conference on Biometrics, Denver, Colorado, USA, 2017
attachment
On Job Training: Automated Interpersonal Behavior Assessment & Real-Time Feedback, Skanda Muralidhar, in: Proceedings of the 25th ACM International Conference on Multimedia, 2017, 2017
attachment
Dites-Moi: Wearable Feedback on Conversational Behavior, Skanda Muralidhar, Jean M R Costa, Laurent Son Nguyen and Daniel Gatica-Perez, in: Proceedings of the 15th International Conference on Mobile and Ubiquitous Multimedia, 2016
attachment
Examining Linguistic Content and Skill Impression Structure for Job Interview Analytics in Hospitality, Skanda Muralidhar and Daniel Gatica-Perez, in: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017
attachment
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 |