Petr Motlicek - Idiap Publications

Inter-task System Fusion for Speaker Recognition, Marc Ferras, Srikanth Madikeri, Subhadeep Dey, Petr Motlicek and Hervé Bourlard, in: Proceedings of INTERSPEECH, 2016

Investigating Cross-lingual Multi-level Adaptive Networks: The Importance of the Correlation of Source and Target Languages, Alexandros Lazaridis, Ivan Himawan, Petr Motlicek, Iosif Mporas and Philip N. Garner, in: Proceedings of the International Workshop on Spoken Language Translation, Seattle, WA, USA, 2016

System Fusion and Speaker Linking for Longitudinal Diarization of TV Shows, Marc Ferras, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5495-5499, IEEE, 2016

Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition, Ivan Himawan, Petr Motlicek, Sridha Sridharan, David Dean and Dian Tjondronegoro, in: Proceedings of Interspeech, pages 741-745, 2015

Combining SGMM Speaker Vectors and KL-HMM Approach for Speaker Diarization, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of ICASSP 2015, pages 4834-4837, 2015

Employment of Subspace Gaussian Mixture Models in Speaker Recognition, Petr Motlicek, Subhadeep Dey, Srikanth Madikeri and Lukas Burget, in: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Brisbane, Australia, pages 4445-4449, 2015

Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, Srikanth Madikeri, Ivan Himawan, Petr Motlicek and Marc Ferras, in: Proceedings of Interspeech 2015, pages 3105-3109, 2015

Learning Feature Mapping using Deep Neural Network Bottleneck Features for Distant Large Vocabulary Speech Recognition, Ivan Himawan, Petr Motlicek, David Imseng, Blaise Potard, Namhoon Kim and Jaewon Lee, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 4540-4544, 2015

Towards utterance-based neural network adaptation in acoustic modeling, Ivan Himawan, Petr Motlicek, Marc Ferras and Srikanth Madikeri, in: IEEE Automatic Speech Recognition and Understanding Workshop, pages 289-295, 2015

Development of Bilingual ASR System for MediaParl Corpus, Petr Motlicek, David Imseng, Milos Cernak and Namhoon Kim, in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), Singapore, ISCA, 2014

Exploiting un-transcribed foreign data for speech recognition in well-resourced languages, David Imseng, Blaise Potard, Petr Motlicek, Alexandre Nanchen and Hervé Bourlard, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, IT, pages 2322-2326, IEEE, 2014

Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation, Ngoc Thang Vu, David Imseng, Daniel Povey, Petr Motlicek, Tanja Schultz and Hervé Bourlard, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, pages 7639-7643, IEEE, 2014

Phoneme Background Model for Information Bottleneck based Speaker Diarization, Sree Harsha Yella, Petr Motlicek and Hervé Bourlard, in: Interspeech, Singapore, 2014

Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, Milos Cernak, Alexandros Lazaridis, Philip N. Garner and Petr Motlicek, in: Interspeech, 2014

The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues, Volha Petukhova, Martin Gropp, Dietrich Klakow, Anna Schmidt, Gregor Eigner, Mario Topf, Stefan Srb, Petr Motlicek, Blaise Potard, John Dines, O. Deroo, Ronny Egeler, Uwe Meinz and Steffen Liersch, in: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland, European Language Resources Association (ELRA), 2014

Accent Adaptation Using Subspace Gaussian Mixture Models, Petr Motlicek, Philip N. Garner, Namhoon Kim and Jeongmi Cho, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7170-7174, 2013

Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, in: Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), Lyon, France, pages 510-514, ISCA, 2013

Feature and Score Level Combination of Subspace Gaussians in LVCSR Task, Petr Motlicek, Daniel Povey and Martin Karafiat, in: The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, BC, Canada, pages 7604-7608, 2013

Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition, David Imseng, Petr Motlicek, Philip N. Garner and Hervé Bourlard, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2013

On the (Un)importance of the Contextual Factors In HMM-Based Speech Synthesis, Milos Cernak, Petr Motlicek and Philip N. Garner, in: Proceedings of the IEEE Intl. Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, pages 8140-8143, 2013

Annotation and Recognition of Personality Traits in Spoken Conversations from the AMI Meetings Corpus, Fabio Valente, Samuel Kim and Petr Motlicek, in: Proceedings of Interspeech 2012, 2012

Assessing the Impact of Language Style on Emergent Leadership Perception from Ubiquitous Audio, Dairazalia Sanchez-Cortes, Petr Motlicek and Daniel Gatica-Perez, in: Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia, Ulm, Germany, 2012

Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, Petr Motlicek, Laurent El Shafey, Roy Wallace, Chris McCool and Sébastien Marcel, in: Proceedings of the 21st International Conference on Pattern Recognition, 2012

Comparing different acoustic modeling techniques for multilingual boosting, David Imseng, John Dines, Petr Motlicek, Philip N. Garner and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012

Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, Gwénolé Lecorvé and Petr Motlicek, in: Proceedings of Interspeech, Portland, Oregon, USA, 2012

Detecting and Labeling Folk Literature in Spoken Cultural Heritage Archives using Structural and Prosodic Features, Fabio Valente and Petr Motlicek, in: IEEE Content Based Multimedia Indexing, 2012

Generating Exact Lattices in The WFST Framework, Daniel Povey, Mirko Hannemann, Gilles Boulianne, Lukas Burget, Arnab Ghoshal, Milos Janda, Martin Karafiat, Stefan Kombrink, Petr Motlicek, Yanmin Qian, Korbinian Riedhammer, Karel Vesely and Ngoc Thang Vu, in: Proceedings of the 37th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan, pages 4213-4216, IEEE Signal Processing Society, 2012

Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, in: Actes de la conference conjointe JEP-TALN-RECITAL 2012, Grenoble, France, pages 193-200, ATALA/AFCP, 2012

Improving Acoustic Based Keyword Spotting Using LVCSR Lattices, Petr Motlicek, Fabio Valente and Igor Szoke, in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, Japan, pages 4413-4416, 2012

Multimodal Cue Detection Engine for Orchestrated Entertainment, Danil Korchagin, Stefan Duffner, Petr Motlicek and Carl Scheffler, in: Proceedings International Conference on MultiMedia Modeling, Klagenfurt, Austria, 2012

Supervised and unsupervised Web-based language model domain adaptation, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, in: Proceedings of Interspeech, Portland, Oregon, USA, 2012

Just-in-Time Multimodal Association and Fusion from Home Entertainment, Danil Korchagin, Petr Motlicek, Stefan Duffner and Hervé Bourlard, in: Proceedings IEEE International Conference on Multimedia & Expo, Barcelona, Spain, 2011

Multistream Speaker Diarization through Information Bottleneck System Outputs Combination, Deepu Vijayasenan, Fabio Valente and Petr Motlicek, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2011

Speaker Diarization of Meetings based on Speaker Role N-gram Models, Fabio Valente, Deepu Vijayasenan and Petr Motlicek, in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011

The Kaldi Speech Recognition Toolkit, Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer and Karel Vesely, in: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, Hilton Waikoloa Village, Big Island, Hawaii, US, IEEE Signal Processing Society, 2011

The TA2 Database - A Multi-Modal Database from Home Entertainment, Stefan Duffner, Petr Motlicek and Danil Korchagin, in: International Conference on Signal Acquisition and Processing, Singapore, 2011

Application of Out-Of-Language Detection To Spoken-Term Detection, Petr Motlicek and Fabio Valente, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

English Spoken Term Detection in Multilingual Recordings, Petr Motlicek, Fabio Valente and Philip N. Garner, in: Proceedings of Interspeech, Makuhari, Japan, ISCA, 2010

Hands Free Audio Analysis from Home Entertainment, Danil Korchagin, Philip N. Garner and Petr Motlicek, in: Proceedings of Interspeech, Makuhari, Japan, 2010

Variational Bayesian Speaker Diarization of Meeting Recordings, Fabio Valente, Petr Motlicek and Deepu Vijayasenan, in: Proceedings of ICASSP, 2010

Applications of Signal Analysis Using Autoregressive Models for Amplitude Modulation, Sriram Ganapathy, Samuel Thomas, Petr Motlicek and Hynek Hermansky, in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA '09), IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009

Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, 2009

Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Pilsen, Czech Republic, Springer-Verlag, Berlin Heidelberg, 2009

MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, in: Audio Engineering Society (AES) 127th Convention, Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA, 2009

Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, in: AES 124th Convention, Audio Engineering Society, 2008

Exploiting Contextual Information for Speech/Non-Speech Detection, Sree Hari Krishnan Parthasarathi, Petr Motlicek and Hynek Hermansky, in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008

Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky, Harinath Garudadri and Marios Athineos, in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008

Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, in: INTERSPEECH 2008, 2008