Petr Motlicek - Idiap Publications

Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, Gwénolé Lecorvé and Petr Motlicek, Idiap-RR-21-2012

Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, Gwénolé Lecorvé and Petr Motlicek, in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012

Domain-specific language model adaptation: a case study, Gwénolé Lecorvé, Petr Motlicek and John Dines, Idiap-Com-01-2013

IDIAP SUBMISSION TO THE NIST SRE 2016 SPEAKER RECOGNITION EVALUATION, Srikanth Madikeri, Subhadeep Dey, Marc Ferras, Petr Motlicek and Ivan Himawan, Idiap-RR-32-2016

A BAYESIAN APPROACH TO INTER-TASK FUSION FOR SPEAKER RECOGNITION, Srikanth Madikeri, Subhadeep Dey and Petr Motlicek, Idiap-RR-07-2020

A BAYESIAN APPROACH TO INTER-TASK FUSION FOR SPEAKER RECOGNITION, Srikanth Madikeri, Subhadeep Dey and Petr Motlicek, in: Proceedings of ICASSP 2019, Brighton, England, pages 5786-5790, 2019

Analysis of Language Dependent Front-End for Speaker Recognition, Srikanth Madikeri, Subhadeep Dey and Petr Motlicek, in: Proceedings of Interspeech 2018, Hyderabad, India, pages 1101-1105, 2018

Implementation of the Standard I-vector System for the Kaldi Speech Recognition Toolkit, Srikanth Madikeri, Subhadeep Dey, Petr Motlicek and Marc Ferras, Idiap-RR-26-2016

INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION, Srikanth Madikeri, Marc Ferras, Petr Motlicek and Subhadeep Dey, Idiap-RR-05-2017

INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION, Srikanth Madikeri, Marc Ferras, Petr Motlicek and Subhadeep Dey, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, pages 5365-5369, 2017

Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, Srikanth Madikeri, Ivan Himawan, Petr Motlicek and Marc Ferras, Idiap-RR-20-2015

Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, Srikanth Madikeri, Ivan Himawan, Petr Motlicek and Marc Ferras, in: Proceedings of Interspeech 2015, pages 3105-3109, 2015

Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition System, Srikanth Madikeri, Banriskhem Khonglah, Sibo Tong, Petr Motlicek, Hervé Bourlard and Daniel Povey, Idiap-RR-28-2020

Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition System, Srikanth Madikeri, Banriskhem Khonglah, Sibo Tong, Petr Motlicek, Hervé Bourlard and Daniel Povey, in: Proceedings of Interspeech 2020, pages 4746-4750, ISCA, 2020

Multitask adaptation with Lattice-Free MMI for multi-genre speech recognition of low resource languages, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of Interspeech 2021, 2021

COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, Idiap-RR-17-2015

COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, in: Proceedings of ICASSP 2015, pages 4834-4837, 2015

Analysis of Posterior Estimation Approaches to I-vector Extraction for Speaker Recognition, Srikanth Madikeri, Petr Motlicek, Marc Ferras and Subhadeep Dey, Idiap-RR-15-2018

Autocrime - open multimodal platform for combating organized crime, Srikanth Madikeri, Petr Motlicek, Dairazalia Sanchez-Cortes, Pradeep Rangappa, Joshua Hughes, Jacob Tkaczuk, Alejandra Sanchez Lara, Driss Khalil, Johan Rohdin, Dawei Zhu, Aravind Krishnan, Dietrich Klakow, Zahra Ahmadi, Marek Kovac, Dominik Boboš, Costas Kalogiros, Andreas Alexopoulos and Denis Marraud, in: Forensic Science International: Digital Investigation, 54, 2025

Idiap submission to the NIST SRE 2018 Speaker Recognition Evaluation, Srikanth Madikeri, Seyyed Saeed Sarfjoo, Petr Motlicek and Sébastien Marcel, Idiap-RR-17-2019

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, Hari Krishna Maganti, Petr Motlicek and Daniel Gatica-Perez, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, Hari Krishna Maganti, Petr Motlicek and Daniel Gatica-Perez, Idiap-RR-57-2006

HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition, Florian Mai, Juan Zuluaga-Gomez, Titouan Parcollet and Petr Motlicek, in: Proc. Interspeech 2023, Ireland, 2023

SPOKEN LANGUAGE IDENTIFICATION USING LANGUAGE BOTTLENECK FEATURES, Grisard Malo, Petr Motlicek, Wissem Allouchi, Michael Baeriswyl, Alexandros Lazaridis and Qingran Zhan, Idiap-RR-08-2019

Automatic Out-of-Language Detection based on Confidence Measures derived from LVCSR Word and Phone Lattices, Petr Motlicek, Idiap-RR-06-2009

Automatic Out-of-Language Detection Based on Confidence Measures Derived from LVCSR Word and Phone Lattices, Petr Motlicek, in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, 2009

LP-TRAPs in all senses, Petr Motlicek, Idiap-RR-66-2007

EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, Petr Motlicek, Subhadeep Dey, Srikanth Madikeri and Lukas Burget, Idiap-RR-16-2015

EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, Petr Motlicek, Subhadeep Dey, Srikanth Madikeri and Lukas Burget, in: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Brisbane, Australia, pages 4445-4449, 2015

ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations, Petr Motlicek, Erinc Dikici, Srikanth Madikeri, Pradeep Rangappa, Miroslav Janosik, Gerhard Backfried, Dorothea Thomas-Aniola, Maximilian Schurz, Johan Rohdin, Petr Schwarz, Marek Kovac, Květoslav Malý, Dominik Boboš, Mathias Leibiger, Costas Kalogiros, Andreas Alexopoulos, Daniel Kudenko, Zahra Ahmadi, Hoang H. Nguyen, Aravind Krishnan, Dawei Zhu, Dietrich Klakow, Maria Jofre, Francesco Calderoni, Denis Marraud, Nikolaos Koutras, Nikos Nikolau, Christiana Apostiki, Panagiotis Douris, Konstantinos Gkountas, Eleni Sergidou, Wauter Bosma, Joshua Hughues and Hellenic Police Team, in: Odyssey 2024: The Speaker and Language Recognition Workshop, pages 17-24, 2024

Real-Time Audio-Visual Analysis for Multiperson Videoconferencing, Petr Motlicek, Stefan Duffner, Danil Korchagin, Hervé Bourlard, Carl Scheffler, Jean-Marc Odobez, Giovanni Del Galdo, Markus Kallinger and Oliver Thiergart, in: Advances in Multimedia, 2013:21, 2013

Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, Petr Motlicek, Laurent El Shafey, Roy Wallace, Chris McCool and Sébastien Marcel, Idiap-RR-18-2012

Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, Petr Motlicek, Laurent El Shafey, Roy Wallace, Chris McCool and Sébastien Marcel, in: Proceedings of the 21st International Conference on Pattern Recognition, 2012

Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, in: 10th Annual Conference of the International Speech Communication Association, ISCA, Brighton, England, 2009

Wide-Band Audio Coding based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-32-2009

Entropy coding of Quantized Spectral Components in FDLP audio codec, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-71-2008

Non-uniform QMF Decomposition for Wide-band Audio Coding based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky and Harinath Garudadri, Idiap-RR-43-2007

Scalable Wide-band Audio Codec based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky and Harinath Garudadri, Idiap-RR-16-2007

Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky, Harinath Garudadri and Marios Athineos, in: Text, Speech and Dialogue, Brno, Czech Republic, Springer-Verlag Berlin, Heidelberg, 2008