logo Idiap Research Institute        
speech recognition

Related keywords:



Publications for keyword "speech recognition"
2024
Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers, Shashi Kumar, Srikanth Madikeri, Nigmatulina Iuliia, Esaú Villatoro-Tello, Petr Motlicek, Karthik Pandia D S, S. Pavankumar Dubagunta and Aravind Ganapathiraju, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Republic of Korea, pages 12592-12596, IEEE, 2024
[DOI]
[URL]
TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Iuliia Thorbecke, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 20988–20995, Association for Computational Linguistics (ACL), 2024
attachment
[URL]
2023
2022
Efficient Transformer-Based Speech Recognition, Apoorv Vyas, École polytechnique fédérale de Lausanne, 2022
attachment
[DOI]
2021
2020
2019
Segment-level training of ANNs based on acoustic confidence measures for hybrid HMM/ANN Speech Recognition, S. Pavankumar Dubagunta and Mathew Magimai.-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
attachment
2018
2014
2012
Robust triphone mapping for acoustic modeling, Milos Cernak, David Imseng and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012
attachment
2011
Model-based Compressive Sensing for Multi-party Distant Speech Recognition, Afsaneh Asaei, Hervé Bourlard and Volkan Cevher, in: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing, 2011
attachment
Posterior Features for Template-based ASR, Serena Soldo, Mathew Magimai.-Doss, Joel Praveen Pinto and Hervé Bourlard, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Prague, Czech Republic, 2011
attachment
2009
Measuring the gap between HMM-based ASR and TTS, John Dines, Junichi Yamagishi and Simon King, in: Proceedings of Interspeech, Brighton, U.K., 2009
attachment
2006
Ensembles for Sequence Learning, Christos Dimitrakakis, École Polytechnique Fédérale de Lausanne, 2006
attachment
2002