Update cookies preferences
 logo Idiap Research Institute        
All publications
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 |

Performance Evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward, Shashi Kumar, Iuliia Thorbecke, Sergio Burdisso, Esaú Villatoro-Tello, Manjunath K E, Kadri Hacioğlu, Pradeep Rangappa, Petr Motlicek, Aravind Ganapathiraju and Andreas Stolcke, in: SALMA Workshop, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025
attachment
XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Iuliia Thorbecke, Petr Motlicek, Manjunath K E and Aravind Ganapathiraju, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025
attachment
Emotion information recovery potential of wav2vec2 network fine-tuned for speech recognition task, Tilak Purohit and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025
attachment
Automatic Parkinson’s disease detection from speech: Layer selection vs adaptation of foundation models, Tilak Purohit, Barbara Ruvolo, Juan Rafael Orozco-Arroyave and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025
attachment
kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech, Karl El Hajal, Ajinkya Kulkarni, Enno Hermann and Mathew Magimai-Doss, in: Proceedings of the Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), Albuquerque, New Mexico, ACL, 2025
attachment
[URL]
Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR, Karl El Hajal, Enno Hermann, Ajinkya Kulkarni and Mathew Magimai-Doss, in: Proceedings of Workshop on Speech Pathology Analysis and DEtection (SPADE), Hyderabad, India, IEEE, 2025
attachment
[URL]
Towards Dynamic Skeleton-based Handshape Subunits for Sign Language Assessment, Sandrine Tornay and Mathew Magimai-Doss, in: Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025
attachment
Digi2Real: Bridging the Realism Gap in Synthetic Data Face Recognition via Foundation Models, Anjith George and Sébastien Marcel, in: 2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW), 2025
attachment
A Bayesian Interpretation of Adaptive Low-Rank Adaptation, Haolin Chen and Philip N. Garner, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
attachment
Intuitive Robot Programming, C. Blanc, Julius Jankowski, A. Sonderegger, Sylvain Calinon and S. Dégallier Rochat, in: Ergonomics in Robotics: Advances and Innovations, Springer, 2025
Identifying Privacy Personas, Olena Hrynenko and Andrea Cavallaro, in: Proceedings on Privacy Enhancing Technologies, 2025
attachment
TESS: Text-to-text selfconditioned simplex diffusion, Rabeeh Karimi Mahabadi, Hamish Ivison, Jaesung Tae, James Henderson, Iz Beltagy, Matthew Peters and Arman Cohan, in: Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2347–2361, Association for Computational Linguistics, 2024
Deep Clustering for Data Cleaning and Integration, Hafiz Rauf, Andre Freitas and Norman Paton, in: 27th International Conference on Extending Database Technology, 2024
Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks, Yingji Zhang, Danilo Carvalho and Andre Freitas, in: The 62nd Annual Meeting of the Association for Computational Linguistics, 2024
Graph Neural Flows for Unveiling Systemic Interactions Among Irregularly Sampled Time Series, Giangiacomo Mercatali, Andre Freitas and Jie Chen, in: Thirty-Eighth Annual Conference on Neural Information Processing Systems, 2024
Diffusion Twigs with Loop Guidance for Conditional Graph Generation, Giangiacomo Mercatali, Yogesh Verma, Andre Freitas and Vikas Garg, in: Thirty-Eighth Annual Conference on Neural Information Processing Systems, 2024
Consistent Autoformalization for Constructing Mathematical Libraries, Lan Zhang, Xin Quan and Andre Freitas, in: The 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Tactile Ergodic Coverage on Curved Surfaces, Cem Bilaloglu, Tobias Löw and Sylvain Calinon, in: IEEE Transactions on Robotics (T-RO), 2024
attachment
Toward Semantic Gaze Target Detection, Samy Tafasca, Anshul Gupta, Victor Bros and Jean-Marc Odobez, in: 38th Conf. on Neural Information Processing System, 2024
attachment
Reasoning with Natural Language Explanations, Marco Valentino and Andre Freitas, in: In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Tutorial Abstracts, 2024
SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials, Mael Jullien, Marco Valentino and Andre Freitas, in: In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), 2024
An LLM-based Knowledge Synthesis and Scientific Reasoning Framework for Biomedical Discovery, Oskar Wysocki, Magdalena Wysocka, Danilo Carvalho, Alex Bogatu, Danilo Gusicuma, Maxime Delmas, Harriet Unsworth and Andre Freitas, in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL, Bangkok, Thailand, pages 355-364, 2024
[DOI]
[URL]
Multi-Operational Mathematical Derivations in Latent Space, Marco Valentino, Jordan Meadows, Lan Zhang and Andre Freitas, in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Inference to the Best Explanation in Large Language Models, Dhairya Dalal, Marco Valentino, Andre Freitas and Paul Buitelaar, in: In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 |