Publication list - Idiap Publications

Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity, Mutian He and Philip N. Garner, in: 13th International Conference on Learning Representations (ICLR), 2025

Enhancing 3D Gaze Estimation in the Wild using Weak Supervision with Gaze Following Labels, Pierre Vuillecard and Jean-Marc Odobez, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025

Multidisciplinary characterization of embarrassment through behavioral and acoustic modeling, Dajana Šipka, Bogdan Vlasenko, Maria Stein, Thomas Dierks, Mathew Magimai-Doss and Yosuke Morishima, in: Scientific reports, 2025

Inferring Mood-While-Eating with Smartphone Sensing and Community-Based Model Personalization, Wageesha Bangamuarachchi, Anju Chamantha, Lakmal Buddika Meegahapola, Haeeun Kim, Salvador Ruiz-Correa, Indika Perera and Daniel Gatica-Perez, in: ACM Transactions on Computing for Healthcare, 2025

DiversityOne: A Multi-Country Smartphone Sensor Dataset for Everyday Life Behavior Modeling, Matteo Busso, Andrea Bontempelli, Leonardo Javier Malcotti, Lakmal Buddika Meegahapola, Peter Kun, Shyam Diwakar, Chaitanya Nuttakki, Marcelo Rodas Britez, Hao Xu, Donglei Song, Salvador Ruiz-Correa, Andrea Mendoza-Lara, George Gaskell, Sally Stares, Miriam Bidoglia, Amarsanaa Ganbold, Altangerel Chagnaa, Luca Cernuzzi, Alethia Hume, Ronald Chenu-Abente, Roy Alia Asiku, Ivan Kayongo, Daniel Gatica-Perez, Amalia de Götzen, Ivano Bison and Fausto Giunchiglia, in: ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 9(1), 2025

HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere, Hatef Otroshi Shahreza and Sébastien Marcel, in: The Thirteenth International Conference on Learning Representations, 2025

Performance Evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward, Shashi Kumar, Iuliia Thorbecke, Sergio Burdisso, Esaú Villatoro-Tello, Manjunath K E, Kadri Hacioğlu, Pradeep Rangappa, Petr Motlicek, Aravind Ganapathiraju and Andreas Stolcke, in: SALMA Workshop, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Iuliia Thorbecke, Petr Motlicek, Manjunath K E and Aravind Ganapathiraju, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

Review of Demographic Bias in Face Recognition, Ketan Kotwal and Sébastien Marcel, Idiap-RR-01-2025

Emotion information recovery potential of wav2vec2 network fine-tuned for speech recognition task, Tilak Purohit and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

Automatic Parkinson’s disease detection from speech: Layer selection vs adaptation of foundation models, Tilak Purohit, Barbara Ruvolo, Juan Rafael Orozco-Arroyave and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

Exploring the Complexity of Parkinson’s Patient Speech for Depression Detection task: A Qualitative Analysis, Barbara Ruvolo, Tilak Purohit, Bogdan Vlasenko, Juan Rafael Orozco-Arroyave and Mathew Magimai-Doss, in: Proceedings of Workshop on Speech Pathology Analysis and DEtection (SPADE), Hyderabad, India, IEEE, 2025

kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech, Karl El Hajal, Ajinkya Kulkarni, Enno Hermann and Mathew Magimai-Doss, in: Proceedings of the Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), Albuquerque, New Mexico, ACL, 2025

Speech Data Selection for Efficient ASR Fine-Tuning using Domain Classifier and Pseudo-Label Filtering, Pradeep Rangappa, Juan Zuluaga-Gomez, Srikanth Madikeri, Andrés Carofilis, Jeena Prakash, Sergio Burdisso, Shashi Kumar, Esaú Villatoro-Tello, Iuliia Nigmatulina, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), 2025

Posterior-based analysis of spatio-temporal features for Sign Language Assessment, Neha Tarigopula, Sandrine Tornay, Ozge Mercanoglu Sincan, Richard Bowden and Mathew Magimai-Doss, in: IEEE Open Journal of Signal Processing, 2025

Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR, Karl El Hajal, Enno Hermann, Ajinkya Kulkarni and Mathew Magimai-Doss, in: Proceedings of Workshop on Speech Pathology Analysis and DEtection (SPADE), Hyderabad, India, IEEE, 2025

Towards Dynamic Skeleton-based Handshape Subunits for Sign Language Assessment, Sandrine Tornay and Mathew Magimai-Doss, in: Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025

Digi2Real: Bridging the Realism Gap in Synthetic Data Face Recognition via Foundation Models, Anjith George and Sébastien Marcel, in: 2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW), 2025

Exploring ChatGPT for Face Presentation Attack Detection in Zero and Few-Shot in-Context Learning, Alain Komaty, Hatef Otroshi Shahreza, Anjith George and Sébastien Marcel, in: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025

Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing, Eklavya Sarkar and Mathew Magimai-Doss, in: International Conference on Acoustics, Speech and Signal Processing, 2025

Minimum effort adaptation of automatic speech recognition system in air traffic management, Mrinmoy Bhattacharjee, Petr Motlicek, Srikanth Madikeri, Hartmut Helmke, Oliver Ohneiser, Matthias Kleinert and Heiko Ehr, in: European Journal of Transport and Infrastructure Research, 24(4 (2024)):133–153, 2025

A Bayesian Interpretation of Adaptive Low-Rank Adaptation, Haolin Chen and Philip N. Garner, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

Montague semantics and modifier consistency measurement in neural language models, Danilo Carvalho, Edoardo Manino, Julia Rozanova, Lucas Cordeiro and Andre Freitas, in: 31st International Conference on Computational Linguistics, 2025

Loose Social-Interaction Recognition in Real-world Therapy Scenarios, Abid Ali, Rui Dai, Ashish Marisetty, Guillaume Astruc, Monique Thonnat, Jean-Marc Odobez, Suzanne Thümmler and Francois Bremond, in: IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Intuitive Robot Programming, C. Blanc, Julius Jankowski, A. Sonderegger, Sylvain Calinon and S. Dégallier Rochat, in: Ergonomics in Robotics: Advances and Innovations, Springer, 2025

Identifying Privacy Personas, Olena Hrynenko and Andrea Cavallaro, in: Proceedings on Privacy Enhancing Technologies, 2025

ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications, Juan Zuluaga-Gomez, Karel Vesely, Igor Szoke, Alexander Blatt, Petr Motlicek, Martin Kocour, Khalid Choukri, Iuliia Nigmatulina, Claudia Cevenini, Allan Tart, Jan Cernocky and Dietrich Klakow, in: Journal of Data-centric Machine Learning Research, 2024

Sparse Optical Sampling in the Close Proximity of a Robotic Arm, Martin Laurenzis, Ante Marić, Emmanuel Bacher, Mateusz Pietrzak, Stéphane Schertzer, Francesco Grella and Sylvain Calinon, in: Springer Proceedings in Advanced Robotics, 2024

TESS: Text-to-text self-conditioned simplex diffusion, Rabeeh Karimi Mahabadi, Hamish Ivison, Jaesung Tae, James Henderson, Iz Beltagy, Matthew Peters and Arman Cohan, in: Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2347–2361, Association for Computational Linguistics, 2024

Formal Semantic Controls over Language Models, Danilo Carvalho, Yingji Zhang and Andre Freitas, in: LREC-COLING, 2024

Empowering Cross-lingual Abilities of Instruction-tuned Large Language Models by Translation-following demonstrations, Leonardo Ranaldi, Giulia Pucci and Andre Freitas, in: Findings of the ACL, 2024

Deep Clustering for Data Cleaning and Integration, Hafiz Rauf, Andre Freitas and Norman Paton, in: 27th International Conference on Extending Database Technology, 2024

Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks, Yingji Zhang, Danilo Carvalho and Andre Freitas, in: The 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Graph Neural Flows for Unveiling Systemic Interactions Among Irregularly Sampled Time Series, Giangiacomo Mercatali, Andre Freitas and Jie Chen, in: Thirty-Eighth Annual Conference on Neural Information Processing Systems, 2024

Diffusion Twigs with Loop Guidance for Conditional Graph Generation, Giangiacomo Mercatali, Yogesh Verma, Andre Freitas and Vikas Garg, in: Thirty-Eighth Annual Conference on Neural Information Processing Systems, 2024

Exploring the Limits of Fine-grained LLM-based Physics Inference via Premise Removal Interventions, Jordan Meadows, Tamsin James and Andre Freitas, in: Findings of EMNLP, 2024

Consistent Autoformalization for Constructing Mathematical Libraries, Lan Zhang, Xin Quan and Andre Freitas, in: The 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Tactile Ergodic Coverage on Curved Surfaces, Cem Bilaloglu, Tobias Löw and Sylvain Calinon, in: IEEE Transactions on Robotics (T-RO), 2024

Temporal fine-tuning for early risk detection, Horacio Thompson, Esaú Villatoro-Tello, Manuel Montes-y-Gómez and Marcelo Errecalde, in: Memorias De Las JAIIO, Argentina, pages 137-149, 2024

Natural Language Understanding for Navigation of Service Robots in Low-Resource Domains and Languages: Scenarios in Spanish and Nahuatl, Amadeo Hernández, Rosa M. Ortega-Mendoza, Esaú Villatoro-Tello, César Joel Camacho-Bello and Obed Pérez-Cortés, in: Mathematics, 12(8), 2024

HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere, Hatef Otroshi Shahreza and Sébastien Marcel, in: NeurIPS Safe Generative AI Workshop 2024, 2024

MTGS: A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze Prediction, Anshul Gupta, Samy Tafasca, Arya Farkhondeh, Pierre Vuillecard and Jean-Marc Odobez, in: 38th Conf. on Neural Information Processing Systems, 2024

Toward Semantic Gaze Target Detection, Samy Tafasca, Anshul Gupta, Victor Bros and Jean-Marc Odobez, in: 38th Conf. on Neural Information Processing Systems, 2024

Weakly-supervised Autism Severity Assessment in Long Videos, Abid Ali, Mahmoud Ali, Camilla Barbini, Séverine Dubuisson, Jean-Marc Odobez, Francois Bremond and Suzanne Thümmler, in: International Conference on Content-based Multimedia Indexing, 2024

Automatic detection of the visual gaze components of joint attention in observational, naturalistic child language acquisition data, Miranda Dickerman, Anshul Gupta, Samy Tafasca, Xiaocheng Zhang, Jean-Marc Odobez and Sabine Stoll, in: Boston University Conference on Language Development, 2024

Reasoning with Natural Language Explanations, Marco Valentino and Andre Freitas, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Tutorial Abstracts, 2024

SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials, Mael Jullien, Marco Valentino and Andre Freitas, in: Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), 2024

Graph-Induced Syntactic-Semantic Spaces in Transformer-Based Variational AutoEncoders, Yingji Zhang, Marco Valentino, Danilo Carvalho, Ian Pratt-Hartmann and Andre Freitas, in: Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Large Language Models, scientific knowledge and factuality: A framework to streamline human expert evaluation, Magdalena Wysocka, Oskar Wysocki, Maxime Delmas, Vincent Mutel and Andre Freitas, in: Journal of Biomedical Informatics, 158, 2024

An LLM-based Knowledge Synthesis and Scientific Reasoning Framework for Biomedical Discovery, Oskar Wysocki, Magdalena Wysocka, Danilo Carvalho, Alex Bogatu, Danilo Gusicuma, Maxime Delmas, Harriet Unsworth and Andre Freitas, in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL, Bangkok, Thailand, pages 355-364, 2024

