Publication list - Idiap Publications

Exploratory analysis of yellow mongoose vocalization: detection from in-the-wild recordings and call classification, Sevada Hovsepyan, Imen Ben Mahmoud, Vanessa Rüegg, Marta Manser and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2026

Sparse Neuron Ablation Triggers Catastrophic Collapse of the Language Core in Large Vision-Language Models, Cen Lu, Yung-Chen Tang and Andrea Cavallaro, in: Mechanistic Interpretability Workshop at the 43rd International Conference on Machine Learning, 2026

FAccT-Checked: A Narrative Review of Authority Reconfigurations and Retention in AI-Mediated Journalism, Matilde Barbini, Daniel Gatica-Perez and Stefano Sorrentino, in: Proc. ACM Conference on Fairness, Accountability, and Transparency, Montreal, 2026

You Are How You Pay: Understanding and Identifying the Payment Behavior of Socio-Demographic Groups, Aurel Ruben Mader, Matthias Jüttner and Daniel Gatica-Perez, in: ACM Digital Government: Research and Practice, 7(2), 2026

RAG as a Content-Analysis Assistant: Auditing SDG Discourse in Online Videos, Victor Bros, Daniel Gatica-Perez and Cristian Safta, in: Proceedings of the Workshops and Tutorials of the ACM International Conference on Multimedia Retrieval (ICMR 2026), 2026

Zero frequency resonator based extraction of R-peaks in ECG signals, RaviShankar Prasad, Gürkan Yilmaz and Mathew Magimai-Doss, in: Proceedings of EUSIPCO, 2026

Skill Extraction from Resumes and Job Offers across Six Languages, Laura Vásquez-Rodríguez, Bertrand Audrin, Samuel Michel, Samuele Galli, Julneth Rogenhofer, Jacopo Negro Cusa and Lonneke van der Plas, in: Proceedings of the 11th edition of the Swiss Text Analytics Conference, 2026

Framing Migration News with LLMs: Structured CoT as a Support for Human Interpretation, David Alonso del Barrio, Jing Wen and Daniel Gatica-Perez, in: COMPASS'26, 2026

UveAI: clinic-ready scoring of retinal inflammation in uveitis on widefield fluorescein angiography using AI, Victor Amiot, Roberto Pulvirenti, Oscar Jimenez-del-Toro, Muriel Ott, Teodora-Elena Bogaciu, Shalini Banerjee, Christoph Amstutz, Jean-Marc Odobez, Christophe Chiquet, Yan Guex-Crosier, Ciara Bergin, Ilenia Meloni, André Anjos, Florence Hoogewoud and Mattia Tomasoni, in: Scientific Reports, 2026

When Specialization Helps (and Hurts): Cross-Modality Transfer in Ophthalmic Imaging with Foundation Models, Roberto Pulvirenti, Oscar Jimenez-del-Toro, Mattia Tomasoni, Florence Hoogewoud and André Anjos, in: 2026 IEEE 23rd International Symposium on Biomedical Imaging, 2026

Évaluation de la reconnaissance automatique de la parole par les grands modèles de langage génératifs, Thibault Bañeras-Roux, Shashi Kumar, Driss Khalil, Petr Motlicek, Sergio Burdisso, Shiran Liu, Mickael Rouvier, Jane Wottawa and Richard Dufour, in: EvalLLM2026 : Atelier sur l'evaluation des modeles generatifs (LLM), le RAG et challenges, 2026

Flexible Clustering of Substations for Accurate and Rapid Hybrid Simulation of District Heating, Dubon Rodrigue, Mohamed T. Mabrouk, Bastien Pasdeloup, Patrick Meyer and Bruno Lacarrière, in: Energy Informatics, 2026

Learning Ego-Exo Visual Representations for Conversational Gaze Estimation, Anshul Gupta, Yijun Qian, Ruohan Gao, Ishwarya Ananthabhotla, Jean-Marc Odobez, Vamsi Krishna Ithapu and Calvin Murdock, in: Conference on Computer Vision and Pattern Recognition Workshops, 2026

Meta-RL Induces Exploration in Language Agents, Yulun Jiang, Liangze Jiang, Damien Teney, Michael Moor and Maria Brbić, in: The Fourteenth International Conference on Learning Representations, 2026

Speech DF Arena: A Leaderboard for Speech DeepFake Detection Models, Sandipana Dowerah, Atharva Kulkarni, Ajinkya Kulkarni, Hoan My Tran, Joonas Kalda, Artem Fedorchenko, Benoit Fauve, Damien Lolive, Tanel Alumäe and Mathew Magimai-Doss, in: IEEE Open Journal of Signal Processing, 7:73-81, 2026

The Internet of Us, Loizos Michael, Ivano Bison, Matteo Busso, Luca Cernuzzi, Amalia de Götzen, Shyam Diwakar, Kobi Gal, Amarsanaa Ganbold, George Gaskell, Daniel Gatica-Perez, Jessica Heesen, Daniele Miorandi, Salvador Ruiz-Correa, Laura Schelenz, Avi Segal, Carles Sierra, Hao Xu and Fausto Giunchiglia, in: The European Journal on Artificial Intelligence (SAGE Publications), 2026

Building A Civic Tool for Community-Police Engagement to Adapt Neighborhood Policing, Ravinithesh Annapureddy, Staņislavs Šeiko, Natalie Higham-James, William Droz, Alessandro Fornaroli, Sarah Vollmer, Britta Elena Hecking and Daniel Gatica-Perez, in: Designing Interactive Systems Conference (DIS '26), June 13-17, 2026, Singapore, Singapore, 2026

Lightweight Cross-Spectral Face Recognition via Contrastive Alignment and Distillation, Anjith George and Sébastien Marcel, in: IEEE TBIOM, 2026

Syllable-Level Features for Speech Pathology Detection: A Case Study of Parkinson’s Disease, Sevada Hovsepyan and Mathew Magimai-Doss, Idiap-RR-02-2026

A Scalable, Automatic, and Evolutionary Algorithm for Calibrating Urban Building Energy Models, Matteo Piro, Jérôme Kämpf, Ilaria Ballarini and Vincenzo Corrado, in: Sustainable Cities and Society, 2026

Influence of Clean Speech Characteristics on Speech Enhancement Performance, Mingchi Hou and Ina Kodrasi, in: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026

Generalizability of Predictive and Generative Speech Enhancement Models to Pathological Speakers, Mingchi Hou, Ante Jukic and Ina Kodrasi, in: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026

Migrant Voices, Local News: Insights on Bridging Community Needs with Media Content, David Alonso del Barrio, Paula Dolores Rescala, Victor Bros and Daniel Gatica-Perez, in: ACM International Conference on Interactive Media Experiences, 2026

Contextualisation of Automatic Speech Recognition and Related Applications, Iuliia Thorbecke, University of Zurich, Faculty of Arts, 2026

Triangulating Temporal Dynamics in Multilingual Swiss Online News, Victor Bros, Evan Dufraisse, Adrian Popescu and Daniel Gatica-Perez, in: Vol. 20 (2026): Proceedings of the Twentieth International AAAI Conference on Web and Social Media, 2026

Politics of Questions in News: A Mixed-Methods Study of Interrogative Stances as Markers of Voice and Power, Victor Bros, Matilde Barbini, Patrick Gerard and Daniel Gatica-Perez, in: Vol. 20 (2026): Proceedings of the Twentieth International AAAI Conference on Web and Social Media, 2026

Graph neural network-based surrogate modeling for fast and scalable simulations of meshed district heating networks, Roberto Boghetti, Jean-Marc Odobez and Jérôme Kämpf, in: Energy and AI, 24, 2026

DDialogue: A Collaborative Framework for Cross-Sectoral Dialogue through Data, Alessandro Fornaroli, Ravinithesh Annapureddy and Daniel Gatica-Perez, in: Participatory Design Conference 2026, June 15-19, 2026, Milan, Italy, ACM, 2026

Effects of cool coatings on urban microclimate and outdoor thermal comfort: A CFD–CitySim Pro coupled simulation study, Da-Som Mun, Jérôme Kämpf and Jae-Jin Kim, in: Energy and Buildings, 2026

Advancing Neural Representations for Paralinguistic Analysis: From Speech Emotion to Parkinson’s Disease Assessment, Tilak Purohit, EPFL, 2026

The EMN Country Factsheets Structured Dataset, David Alonso del Barrio and Daniel Gatica-Perez, Idiap-Com-01-2026

Rethinking the Role of Collaborative Robots in Rehabilitation, Vivek Gupte, Shalutha Rajapakshe and Emmanuel Senft, in: Companion Proceedings of the 21st ACM/IEEE International Conference on Human-Robot Interaction (HRI Companion '26), March 16-19, 2026, Edinburgh, Scotland, UK, 2026

The impact of abstract and object tags on image privacy classification, Darya Baranouskaya and Andrea Cavallaro, in: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026

Which private attributes do VLMs agree on and predict well?, Olena Hrynenko, Darya Baranouskaya, Alina Elena Baia and Andrea Cavallaro, in: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026

Geometry-aware Policy Imitation, Yiming Li, Nael Darwiche, Amirreza Razmjoo, Sichao Liu, Yilun Du, Auke Ijspeert and Sylvain Calinon, in: International Conference on Learning Representations, 2026

A Riemannian Take on Distance Fields and Geodesic Flows in Robotics, Yiming Li, Jiacheng Qiu and Sylvain Calinon, in: International Journal of Robotics Research, 2026

Benchmarking Multimodal Large Language Models for Face Recognition, Hatef Otroshi Shahreza and Sébastien Marcel, in: Proceedings of the 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2026

Reducing Prompt Sensitivity in LLM-based Speech Recognition Through Learnable Projection, Sergio Burdisso, Esaú Villatoro-Tello, Shashi Kumar, Srikanth Madikeri, Andrés Carofilis, Pradeep Rangappa, Manjunath K E, Kadri Hacioğlu, Petr Motlicek and Andreas Stolcke, in: ICASSP 2026, 2026

Text-only adaptation in LLM-based ASR through text denoising, Sergio Burdisso, Esaú Villatoro-Tello, Andrés Carofilis, Shashi Kumar, Kadri Hacioğlu, Srikanth Madikeri, Pradeep Rangappa, Manjunath K E, Petr Motlicek, Shankar Venkatesan and Andreas Stolcke, in: ICASSP, 2026

PrivLEX: Detecting legal concepts in images through Vision-Language Models, Darya Baranouskaya and Andrea Cavallaro, in: arXiv, 2026

Fair Foundation Models for Medical Image Analysis: Challenges and Perspectives, Dilermando Queiroz Neto, Anderson Carlos, André Anjos and Lilian Berton, in: ACM Transactions on Computing for Healthcare, 2026

Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering, Marco Valentino, Geonhee Kim, Dhairya Dalal, Zhixue Zhao and Andre Freitas, in: The 40th Annual AAAI Conference on Artificial Intelligence, 2026

Learning to Disentangle Latent Reasoning Rules with Language VAEs: A Systematic Study, Yingji Zhang, Marco Valentino, Danilo Carvalho and Andre Freitas, in: The 40th Annual AAAI Conference on Artificial Intelligence, 2026

Optimizing Supply Temperature Control in District Heating Networks via Differentiable Dynamic Simulation and Gradient Descent, Roberto Boghetti and Jérôme Kämpf, in: Construction, Energy, Environment and Sustainability. Proceedings of CEES 2025 (Volume 2: Energy), Springer Singapore, 2026

Alleviating Forgetfulness of Linear Attention by Hybrid Sparse Attention and Contextualized Learnable Token Eviction, Mutian He and Philip N. Garner, Idiap-RR-01-2026

Meaningful Pose-Based Sign Language Evaluation, Zifan Jiang, Colin Leong, Amit Moryossef, Oliver Cory, Maksym Ivashechkin, Neha Tarigopula, Biao Zhang, Anne Göhring, Annette Rios, Rico Sennrich and Sarah Ebling, in: Proceedings of the Tenth Conference on Machine Translation (WMT), 2025

Open Challenge: Exploring People's Everyday Life Behavior with Mobile Data, Andrea Bontempelli, Matteo Busso, Lakmal Buddika Meegahapola, Amalia de Götzen, Fausto Giunchiglia and Daniel Gatica-Perez, in: Companion of the 2025 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2025

Minimal neuron ablation triggers catastrophic collapse in the language core of Large Vision-Language Models, Cen Lu, Yung-Chen Tang and Andrea Cavallaro, in: arXiv, 2025

CarBoN: Calibrated Best-of-N Sampling Improves Test-time Reasoning, Yung-Chen Tang, Pin-Yu Chen and Andrea Cavallaro, in: arXiv, 2025

A Multi-Objective Evaluation Framework for Analyzing Utility-Fairness Trade-Offs in Machine Learning Systems, Gökhan Özbulak, Oscar Jimenez-del-Toro, Maíra Fatoretto, Lilian Berton and André Anjos, in: The Journal of Machine Learning for Biomedical Imaging, 3:938-957, 2025
