Publication list - Idiap Publications

Advancing Neural Representations for Paralinguistic Analysis: From Speech Emotion to Parkinson’s Disease Assessment, Tilak Purohit, EPFL, 2026

[DOI]
[URL]

The EMN Country Factsheets Structured Dataset, David Alonso del Barrio and Daniel Gatica-Perez, Idiap-Com-01-2026

Rethinking the Role of Collaborative Robots in Rehabilitation, Vivek Gupte, Shalutha Rajapakshe and Emmanuel Senft, in: Companion Proceedings of the 21st ACM/IEEE International Conference on Human-Robot Interaction (HRI Companion '26), March 16--19, 2026, Edinburgh, Scotland Uk, 2026

The impact of abstract and object tags on image privacy classification, Darya Baranouskaya and Andrea Cavallaro, in: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026

[DOI]
[URL]

Which private attributes do VLMs agree on and predict well?, Olena Hrynenko, Darya Baranouskaya, Alina Elena Baia and Andrea Cavallaro, in: Proceedings of the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026

Geometry-aware Policy Imitation, Yiming Li, Nael Darwiche, Amirreza Razmjoo, sichao Liu, Yilun Du, Auke Ijspeert and Sylvain Calinon, in: International Conference on Learning Representations, 2026

A Riemannian Take on Distance Fields and Geodesic Flows in Robotics, Yiming Li, Jiacheng Qiu and Sylvain Calinon, in: International Journal of Robotics Research, 2026

Benchmarking Multimodal Large Language Models for Face Recognition, Hatef Otroshi Shahreza and Sébastien Marcel, in: Proceedings of the 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2026

[URL]

Reducing Prompt Sensitivity in LLM-based Speech Recognition Through Learnable Projection, Sergio Burdisso, Esaú Villatoro-Tello, Shashi Kumar, Srikanth Madikeri, Andrés Carofilis, Pradeep Rangappa, Manjunath K E, Kadri Hacioğlu, Petr Motlicek and Andreas Stolcke, in: ICASSP 2026, 2026

Text-only adaptation in LLM-based ASR through text denoising, Sergio Burdisso, Esaú Villatoro-Tello, Andrés Carofilis, Shashi Kumar, Kadri Hacioğlu, Srikanth Madikeri, Pradeep Rangappa, Manjunath K E, Petr Motlicek, Shankar Venkatesan and Andreas Stolcke, in: ICASSP, 2026

PrivLEX: Detecting legal concepts in images through Vision-Language Models, Darya Baranouskaya and Andrea Cavallaro, in: arXiv, 2026

[DOI]
[URL]

Fair Foundation Models for Medical Image Analysis: Challenges and Perspectives, Dilermando Queiroz Neto, Anderson Carlos, André Anjos and Lilian Berton, in: ACM Transactions on Computing for Healthcare, 2026

[DOI]

Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering, Marco Valentino, Geonhee Kim, Dhairya Dalal, Zhixue Zhao and Andre Freitas, in: The 40th Annual AAAI Conference on Artificial Intelligence, 2026

Learning to Disentangle Latent Reasoning Rules with Language VAEs: A Systematic Study, Yingji Zhang, Marco Valentino, Danilo Carvalho and Andre Freitas, in: The 40th Annual AAAI Conference on Artificial Intelligence, 2026

Optimizing Supply Temperature Control in District Heating Networks via Differentiable Dynamic Simulation and Gradient Descent, Roberto Boghetti and Jérôme Kämpf, in: Construction, Energy, Environment and Sustainability. Proceedings of CEES 2025 (Volume 2: Energy), Springer Singapore, 2026

[DOI]
[URL]

Minimal neuron ablation triggers catastrophic collapse in the language core of Large Vision-Language Models, Cen Lu, Yung-Chen Tang and Andrea Cavallaro, in: arXiv, 2025

CarBoN: Calibrated Best-of-N Sampling Improves Test-time Reasoning, Yung-Chen Tang, Pin-Yu Chen and Andrea Cavallaro, in: arXiv, 2025

A Multi-Objective Evaluation Framework for Analyzing Utility-Fairness Trade-Offs in Machine Learning Systems, Gökhan Özbulak, Oscar Jimenez-del-Toro, Maíra Fatoretto, Lilian Berton and André Anjos, in: The Journal of Machine Learning for Biomedical Imaging, 3:938-957, 2025

[DOI]

On the Generation of Face Morphs by Inversion of Optimal Morph Embeddings, Hatef Otroshi Shahreza, Laurent Colbois and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 2025

[DOI]
[URL]

Grey-Box RC Building Models for Intelligent Management of Large-Scale Energy Flexibility: From Mass Modeling to Decentralized Digital Twins, Leonardo A. Bisogno Bernardini, Jérôme Kämpf, Umberto Desideri, Francesco Leccese and Giacomo Salvadori, in: Energies, 19(1), 2025

[DOI]
[URL]

Analogical Structure, Minimal Contextual Cues and Contrastive Distractors: Input Design for Sample-Efficient Linguistic Rule Induction, Chunyang Jiang and Paola Merlo, in: arXiv cs.CL.2511.10441, 2025

Leveraging Untranscribed Data for End-to-End Speech and Callsign Recognition in Air-Traffic Communication, Petr Motlicek, Shashi Kumar, Driss Khalil, Amrutha Prasad and Schüpbach Christof, in: SESAR Innovation Days 2025 (https://www.sesarju.eu/SIDS2025), Eurocontrol, Bled, Slovenia, 2025

[URL]

Text-Graph Encoders and Retrieval-Augmented Generation, Andrei Catalin Coman, EPFL, 2025

[URL]

Towards Integrated Processing of Physiological Signals and Speech, Zohreh Mostaani, Ecole polytechnique fédérale de Lausanne (EPFL), 2025

[DOI]
[URL]

Advancing Phonology-Based Sign Language Assessment: From Learner to Machine-Generated Videos, Neha Tarigopula, Ecole polytechnique fédérale de Lausanne (EPFL), 2025

[DOI]
[URL]

Measuring negative emotions and stress through acoustic correlates in speech: A systematic review, Lilien Schewski, Mathew Magimai-Doss, Guido Beldi and Sandra Keller, in: PLoS One, 20(7), 2025

[DOI]

UpSMART: five years of digital innovation in cancer clinical research---achievements, challenges, and recommendations, Paul O'Regan, Fouziah Butt, Louise Carter, Donna M. Graham, Anja Le Blanc, Richard Hoskins, Laura Stephenson, Akshita Patil, Muhammad Shabbir, Dilan Eken, Subir Singh, Andrea Villa, Luca Agnelli, Silvia Damian, Christopher Grave, Giulia Pretelli, Elena Garralda, Hannah Frost, Filippo de Braud, Andre Freitas, Caroline Dive and Harriet Unsworth, in: Frontiers in Digital Health, 7, 2025

[DOI]

SylloBio-NLI: Evaluating Large Language Models on Biomedical Syllogistic Reasoning, Magdalena Wysocka, Danilo Carvalho, Oskar Wysocki, Marco Valentino and Andre Freitas, in: Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025

Inductive Learning of Logical Theories with LLMs: A Complexity-graded Analysis, João Gandarela, Danilo Carvalho and Andre Freitas, in: The 39th Annual AAAI Conference on Artificial Intelligence, 2025

Synergy and diversity in CLIP: Enhancing performance through adaptive backbone ensembling, Cristian Rodriguez-Opazo, Ehsan Abbasnejad, Damien Teney, Hamed Damirchi, Edison Marrese-Taylor and Anton van den Hengel, in: International Conference on Learning Representations, 2025

Bayesian low-rank learning (Bella): A practical approach to bayesian neural networks, Bao Gia Doan, Afshar Shamsi, Xiao-Yu Guo, Arash Mohammadi, Hamid Alinejad-Rokny, Damien Teney, Damith Ranasinghe and Ehsan Abbasnejad, in: Proceedings of the AAAI Conference on Artificial Intelligence, 2025

Mysteries of the Deep: Role of Intermediate Representations in Out of Distribution Detection, Ignacio Meza De la Jara, Cristian Rodriguez-Opazo, Damien Teney, Damith Ranasinghe and Ehsan Abbasnejad, in: Advances in neural information processing systems, 2025

Cancelable Face Biometrics With Soft-Biometric Privacy Enhancement, Pietro Melzi, Hatef Otroshi Shahreza, Christian Rathgeb, Ruben Tolosana, Ruben Vera-Rodriguez, Julian Fierrez, Sébastien Marcel and Christoph Busch, in: IEEE Access, 2025

[DOI]
[URL]

Subtask C: Tools and methods to leverage the thermal demand response potential in buildings connected to thermal networks, Hicham Johra, Markus Schaffer, Daniel Leiria, Roberto Boghetti, Elisa Guelpa, Christopher Graf, Benedetto Nastasi, Ingo Leusbrock, Anders Rhiger Hansen, Stefano Mazzoni, Salam Al-Saegh, Qian Wang, Yangzhe Chen, Zeng Peng, Jad Al Koussa, Steffen Petersen and Jérôme Kämpf, in: EBC Annex 84: Demand Management of Buildings in Thermal Networks, Aalborg University, Denmark, 2025

[DOI]

Subtask D: Description and comparative analysis of case studies, Christopher Graf, Anna Cadenbach, Ruben Otte, Anna Marszal-Pomianowska, Elisa Guelpa, Vittorio Verda, Ingo Leusbrock, Demet Suna, Ralf-Roman Schmidt, Ole Michael Jensen, Laura Lehmann, Clemens Felsmann, Axel Oliva, Toke Haunstrup Bach Christensen, Jad Al Koussa, Tijs van Oevelen, Dirk Vanhoudt, Michele Tunzi, Roberto Boghetti and Jérôme Kämpf, in: EBC Annex 84: Demand Management of Buildings in Thermal Networks, Aalborg University, Denmark, 2025

[DOI]

Assessing the reliability of archetype-based Urban Building Energy Simulations: A case study analysis in Turin (Italy), Matteo Piro, Jérôme Kämpf, Ilaria Ballarini and Vincenzo Corrado, in: Journal of Physics: Conference Series, pages 062028, IOP Publishing, 2025

[DOI]
[URL]

OpenBEERS: A digital platform for urban scale simulation of building energy efficiency, David Geissbuhler, Alejandro Pena-Bello, Jérôme Kämpf and Jakob Rager, in: Journal of Physics: Conference Series, pages 042013, IOP Publishing, 2025

[DOI]
[URL]

Listening to Hypoglycemia: Voice as a Biomarker for Detection of a Medical Emergency Using Machine Learning, Vera Lehmann, Martin Hilpert, Zohreh Mostaani, Sevada Hovsepyan, Esmé Wallace, Colombine Verzat, Stefan Feuerriegel, Mathias Kraus, James Rosenthal, Gürkan Yilmaz, Mathew Magimai-Doss and Christoph Stettler, in: Diabetes Care, 2025

[DOI]

Accelerating Antibiotic Discovery with Large Language Models and Knowledge Graphs, Maxime Delmas, Magdalena Wysocka, Danilo Gusicuma and Andre Freitas, in: 63rd Annual Meeting of the Association for Computational Linguistics, Vienna, pages 693–705, Association for Computational Linguistics, 2025

[DOI]
[URL]

An evidence-based guidance framework for neural network system diagrams, Guy Marshall, Andre Freitas and Caroline Jay, in: PLOS One, 2025

Montague semantics and modifier consistency measurement in neural language models, Danilo Carvalho, Edoardo Manino, Julia Rozanova, Lucas Cordeiro and Andre Freitas, in: 31st International Conference on Computational Linguistics, 2025

Effective Graph and Rank-based Contextual Embeddings for Textual and Multimedia Data, Thiago Almeida, Gustavo Leticio, Lucas Pascotti, Andre Freitas and Daniel Pedronette, in: International Joint Conference on Neural Networks, 2025

TableDC: Deep Clustering for Tabular Data, Hafiz Rauf, Andre Freitas and Norman Paton, in: ACM SIGMOD International Conference on Management of Data, 2025

Gem: Gaussian Mixture Model Embeddings for Numerical Feature Distributions, Hafiz Rauf, Alex Bogatu, Norman Paton and Andre Freitas, in: 8th International Conference on Extending Database Technology, 2025

Eliciting Critical Reasoning in Retrieval-Augmented Language Models via Contrastive Explanations, Leonardo Ranaldi, Marco Valentino and Andre Freitas, in: Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025

Controlling Equational Reasoning in Large Language Models with Prompt Interventions, Jordan Meadows, Marco Valentino and Andre Freitas, in: The 39th Annual AAAI Conference on Artificial Intelligence, 2025

A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models, Geonhee Kim, Marco Valentino and Andre Freitas, in: Findings of the ACL, 2025

PEIRCE: Unifying Material and Formal Reasoning via LLM-Driven Neuro-Symbolic Refinement, Xin Quan, Marco Valentino, Danilo Carvalho, Dhairya Dalal and Andre Freitas, in: Demonstration at 63rd Annual Meeting of the Association for Computational Linguistics, 2025

Improving chain-of-thought reasoning via quasi-symbolic abstractions, Leonardo Ranaldi, Marco Valentino, Alexander Polonsky and Andre Freitas, in: 63rd Annual Meeting of the Association for Computational Linguistics, 2025

Faithful and Robust LLM-Driven Theorem Proving for NLI Explanations, Xin Quan, Marco Valentino, Louise Dennis and Andre Freitas, in: 63rd Annual Meeting of the Association for Computational Linguistics, 2025