Idiap Publications - Public RSS - Publications 2.0 Idiap Publications - Public RSS - Publications 2.0 https://publications.idiap.ch/ idiap.ch Zero frequency resonator based extraction of R-peaks in ECG signals RaviShankar Prasad, Gürkan Yilmaz and Mathew Magimai-Doss: "Zero frequency resonator based extraction of R-peaks in ECG signals" https://publications.idiap.ch/publications/show/5860 Skill Extraction from Resumes and Job Offers across Six Languages Laura Vásquez-Rodríguez, Bertrand Audrin, Samuel Michel, Samuele Galli, Julneth Rogenhofer, Jacopo Negro Cusa and Lonneke van der Plas: "Skill Extraction from Resumes and Job Offers across Six Languages" https://publications.idiap.ch/publications/show/5859 Framing Migration News with LLMs: Structured CoT as a Support for Human Interpretation David Alonso del Barrio, Jing Wen and Daniel Gatica-Perez: "Framing Migration News with LLMs: Structured CoT as a Support for Human Interpretation" https://publications.idiap.ch/publications/show/5858 UveAI: clinic-ready scoring of retinal inflammation in uveitis on widefield fluorescein angiography using AI Victor Amiot, Roberto Pulvirenti, Oscar Jimenez-del-Toro, Muriel Ott, Teodora-Elena Bogaciu, Shalini Banerjee, Christoph Amstutz, Jean-Marc Odobez, Christophe Chiquet, Yan Guex-Crosier, Ciara Bergin, Ilenia Meloni, André Anjos, Florence Hoogewoud and Mattia Tomasoni: "UveAI: clinic-ready scoring of retinal inflammation in uveitis on widefield fluorescein angiography using AI" https://publications.idiap.ch/publications/show/5857 When Specialization Helps (and Hurts): Cross-Modality Transfer in Ophthalmic Imaging with Foundation Models Roberto Pulvirenti, Oscar Jimenez-del-Toro, Mattia Tomasoni, Florence Hoogewoud and André Anjos: "When Specialization Helps (and Hurts): Cross-Modality Transfer in Ophthalmic Imaging with Foundation Models" https://publications.idiap.ch/publications/show/5856 Évaluation de la reconnaissance automatique de la parole par les grands modèles de langage génératifs Thibault Bañeras-Roux, Shashi Kumar, Driss Khalil, Petr Motlicek, Sergio Burdisso, Shiran Liu, Mickael Rouvier, Jane Wottawa and Richard Dufour: "Évaluation de la reconnaissance automatique de la parole par les grands modèles de langage génératifs" https://publications.idiap.ch/publications/show/5855 Flexible Clustering of Substations for Accurate and Rapid Hybrid Simulation of District Heating Dubon Rodrigue, Mohamed T. Mabrouk, Bastien Pasdeloup, Patrick Meyer and Bruno Lacarrière: "Flexible Clustering of Substations for Accurate and Rapid Hybrid Simulation of District Heating" https://publications.idiap.ch/publications/show/5854 Learning Ego-Exo Visual Representations for Conversational Gaze Estimation Anshul Gupta, Yijun Qian, Ruohan Gao, Ishwarya Ananthabhotla, Jean-Marc Odobez, Vamsi Krishna Ithapu and Calvin Murdock: "Learning Ego-Exo Visual Representations for Conversational Gaze Estimation" https://publications.idiap.ch/publications/show/5852 Meta-RL Induces Exploration in Language Agents Yulun Jiang, Liangze Jiang, Damien Teney, Michael Moor and Maria Brbić: "Meta-RL Induces Exploration in Language Agents" https://publications.idiap.ch/publications/show/5851 Design and Control of Roller Grasper V3 for In-Hand Manipulation Shenli Yuan, Lin Shao, Yunhai Feng, Jiatong Sun, Teng Xue, Connor Yako, Jeannette Bohg and J. Kenneth Salisbury: "Design and Control of Roller Grasper V3 for In-Hand Manipulation" https://publications.idiap.ch/publications/show/5850 Speech DF Arena: A Leaderboard for Speech DeepFake Detection Models Sandipana Dowerah, Atharva Kulkarni, Ajinkya Kulkarni, Hoan My Tran, Joonas Kalda, Artem Fedorchenko, Benoit Fauve, Damien Lolive, Tanel alumae and Mathew Magimai-Doss: "Speech DF Arena: A Leaderboard for Speech DeepFake Detection Models" https://publications.idiap.ch/publications/show/5849 Meaningful Pose-Based Sign Language Evaluation Zifan Jiang, Colin Leong, Amit Moryossef, Oliver Cory, Maksym Ivashechkin, Neha Tarigopula, Biao Zhang, Anne Göhring, Annette Rios, Rico Sennrich and Sarah Ebling: "Meaningful Pose-Based Sign Language Evaluation" https://publications.idiap.ch/publications/show/5848 Open Challenge: Exploring People's Everyday Life Behavior with Mobile Data Andrea Bontempelli, Matteo Busso, Lakmal Buddika Meegahapola, Amalia de Götzen, Fausto Giunchiglia and Daniel Gatica-Perez: "Open Challenge: Exploring People's Everyday Life Behavior with Mobile Data" https://publications.idiap.ch/publications/show/5847 The Internet of Us Loizos Michael, Ivano Bison, Matteo Busso, Luca Cernuzzi, Amalia de Götzen, Shyam Diwakar, Kobi Gal, Amarsanaa Ganbold, George Gaskell, Daniel Gatica-Perez, Jessica Heesen, Daniele Miorandi, Salvador Ruiz-Correa, Laura Schelenz, Avi Segal, Carles Sierra, Hao Xu and Fausto Giunchiglia: "The Internet of Us" https://publications.idiap.ch/publications/show/5846 Building A Civic Tool for Community-Police Engagement to Adapt Neighborhood Policing Ravinithesh Annapureddy, Staņislavs Šeiko, Natalie Higham-James, William Droz, Alessandro Fornaroli, Sarah Vollmer, Britta Elena Hecking and Daniel Gatica-Perez: "Building A Civic Tool for Community-Police Engagement to Adapt Neighborhood Policing" https://publications.idiap.ch/publications/show/5845 Lightweight Cross-Spectral Face Recognition via Contrastive Alignment and Distillation Anjith George and Sébastien Marcel: "Lightweight Cross-Spectral Face Recognition via Contrastive Alignment and Distillation" https://publications.idiap.ch/publications/show/5844 Syllable-Level Features for Speech Pathology Detection: A Case Study of Parkinson’s Disease Sevada Hovsepyan and Mathew Magimai-Doss: "Syllable-Level Features for Speech Pathology Detection: A Case Study of Parkinson’s Disease" https://publications.idiap.ch/publications/show/5843 A Scalable, Automatic, and Evolutionary Algorithm for Calibrating Urban Building Energy Models Matteo Piro, Jérôme Kämpf, Ilaria Ballarini and Vincenzo Corrado: "A Scalable, Automatic, and Evolutionary Algorithm for Calibrating Urban Building Energy Models" https://publications.idiap.ch/publications/show/5841 INFLUENCE OF CLEAN SPEECH CHARACTERISTICS ON SPEECH ENHANCEMENT PERFORMANCE Mingchi Hou and Ina Kodrasi: "INFLUENCE OF CLEAN SPEECH CHARACTERISTICS ON SPEECH ENHANCEMENT PERFORMANCE" https://publications.idiap.ch/publications/show/5840 GENERALIZABILITY OF PREDICTIVE AND GENERATIVE SPEECH ENHANCEMENT MODELS TO PATHOLOGICAL SPEAKERS Mingchi Hou, Ante Jukic and Ina Kodrasi: "GENERALIZABILITY OF PREDICTIVE AND GENERATIVE SPEECH ENHANCEMENT MODELS TO PATHOLOGICAL SPEAKERS" https://publications.idiap.ch/publications/show/5839 Migrant Voices, Local News: Insights on Bridging Community Needs with Media Content David Alonso del Barrio, Paula Dolores Rescala, Victor Bros and Daniel Gatica-Perez: "Migrant Voices, Local News: Insights on Bridging Community Needs with Media Content" https://publications.idiap.ch/publications/show/5838 Improving Generalization of Pretrained Language Models Rabeeh Karimi Mahabadi: "Improving Generalization of Pretrained Language Models" https://publications.idiap.ch/publications/show/5836 Fast and Future: Towards Efficient Forecasting in Video Semantic Segmentation Evann Courdier: "Fast and Future: Towards Efficient Forecasting in Video Semantic Segmentation" https://publications.idiap.ch/publications/show/5835 CONTEXTUALISATION OF AUTOMATIC SPEECH RECOGNITION AND RELATED APPLICATIONS Thorbecke Iuliia: "CONTEXTUALISATION OF AUTOMATIC SPEECH RECOGNITION AND RELATED APPLICATIONS" https://publications.idiap.ch/publications/show/5834 Triangulating Temporal Dynamics in Multilingual Swiss Online News Victor Bros, Evan Dufraisse, Adrian Popescu and Daniel Gatica-Perez: "Triangulating Temporal Dynamics in Multilingual Swiss Online News" https://publications.idiap.ch/publications/show/5833 Politics of Questions in News: A Mixed-Methods Study of Interrogative Stances as Markers of Voice and Power Victor Bros, Matilde Barbini, Patrick Gerard and Daniel Gatica-Perez: "Politics of Questions in News: A Mixed-Methods Study of Interrogative Stances as Markers of Voice and Power" https://publications.idiap.ch/publications/show/5832 Graph neural network-based surrogate modeling for fast and scalable simulations of meshed district heating networks Roberto Boghetti, Jean-Marc Odobez and Jérôme Kämpf: "Graph neural network-based surrogate modeling for fast and scalable simulations of meshed district heating networks" https://publications.idiap.ch/publications/show/5831 DDialogue: A Collaborative Framework for Cross-Sectoral Dialogue through Data Alessandro Fornaroli, Ravinithesh Annapureddy and Daniel Gatica-Perez: "DDialogue: A Collaborative Framework for Cross-Sectoral Dialogue through Data" https://publications.idiap.ch/publications/show/5828 Effects of cool coatings on urban microclimate and outdoor thermal Comfort: A CFD–CitySim pro coupled simulation study Da-Som Mun, Jérôme Kämpf and Jae-Jin Kim: "Effects of cool coatings on urban microclimate and outdoor thermal Comfort: A CFD–CitySim pro coupled simulation study" https://publications.idiap.ch/publications/show/5827 Advancing Neural Representations for Paralinguistic Analysis: From Speech Emotion to Parkinson’s Disease Assessment Tilak Purohit: "Advancing Neural Representations for Paralinguistic Analysis: From Speech Emotion to Parkinson’s Disease Assessment" https://publications.idiap.ch/publications/show/5826 The EMN Country Factsheets Structured Dataset David Alonso del Barrio and Daniel Gatica-Perez: "The EMN Country Factsheets Structured Dataset" https://publications.idiap.ch/publications/show/5825 Minimal neuron ablation triggers catastrophic collapse in the language core of Large Vision-Language Models Cen Lu, Yung-Chen Tang and Andrea Cavallaro: "Minimal neuron ablation triggers catastrophic collapse in the language core of Large Vision-Language Models" https://publications.idiap.ch/publications/show/5824 CarBoN: Calibrated Best-of-N Sampling Improves Test-time Reasoning Yung-Chen Tang, Pin-Yu Chen and Andrea Cavallaro: "CarBoN: Calibrated Best-of-N Sampling Improves Test-time Reasoning" https://publications.idiap.ch/publications/show/5823 Rethinking the Role of Collaborative Robots in Rehabilitation Vivek Gupte, Shalutha Rajapakshe and Emmanuel Senft: "Rethinking the Role of Collaborative Robots in Rehabilitation" https://publications.idiap.ch/publications/show/5822 The impact of abstract and object tags on image privacy classification Darya Baranouskaya and Andrea Cavallaro: "The impact of abstract and object tags on image privacy classification" https://publications.idiap.ch/publications/show/5821 Which private attributes do VLMs agree on and predict well? Olena Hrynenko, Darya Baranouskaya, Alina Elena Baia and Andrea Cavallaro: "Which private attributes do VLMs agree on and predict well?" https://publications.idiap.ch/publications/show/5820 A Multi-Objective Evaluation Framework for Analyzing Utility-Fairness Trade-Offs in Machine Learning Systems Gökhan Özbulak, Oscar Jimenez-del-Toro, Maíra Fatoretto, Lilian Berton and André Anjos: "A Multi-Objective Evaluation Framework for Analyzing Utility-Fairness Trade-Offs in Machine Learning Systems" https://publications.idiap.ch/publications/show/5819 Geometry-aware Policy Imitation Yiming Li, Nael Darwiche, Amirreza Razmjoo, sichao Liu, Yilun Du, Auke Ijspeert and Sylvain Calinon: "Geometry-aware Policy Imitation" https://publications.idiap.ch/publications/show/5818 A Riemannian Take on Distance Fields and Geodesic Flows in Robotics Yiming Li, Jiacheng Qiu and Sylvain Calinon: "A Riemannian Take on Distance Fields and Geodesic Flows in Robotics" https://publications.idiap.ch/publications/show/5817 Benchmarking Multimodal Large Language Models for Face Recognition Hatef Otroshi Shahreza and Sébastien Marcel: "Benchmarking Multimodal Large Language Models for Face Recognition" https://publications.idiap.ch/publications/show/5816 Reducing Prompt Sensitivity in LLM-based Speech Recognition Through Learnable Projection Sergio Burdisso, Esaú Villatoro-Tello, Shashi Kumar, Srikanth Madikeri, Andrés Carofilis, Pradeep Rangappa, Manjunath K E, Kadri Hacioğlu, Petr Motlicek and Andreas Stolcke: "Reducing Prompt Sensitivity in LLM-based Speech Recognition Through Learnable Projection" https://publications.idiap.ch/publications/show/5815 Text-only adaptation in LLM-based ASR through text denoising Sergio Burdisso, Esaú Villatoro-Tello, Andrés Carofilis, Shashi Kumar, Kadri Hacioğlu, Srikanth Madikeri, Pradeep Rangappa, Manjunath K E, Petr Motlicek, Shankar Venkatesan and Andreas Stolcke: "Text-only adaptation in LLM-based ASR through text denoising" https://publications.idiap.ch/publications/show/5814 PrivLEX: Detecting legal concepts in images through Vision-Language Models Darya Baranouskaya and Andrea Cavallaro: "PrivLEX: Detecting legal concepts in images through Vision-Language Models" https://publications.idiap.ch/publications/show/5813 Fair Foundation Models for Medical Image Analysis: Challenges and Perspectives Dilermando Queiroz Neto, Anderson Carlos, André Anjos and Lilian Berton: "Fair Foundation Models for Medical Image Analysis: Challenges and Perspectives" https://publications.idiap.ch/publications/show/5812 On the Generation of Face Morphs by Inversion of Optimal Morph Embeddings Hatef Otroshi Shahreza, Laurent Colbois and Sébastien Marcel: "On the Generation of Face Morphs by Inversion of Optimal Morph Embeddings" https://publications.idiap.ch/publications/show/5810 Grey-Box RC Building Models for Intelligent Management of Large-Scale Energy Flexibility: From Mass Modeling to Decentralized Digital Twins Leonardo A. Bisogno Bernardini, Jérôme Kämpf, Umberto Desideri, Francesco Leccese and Giacomo Salvadori: "Grey-Box RC Building Models for Intelligent Management of Large-Scale Energy Flexibility: From Mass Modeling to Decentralized Digital Twins" https://publications.idiap.ch/publications/show/5809 Analogical Structure, Minimal Contextual Cues and Contrastive Distractors: Input Design for Sample-Efficient Linguistic Rule Induction Chunyang Jiang and Paola Merlo: "Analogical Structure, Minimal Contextual Cues and Contrastive Distractors: Input Design for Sample-Efficient Linguistic Rule Induction" https://publications.idiap.ch/publications/show/5806 Leveraging Untranscribed Data for End-to-End Speech and Callsign Recognition in Air-Traffic Communication Petr Motlicek, Shashi Kumar, Driss Khalil, Amrutha Prasad and Schüpbach Christof: "Leveraging Untranscribed Data for End-to-End Speech and Callsign Recognition in Air-Traffic Communication" https://publications.idiap.ch/publications/show/5804 Text-Graph Encoders and Retrieval-Augmented Generation Andrei Catalin Coman: "Text-Graph Encoders and Retrieval-Augmented Generation" https://publications.idiap.ch/publications/show/5803 Towards Integrated Processing of Physiological Signals and Speech Zohreh Mostaani: "Towards Integrated Processing of Physiological Signals and Speech" https://publications.idiap.ch/publications/show/5802