Human-AI Teaming - Idiap Publications

A Riemannian Take on Distance Fields and Geodesic Flows in Robotics, Yiming Li, Jiacheng Qiu and Sylvain Calinon, in: International Journal of Robotics Research, 2026

Alleviating Forgetfulness of Linear Attention by Hybrid Sparse Attention and Contextualized Learnable Token Eviction, Mutian He and Philip N. Garner, Idiap-RR-01-2026

Contextualisation of Automatic Speech Recognition and Related Applications, Iuliia Thorbecke, University of Zurich, Faculty of Arts, 2026

Exploratory analysis of yellow mongoose vocalization: detection from in-the-wild recordings and call classification, Sevada Hovsepyan, Imen Ben Mahmoud, Vanessa Rüegg, Marta Manser and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2026

Geometry-aware Policy Imitation, Yiming Li, Nael Darwiche, Amirreza Razmjoo, Sichao Liu, Yilun Du, Auke Ijspeert and Sylvain Calinon, in: International Conference on Learning Representations, 2026

Rethinking the Role of Collaborative Robots in Rehabilitation, Vivek Gupte, Shalutha Rajapakshe and Emmanuel Senft, in: Companion Proceedings of the 21st ACM/IEEE International Conference on Human-Robot Interaction (HRI Companion '26), March 16–19, 2026, Edinburgh, Scotland, UK, 2026

Advancing Phonology-Based Sign Language Assessment: From Learner to Machine-Generated Videos, Neha Tarigopula, Ecole polytechnique fédérale de Lausanne (EPFL), 2025

Analogical Structure, Minimal Contextual Cues and Contrastive Distractors: Input Design for Sample-Efficient Linguistic Rule Induction, Chunyang Jiang and Paola Merlo, in: arXiv cs.CL.2511.10441, 2025

Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering, Andrés Carofilis, Pradeep Rangappa, Srikanth Madikeri, Shashi Kumar, Sergio Burdisso, Jeena Prakash, Esaú Villatoro-Tello, Petr Motlicek, Bidisha Sharma, Kadri Hacioğlu, Shankar Venkatesan, Saurabh Vyas and Andreas Stolcke, in: Interspeech 2025, Rotterdam, The Netherlands, pages 3618–3622, 2025

Differentiable rasterization of minimum-time sigma-lognormal trajectories, D. Berio, Sylvain Calinon, R. Plamondon and F. F. Leymarie, in: Proc. 22nd Conference of the International Graphonomics Society (IGS), 2025

Distilling Contact Planning for Fast Trajectory Optimization in Robot Air Hockey, Julius Jankowski, Ante Marić, Puze Liu, Davide Tateo, Jan Peters and Sylvain Calinon, in: Proceedings of Robotics: Science and Systems, 2025

Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild, Damien Teney, Liangze Jiang, Florin Gogianu and Ehsan Abbasnejad, in: The IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Efficient Adaptation for Speech Technology, Haolin Chen, EPFL, 2025

Efficient and Real-Time Motion Planning for Robotics Using Projection-Based Optimization, Xuemin Chi, Hakan Girgin, Tobias Löw, Yangyang Xie, Teng Xue, Jihao Huang, Zhitao Liu and Sylvain Calinon, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025

Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering, Pradeep Rangappa, Andrés Carofilis, Jeena Prakash, Shashi Kumar, Sergio Burdisso, Srikanth Madikeri, Esaú Villatoro-Tello, Bidisha Sharma, Petr Motlicek, Kadri Hacioğlu, Shankar Venkatesan, Saurabh Vyas and Andreas Stolcke, in: Proc. Interspeech, 2025

Enhancing 3D Gaze Estimation in the Wild using Weak Supervision with Gaze Following Labels, Pierre Vuillecard and Jean-Marc Odobez, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025

Enhancing Speaker Diarization using Correlation-Based Clustering Initialization, Pradeep Rangappa, Amrutha Prasad, Srikanth Madikeri and Petr Motlicek, Idiap-RR-09-2025

Ergodic exploration of dynamic distribution, L. Lanča, K. Jakac, Sylvain Calinon and S. Ivić, in: IEEE Robotics and Automation Letters (RA-L), 2025

From forest to zoo: great ape behavior recognition with ChimpBehave, Michael Fuchs, Emilie Genty, Adrian Bangerter, Klaus Zuberbühler, Jean-Marc Odobez and Paul Cotofrei, in: International Journal of Computer Vision, 133:6668–6688, 2025

GAFRO: Geometric Algebra for Robotics [Tutorial], Tobias Löw, Philip Abbet and Sylvain Calinon, in: IEEE Robotics and Automation Magazine, 32(3):184-194, 2025

Geometric Structures for Learning and Optimization in Robotics, Sylvain Calinon, in: Annual Review of Control, Robotics, and Autonomous Systems, 2025

Giving Sense to Inputs: Toward an Accessible Control Framework for Shared Autonomy, Shalutha Rajapakshe, Jean-Marc Odobez and Emmanuel Senft, in: Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction, Melbourne, Australia, ACM, 2025

Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration, J. Liu, Z. Li, M. Yu, Z. Dong, Sylvain Calinon, D. G. Caldwell and F. Chen, in: IEEE Robotics and Automation Magazine (RAM), 32(1):68-78, 2025

Identifying storytelling in job interviews using deep learning, Elisabeth Germanier, Mutian He, Amina Mardiyyah Rufai, Philip N. Garner, Adrian Bangerter, Laetitia A. Renier, Marianne Schmid Mast and Koralie Orji, in: Computers in Human Behavior Reports, 19(100688), 2025

Identity-Preserving Aging and De-Aging of Faces in the StyleGAN Latent Space, Luis S. Luevano, Pavel Korshunov and Sébastien Marcel, in: 2025 IEEE International Joint Conference on Biometrics (IJCB), IEEE, 2025

Image-driven robot drawing with rapid lognormal movements, D. Berio, G. Clivaz, M. Stroh, O. Deussen, R. Plamondon, Sylvain Calinon and F. F. Leymarie, in: Proc. IEEE Intl Symp. on Robot and Human Interactive Communication (Ro-Man), 2025

Improving ASR and Callsign Detection in Air Traffic Control Speech using Whisper Prompting, Jehan Joachim Daniel Piaget, Amrutha Prasad and Petr Motlicek, Idiap-RR-04-2025

Intuitive Robot Programming, C. Blanc, Julius Jankowski, A. Sonderegger, Sylvain Calinon and S. Dégallier Rochat, in: Ergonomics in Robotics: Advances and Innovations, Springer, 2025

Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity, Mutian He and Philip N. Garner, in: 13th International Conference on Learning Representations (ICLR), 2025

Latent Space Factorization in LoRA, Shashi Kumar, Yacouba Kaloga, John Mitros, Petr Motlicek and Ina Kodrasi, in: 39th Conference on Neural Information Processing Systems, 2025

Learning problem decomposition for efficient sequential multi-object manipulation planning, Yan Zhang, Teng Xue, Amirreza Razmjoo Fard and Sylvain Calinon, in: IEEE Robotics and Automation Letters, 2025

Leveraging Untranscribed Data for End-to-End Speech and Callsign Recognition in Air-Traffic Communication, Petr Motlicek, Shashi Kumar, Driss Khalil, Amrutha Prasad and Christof Schüpbach, in: SESAR Innovation Days 2025 (https://www.sesarju.eu/SIDS2025), Eurocontrol, Bled, Slovenia, 2025

ManiDP: Manipulability-Aware Diffusion Policy for Posture-Dependent Bimanual Manipulation, Z. Li, J. Liu, D. Li, T. Teng, M. Li, Sylvain Calinon, D. G. Caldwell and F. Chen, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025

Meaningful Pose-Based Sign Language Evaluation, Zifan Jiang, Colin Leong, Amit Moryossef, Oliver Cory, Maksym Ivashechkin, Neha Tarigopula, Biao Zhang, Anne Göhring, Annette Rios, Rico Sennrich and Sarah Ebling, in: Proceedings of the Tenth Conference on Machine Translation (WMT), 2025

Measuring negative emotions and stress through acoustic correlates in speech: A systematic review, Lilien Schewski, Mathew Magimai-Doss, Guido Beldi and Sandra Keller, in: PLoS One, 20(7), 2025

Movement Generation and Drawing in Robotics, Sylvain Calinon, in: Proc. 22nd Conference of the International Graphonomics Society (IGS), 2025

Neural Image Abstraction Using Long Smoothing B-splines, D. Berio, M. Stroh, Sylvain Calinon, F. F. Leymarie, O. Deussen and A. Shamir, in: ACM Transactions on Graphics (ToG), 2025

Performance Evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward, Shashi Kumar, Iuliia Thorbecke, Sergio Burdisso, Esaú Villatoro-Tello, Manjunath K E, Kadri Hacioğlu, Pradeep Rangappa, Petr Motlicek, Aravind Ganapathiraju and Andreas Stolcke, in: SALMA Workshop, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

Responses to Past-Behavior Questions in Face-To-Face and Asynchronous Video Interviews: Storytelling, Interview Performance and Criterion Validity, Elisabeth Germanier, Adrian Bangerter, Koralie Orji, Laetitia A. Renier, Marianne Schmid Mast, Mutian He and Philip N. Garner, in: Human Performance, 38(5):284-298, 2025

Robot Manipulation with Geometric Algebra: A Unified Geometric Framework for Control and Optimization, Tobias Löw, EDEE, 2025

Robust Pushing: Exploiting Quasi-static Belief Dynamics and Contact-informed Optimization, Julius Jankowski, Lara Brudermuller, Nick Hawes and Sylvain Calinon, in: International Journal of Robotics Research (IJRR), 2025

Specializing General-purpose LLM Embeddings for Implicit Hate Speech Detection across Datasets, Vassiliy Cheremetiev, Quang Long Ho Ngo, Chau Ying Kot, Alina Elena Baia and Andrea Cavallaro, in: Proceedings of the 2nd International Workshop on Diffusion of Harmful Content on Online Web (DHOW '25), October 27–28, 2025, Dublin, Ireland, 2025

Speech Data Selection for Efficient ASR Fine-Tuning using Domain Classifier and Pseudo-Label Filtering, Pradeep Rangappa, Juan Zuluaga-Gomez, Srikanth Madikeri, Andrés Carofilis, Jeena Prakash, Sergio Burdisso, Shashi Kumar, Esaú Villatoro-Tello, Iuliia Nigmatulina, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), 2025

TokenVerse++: Towards Flexible Multitask Learning with Dynamic Task Activation, Shashi Kumar, Srikanth Madikeri, Esaú Villatoro-Tello, Sergio Burdisso, Pradeep Rangappa, Andrés Carofilis, Petr Motlicek, Karthik Pandia D S, Shankar Venkatesan, Kadri Hacioğlu and Andreas Stolcke, in: 2025 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), IEEE, 2025

Towards Accessible and Intuitive Shared Autonomy, Shalutha Rajapakshe, in: Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction, 2025

Towards interpretable emotion recognition: Identifying key features with machine learning, Yacouba Kaloga and Ina Kodrasi, in: Forum Acusticum/EuroNoise, Malaga, Spain, 2025

Unified and Multimodal Learning for Gaze Prediction in Naturalistic Settings, Anshul Gupta, EPFL, 2025

Unifying Global and Near-Context Biasing in a Single Trie Pass, Iuliia Thorbecke, Esaú Villatoro-Tello, Juan Zuluaga-Gomez, Shashi Kumar, Sergio Burdisso, Pradeep Rangappa, Andrés Carofilis, Srikanth Madikeri, Petr Motlicek, Karthik Pandia D S, Kadri Hacioğlu and Andreas Stolcke, in: Text, Speech, and Dialogue. TSD 2025. Lecture Notes in Computer Science, Springer, 2025

Whole-Body Impedance Control of a Humanoid Robot Based on Human-Human Demonstration for Human-Robot Collaboration, C. Li, J. Liu, T. Teng, S. Wang, Sylvain Calinon and F. Chen, in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025

XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Iuliia Thorbecke, Petr Motlicek, Manjunath K E and Aravind Ganapathiraju, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025

ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications, Juan Zuluaga-Gomez, Karel Vesely, Igor Szoke, Alexander Blatt, Petr Motlicek, Martin Kocour, Khalid Choukri, Iuliia Nigmatulina, Claudia Cevenini, Allan Tart, Jan Cernocky and Dietrich Klakow, in: Journal of Data-centric Machine Learning Research, 2024

Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition, Mrinmoy Bhattacharjee, Iuliia Nigmatulina, Amrutha Prasad, Pradeep Rangappa, Srikanth Madikeri, Petr Motlicek, Hartmut Helmke and Matthias Kleinert, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024

Design and Control of Roller Grasper V3 for In-Hand Manipulation, Shenli Yuan, Lin Shao, Yunhai Feng, Jiatong Sun, Teng Xue, Connor Yako, Jeannette Bohg and J. Kenneth Salisbury, in: IEEE Transactions on Robotics, 2024

Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction, Sergio Burdisso, Srikanth Madikeri and Petr Motlicek, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Miami, Florida, USA, pages 5421–5440, Association for Computational Linguistics, 2024

Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, Iuliia Thorbecke, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Shashi Kumar, Pradeep Rangappa, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, Idiap-RR-10-2024

Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, Iuliia Thorbecke, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Shashi Kumar, Pradeep Rangappa, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: Findings of the Association for Computational Linguistics: EMNLP 2024, pages 16747–16762, Association for Computational Linguistics (ACL), 2024

TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Iuliia Thorbecke, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 20988–20995, Association for Computational Linguistics (ACL), 2024

TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Iuliia Nigmatulina, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, Idiap-RR-07-2024

XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Iuliia Nigmatulina, Petr Motlicek, Manjunath K E and Aravind Ganapathiraju, Idiap-RR-08-2024
