Publications of research program Human-AI Teaming
2025
| Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering, , , , , , , , , , , , and , in: Interspeech 2025, Rotterdam, The Netherlands, pages 3618–3622, 2025 |
[DOI] [URL] |
| Differentiable rasterization of minimum-time sigma-lognormal trajectories, , , and , in: Proc. 22nd Conference of the International Graphonomics Society (IGS), 2025 |
|
| Distilling Contact Planning for Fast Trajectory Optimization in Robot Air Hockey, , , , , and , in: Proceedings of Robotics: Science and Systems, 2025 |
[DOI] [URL] |
| Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild, , , and , in: The IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025 |
| Efficient Adaptation for Speech Technology, , EPFL, 2025 |
|
| Efficient and Real-Time Motion Planning for Robotics Using Projection-Based Optimization, , , , , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025 |
| Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering, , , , , , , , , , , , and , in: Proc. Interspeech, 2025 |
|
| Enhancing 3D Gaze Estimation in the Wild using Weak Supervision with Gaze Following Labels, and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025 |
|
| Enhancing Speaker Diarization using Correlation-Based Clustering Initialization, , , and , Idiap-RR-09-2025 |
|
| Ergodic exploration of dynamic distribution, , , and , in: IEEE Robotics and Automation Letters (RA-L), 2025 |
|
| From forest to zoo: great ape behavior recognition with ChimpBehave, , , , , and , in: International Journal of Computer Vision, 133:6668–6688, 2025 |
[DOI] |
| GAFRO: Geometric Algebra for Robotics [Tutorial], , and , in: IEEE Robotics and Automation Magazine, 32(3):184-194, 2025 |
[DOI] |
| Geometric Structures for Learning and Optimization in Robotics, , in: Annual Review of Control, Robotics, and Autonomous Systems, 2025 |
| Giving Sense to Inputs: Toward an Accessible Control Framework for Shared Autonomy, , and , in: Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction, Melbourne, Australia, ACM, 2025 |
[URL] |
| Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration, , , , , , and , in: IEEE Robotics and Automation Magazine (RAM), 32(1):68-78, 2025 |
|
| Identifying storytelling in job interviews using deep learning, , , , , , , and , in: Computers in Human Behavior Reports, 19(100688), 2025 |
[DOI] |
| Identity-Preserving Aging and De-Aging of Faces in the StyleGAN Latent Space, , and , in: 2025 IEEE International Joint Conference on Biometrics (IJCB), IEEE, 2025 |
|
| Image-driven robot drawing with rapid lognormal movements, , , , , , and , in: Proc. IEEE Intl Symp. on Robot and Human Interactive Communication (Ro-Man), 2025 |
|
| Improving ASR and Callsign Detection in Air Traffic Control Speech using Whisper Prompting, , and , Idiap-RR-04-2025 |
|
| Intuitive Robot Programming, , , , and , in: Ergonomics in Robotics: Advances and Innovations, Springer, 2025 |
| Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity, and , in: 13th International Conference on Learning Representations (ICLR), 2025 |
[URL] |
| Latent Space Factorization in LoRA, , , , and , in: 39th Conference on Neural Information Processing Systems, 2025 |
[URL] |
| Learning problem decomposition for efficient sequential multi-object manipulation planning, , , and , in: IEEE Robotics and Automation Letters, 2025 |
| ManiDP: Manipulability-Aware Diffusion Policy for Posture-Dependent Bimanual Manipulation, , , , , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025 |
| Measuring negative emotions and stress through acoustic correlates in speech: A systematic review, , , and , in: PLoS One, 20(7), 2025 |
[DOI] |
| Movement Generation and Drawing in Robotics, , in: Proc. 22nd Conference of the International Graphonomics Society (IGS), 2025 |
|
| Neural Image Abstraction Using Long Smoothing B-splines, , , , , and , in: ACM Transactions on Graphics (ToG), 2025 |
|
| Performance Evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward, , , , , , , , , and , in: SALMA Workshop, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025 |
[URL] |
| Responses to Past-Behavior Questions in Face-To-Face and Asynchronous Video Interviews: Storytelling, Interview Performance and Criterion Validity, , , , , , and , in: Human Performance, 38(5):284-298, 2025 |
[DOI] |
| Robot Manipulation with Geometric Algebra: A Unified Geometric Framework for Control and Optimization, , EDEE, 2025 |
|
| Robust Pushing: Exploiting Quasi-static Belief Dynamics and Contact-informed Optimization, , , and , in: International Journal of Robotics Research (IJRR), 2025 |
|
| Specializing General-purpose LLM Embeddings for Implicit Hate Speech Detection across Datasets, , , , and , in: Proceedings of the 2nd International Workshop on Diffusion of Harmful Content on Online Web (DHOW '25), October 27–28, 2025, Dublin, Ireland, 2025 |
|
| Speech Data Selection for Efficient ASR Fine-Tuning using Domain Classifier and Pseudo-Label Filtering, , , , , , , , , , , and , in: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), 2025 |
[DOI] [URL] |
| TokenVerse++: Towards Flexible Multitask Learning with Dynamic Task Activation, , , , , , , , , , and , in: 2025 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), IEEE, 2025 |
|
| Towards Accessible and Intuitive Shared Autonomy, , in: Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction, 2025 |
[URL] |
| Towards interpretable emotion recognition: Identifying key features with machine learning, and , in: Forum Acusticum/EuroNoise, Malaga, Spain, 2025 |
|
| Unifying Global and Near-Context Biasing in a Single Trie Pass, , , , , , , , , , , , and , in: Text, Speech, and Dialogue. TSD 2025. Lecture Notes in Computer Science, Springer, 2025 |
[DOI] [URL] |
| Whole-Body Impedance Control of a Humanoid Robot Based on Human-Human Demonstration for Human-Robot Collaboration, , , , , and , in: Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), 2025 |
| XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, , , , , , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025 |
[DOI] [URL] |
2024
| ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications, , , , , , , , , , , and , in: Journal of Data-centric Machine Learning Research, 2024 |
[URL] |
| Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024 |
|
| Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction, , and , in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Miami, Florida, USA, pages 5421–5440, Association for Computational Linguistics, 2024 |
[URL] |
| Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, , , , , , , , and , Idiap-RR-10-2024 |
|
| Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, , , , , , , , and , in: Findings of the Association for Computational Linguistics: EMNLP 2024, pages 16747–16762, Association for Computational Linguistics (ACL), 2024 |
[DOI] [URL] |
| TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR, , , , , , , , and , in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 20988–20995, Association for Computational Linguistics (ACL), 2024 |
[DOI] [URL] |
| TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR, , , , , , , , and , Idiap-RR-07-2024 |
[URL] |
| XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, , , , , , , and , Idiap-RR-08-2024 |
[URL] |