All publications
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 |
Synthetic to Authentic: Transferring Realism to 3D Face Renderings for Boosting Face Recognition, , and , in: European Conference on Computer Vision Workshops, 2024 |
|
On the Information in Deep Biometric Templates: from Vulnerability of Unprotected Templates to Leakage in Protected Templates, , EPFL, 2024 |
[DOI] [URL] |
A Human Perspective to AI-based Candidate Screening, , , , , , and , in: Proceedings of the 58th Hawaii International Conference on System Sciences (HICSS), 2024 |
Hardware-effective Approaches for Skill Extraction in Job Offers and Resumes, , , , , , and , in: The 4th Workshop on Recommender Systems for Human Resources, in conjunction with the 18th ACM Conference on Recommender Systems, 2024 |
|
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, , , , , , , , and , in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, 2024 |
|
TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR, , , , , , , , and , in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, 2024 |
[URL] |
Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction, , and , in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, 2024 |
|
DiffuCOMET: Contextual Commonsense Knowledge Diffusion, , , , , and , in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Bangkok, Thailand, pages 4809–4831, Association for Computational Linguistics, 2024 |
[DOI] [URL] |
Towards Dynamic Skeleton-based Handshape Subunits for Sign Language Assessment, and , Idiap-RR-09-2024 |
|
Detecting Criminal Networks via Non-Content Communication Data Analysis Techniques from the TRACY Project, , , , , , , , , and , in: 15th EAI International Conference on Digital Forensics & Cyber Crime, 2024 |
|
Nonparametric Variational Regularisation of Pretrained Transformers, and , in: First conference on Language Modelling, 2024 |
[URL] |
Second Edition FRCSyn Challenge at CVPR 2024: Face Recognition Challenge in the Era of Synthetic Data, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pages 3173-3183, 2024 |
[URL] |
FRCSyn Challenge at WACV 2024: Face Recognition Challenge in the Era of Synthetic Data, , , , , and , in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops, pages 892-901, 2024 |
[URL] |
Test-time adaptation for automatic pathological speech detection in noisy environments, and , in: EUSIPCO, 2024 |
|
ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild, , and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting, and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 |
[DOI] |
Suppressing Noise Disparity in Training Data for Automatic Pathological Speech Detection, and , in: IWAENC, 2024 |
Recursive Forward Dynamics for Serial Kinematic Chains using Conformal Geometric Algebra, and , in: In Proc. Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2024 |
Adversarial Robustness Analysis in Automatic Pathological Speech Detection Approaches, and , in: Interspeech, 2024 |
|
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, , , , , , , , and , Idiap-RR-10-2024 |
|
Synergizing Natural Language Towards Enhanced Shared Autonomy, , and , in: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024 |
[URL] |
Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks, and , in: Frontiers in Neuroscience, 2024 |
Missed Opportunities in Building Energy Performance Assessment, , , and , in: Journal of Sustainable Real Estate, 16(1), 2024 |
[DOI] |
CCDb-HG: Novel Annotations and Gaze-Aware Representations for Head Gesture Recognition, , , and , in: 18th IEEE Int. Conference on Automatic Face and Gesture Recognition (FG), Istanbul,, 2024 |
|
Sharingan: A Transformer Architecture for Multi-Person Gaze Following, , and , in: Int. Conference Computer Vision and Pattern Recognition (CVPR), Seatle, 2024 |
|
Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following, , , and , in: Int. Conf. Computer Vision and Pattern Recognition (CVPR), Workshop on Gaze Estimation and Prediction in the Wild, 2024 |
|
Segmenting Object Affordances: Reproducibility and Sensitivity to Scale, , , and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Image-guided topic modeling for interpretable privacy classification, and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Comparing Stability and Discriminatory Power of Hand-crafted Versus Deep Radiomics: A 3D-Printed Anthropomorphic Phantom Study, , , , , , , , , , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
Using Backbone Foundation Model for Evaluating Fairness in Chest Radiography Without Demographic Data, , and , in: Proceedings of the IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2024 |
|
GLoFool: global enhancements and local perturbations to craft adversarial images, and , in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2024 |
|
Explaining models relating objects and privacy, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024 |
[URL] |
Sparse multi-view hand-object reconstruction for unseen environments, , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024 |
[URL] |
Open-Vocabulary Object 6D Pose Estimation, , , , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024 |
[URL] |
Test-time adaptation for 6D pose tracking, , and , in: Pattern Recognition, 152, 2024 |
[DOI] [URL] |
Training a Filter-Based Model of the Cochlea in the Context of Pre-Trained Acoustic Models, and , in: Acoustics, 6:470 - 488, 2024 |
[DOI] |
Does Data-Efficient Generalization Exacerbate Bias in Foundation Models?, , , , , and , in: Proceedings of the 18th European Conference on Computer Vision, 2024 |
|
Refining Tuberculosis Detection in CXR Imaging: Addressing Bias in Deep Neural Networks via Interpretability, , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
Extending Capabilities of Attention-based Models, , EDIC - EPFL, 2024 |
|
Neurocomputational model of speech recognition for pathological speech detection: a case study on Parkinson’s disease speech detection, and , in: Proceedings of Interspeech, Kos Island, Greece, pages 3590-3594, 2024 |
[DOI] [URL] |
Face Liveness Detection Competition (LivDet-Face) - 2024, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: IEEE International Joint Conference on Biometrics, 2024 |
|
Feature Representations for Automatic Meerkat Vocalization Classification, , , and , in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, , , , , , , and , Idiap-RR-08-2024 |
[URL] |
TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR, , , , , , , , and , Idiap-RR-07-2024 |
[URL] |
Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection, and , in: International Joint Conference on Biometrics, 2024 |
|
On Measuring Linkability of Multiple Protected Biometric Templates using Maximal Leakage, , and , in: IEEE Access, 2024 |
[DOI] [URL] |
Towards Wine Tasting Activity Recognition for a Digital Sommelier, , , and , in: Proc. ACM Workshop on Exploring Innovative Technology for Commensality and Human-Food Interaction, 2024 |
Integrating large language models and ASR systems using confidence measures and prompting, , Idiap-Com-02-2024 |
|
On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis, and , in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 |