All publications sorted by author
J
BLM-It - Blackbird Language Matrices for Italian: A CALAMITA Challenge, , , and , in: Proceedings of the 10th Italian Conference on Computational Linguistics, 2024 |
Can We Learn to Select the Right Algorithm for OOD Generalization?, and , in: Out Of Distribution Generalization in Computer Vision, Workshop at ECCV, 2024 |
Comparing Stability and Discriminatory Power of Hand-crafted Versus Deep Radiomics: A 3D-Printed Anthropomorphic Phantom Study, , , , , , , , , , and , in: Proceedings of the 12th European Workshop on Visual Information Processing, 2024 |
|
On-line unsupervised adaptation for face verification using Gaussian Mixture Models with multiple user models, , and , Idiap-RR-07-2011 |
|
ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields, , and , in: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pages 17408-17419, 2023 |
[DOI] |
DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation, , and , in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 6039-6048, 2021 |
[URL] |
GeoNeRF: Generalizing NeRF with Geometry Priors, , and , in: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2022 |
[URL] |
VoicePhone: An Interactive Vocal Server for Telephone Numbers, , Idiap-Com-04-1996 |
|
Learning embeddings: efficient algorithms and applications, , École Polytechnique Fédérale de Lausanne, 2018 |
[DOI] |
Kronecker Recurrent Units, , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
Scalable Metric Learning via Weighted Approximate Rank Component Analysis, and , in: ECCV 2016, 2016 |
|
Finding groups of people in Google news, and , in: ACM Int. Conf. on Human-Centered Multimedia (HCM), 2006 |
|
Finding groups of people in Google news, and , Idiap-RR-68-2005 |
|
Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology, , , , , and , in: Proceedings of the ACM International Conference on Multimedia, pages 159-168, 2015 |
|
Acoustic-Labial Speaker Verification, , , and , in: Pattern Recognition Letters, 18(09), 1997 |
|
Integrating Acoustic and Labial Information for Speaker Identification and Verification, , , and , in: Proceedings of the European Conference on Speech Communication and Technology, 1997 |
Acoustic-Labial Speaker Verification, , , and , Idiap-RR-13-1997 |
|
Acoustic-Labial Speaker Verification, , , and , in: Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Springer Verlag, 1997 |
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation, , , , , , and , in: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore, 2023 |
[URL] |
Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding, , , , , , , , , , and , in: Aerospace, 10(10):898, 2023 |
[DOI] [URL] |
Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, , , , , , and , Idiap-RR-14-2021 |
[URL] |
Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, , , , , , and , in: Interspeech 2021, 2021 |
[URL] |
Automatic Speech Recognition Benchmark for Air-Traffic Communications, , , , and , in: Proc. Interspeech 2020, pages 2297-2301, 2020 |
[DOI] |
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers, , , , and , in: Aerospace, 10(5), 2023 |
[DOI] [URL] |
How Does Pre-trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications, , , , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice, , , and , in: Proc. Interspeech 2023, 2023 |
[URL] |
BERTraffic: A Robust BERT-Based Approach for Speaker Change Detection and Role Identification of Air-Traffic Communications, , , , , , and , Idiap-RR-15-2021 |
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications, , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications, , , , , , , , , , , , , , , and , in: Proceedings of 8th OpenSky Symposium 2020, OpenSky Network, pages 1-10, MDPI, 2020 |
[DOI] [URL] |
SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials, , and , in: Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), 2024 |
SemEval-2023 Task 7: Multi-Evidence Natural Language Inference for Clinical Trial Data, , , , , and , in: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), Association for Computational Linguistics, 2023 |
[DOI] [URL] |
NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports, , , , , and , in: Proceedings of The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 2023 |
Understanding the effects of language-specific class imbalance in multilingual fine-tuning, and , in: Findings of the European Chapter of the Association for Computational Linguistics, 2024 |
|
Two-Handed Gestures for Human-Computer Interaction, , Idiap-RR-73-2006 |
|
Two-Handed Gestures for Human-Computer Interaction, , École Polytechnique Fédérale de Lausanne, 2006 |
|
HMM and IOHMM for the Recognition of Mono- and Bi-Manual 3D Hand Gestures, , and , Idiap-RR-39-2004 |
|
Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, , and , in: Proc. of the sixth International Conference on Automatic Face and Gesture Recognition, 2004 |
|
Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, , and , Idiap-RR-63-2003 |
|
Two-Handed Gesture Recognition, and , Idiap-RR-24-2005 |
|
Reconnaissance de gestes 3D bi-manuels, , , and , Idiap-RR-79-2003 |
|
Hand Posture Classification and Recognition using the Modified Census Transform, , and , in: IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR), 2006 |
|
Hand Posture Classification and Recognition using the Modified Census Transform, , and , Idiap-RR-02-2006 |
|
K
Sparse Autoencoders for Speech Modeling and Recognition, , École polytechnique fédérale de Lausanne, 2023 |
[DOI] |
Sparse Autoencoders to Enhance Speech Recognition, and , Idiap-RR-10-2022 |
|
From Undercomplete to Sparse Overcomplete Autoencoders to Improve LF-MMI Speech Recognition, and , in: Proceedings of Interspeech Conference, 2022 |
|
Speech Modeling Using Sparse Autoencoders, and , Idiap-RR-11-2022 |
|
On Learning to Identify Genders from Raw Speech Signal Using CNNs, , and , in: Proceedings of Interspeech, Hyderabad, India, pages 287-291, 2018 |
[DOI] |
Modeling dominance effects on nonverbal behaviors using granger causality, , , , , and , in: Proceedings of International Conference on Multimodal Interaction, ICMI 2012, Santa Monica, CA, 2012 |
|