Idiap Research Institute
All publications sorted by recency

Conversational Speech Recognition Needs Data? Experiments with Austrian German, Julian Linke, Philip N. Garner, Gernot Kubin and Barbara Schuppler, in: Proceedings of the 13th Language Resources and Evaluation Conference, European Language Resources Association, pages 4684-4691, 2022
Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track, Tilak Purohit, Imen Ben Mahmoud, Bogdan Vlasenko and Mathew Magimai.-Doss, in: Proceedings of the ICML Expressive Vocalizations Workshop held in conjunction with the 39th International Conference on Machine Learning, Maryland, USA, 2022
Autoencoders Reloaded, Hervé Bourlard and Selen Hande Kabil, in: Springer Biological Cybernetics, 2022
How Did Europe’s Press Cover Covid-19 Vaccination News? A Five-Country Analysis, David Alonso del Barrio and Daniel Gatica-Perez, in: MAD '22: Proceedings of the 1st International Workshop on Multimedia AI against Disinformation, 2022
Visually Grounded Interpretation of Noun-Noun Compounds in English, Inga Lang, Lonneke van der Plas, Malvina Nissim and Albert Gatt, in: Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, Association for Computational Linguistics, 2022
A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings, Anshul Gupta, Samy Tafasca and Jean-Marc Odobez, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
Residual Feature Pyramid Network for Enhancement of Vascular Patterns, Ketan Kotwal and Sébastien Marcel, in: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
The societal and ethical relevance of computational creativity, Michele Loi, Eleonora Viganò and Lonneke van der Plas, in: Proceedings of the International Conference on Computational Creativity, 2020
Compositionality in English deverbal compounds: The role of the head, Gianina Iordachioaia, Lonneke van der Plas and Glorianna Jagfeld, in: The role of constituents in multiword expressions. Phraseology and Multiword Expressions, Language Science Press, Berlin, 2020
Voyager: Data Discovery for Onboarding in Data Science, Alex Bogatu, Norman Paton, Mark Douthwaite and Andre Freitas, in: 37th IEEE International Conference on Data Engineering (ICDE), 2022
Active Learning by Feature Mixing, Amin Parvaneh, Ehsan Abbasnejad, Damien Teney, Reza Haffari, Anton van den Hengel and Javen Qinfeng Shi, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
GeoNeRF: Generalizing NeRF with Geometry Priors, Mohammad Mahdi Johari, Yann Lepoittevin and Francois Fleuret, in: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2022
End-to-End Accented Speech Recognition, Thibault Viglino, Petr Motlicek and Milos Cernak, in: International Conference on Speech and Language Processing, Interspeech, ISCA, Graz, Austria, pages 2140-2144, 2019
Generating Exact Lattices in The WFST Framework, Daniel Povey, Mirko Hannemann, Gilles Boulianne, Lukas Burget, Arnab Ghoshal, Milos Janda, Martin Karafiat, Stefan Kombrink, Petr Motlicek, Yanmin Qian, Korbinian Riedhammer, Karel Vesely and Ngoc Thang Vu, in: Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, The 37th International Conference on Acoustics, Speech, and Signal Processing, Kyoto, Japan, pages 4213-4216, IEEE Signal Processing Society, 2012
Controllability and Interpretability in Affective Speech Synthesis, Bastian Schnell, École polytechnique fédérale de Lausanne, 2022
Gradient-based Methods for Deep Model Interpretability, Suraj Srinivas, École polytechnique fédérale de Lausanne, 2021
Are GAN-based Morphs Threatening Face Recognition?, Eklavya Sarkar, Pavel Korshunov, Laurent Colbois and Sébastien Marcel, in: International Conference on Acoustics, Speech and Signal Processing, 2022