All publications sorted by title
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 |
I
Improving non-native ASR through stochastic multilingual phoneme space transformations, , , , and , Idiap-RR-19-2011 |
|
Improving non-native ASR through stochastic multilingual phoneme space transformations, , , , and , in: Proceedings of Interspeech, Florence, Italy, pages 537-540, 2011 |
|
Improving Object Classification using Pose Information, , , and , Idiap-RR-30-2012 |
|
Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, and , in: Proceedings of International Conference on Spoken Language Processing (ICSLP'98) Sydney, Australia, 1998 |
|
Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, and , Idiap-RR-11-1998 |
|
Improving Pronoun Translation by Modeling Coreference Uncertainty, and , in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, 2016 |
|
Improving Real Time Factor of Information Bottleneck-based Speaker Diarization System, , and , Idiap-RR-18-2015 |
|
Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders, , , , and , in: Findings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024 |
Improving Single Modal and Multimodal Biometric Authentication Using F-ratio Client-Dependent Normalisation, and , Idiap-RR-52-2004 |
|
Improving Speaker Diarization using social role information, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2014 |
|
Improving speaker turn embedding by crossmodal transfer learning from face embedding, and , in: ICCV Workshop on Computer Vision for Audio-Visual Media, 2017 |
|
Improving speech embedding using crossmodal transfer learning with audio-visual data, and , in: Multimedia Tools and Applications, 78(11):15681-15704, 2019 |
[DOI] |
Improving Speech Processing trough Social Signals: Automatic Speaker Segmentation of Political Debates using Role based Turn-Taking Patterns., and , in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010 |
|
Improving Speech Recognition Using a Data-Driven Approach, , and , in: Proceedings of Interspeech, 2005, 2005 |
|
Improving Speech Recognition Using a Data-Driven Approach, , and , Idiap-RR-66-2005 |
|
Improving the conditioning of the optimization criterion in acoustic multi-channel equalization using shorter reshaping filters, and , in: EURASIP Journal on Advances in Signal Processing(11), 2018 |
|
Improving the control of prosthetic hands with tactile sensing, and , in: Micro & Nano Magazine, Micronarc:42-43, 2018 |
|
Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery, and , in: Proceedings of Interspeech, 2016 |
|
In Search of a Good BET, and , Idiap-Com-11-2003 |
|
In the Mood for Vlog: Multimodal Inference in Conversational Social Video, , , and , in: ACM Transactions on Interactive Intelligent Systems, 5(2), 2015 |
[DOI] |
In-Context Phone Posteriors as Complementary Features for Tandem ASR, and , in: ICSLP'08, 2008 |
|
Incorporating Domain Knowledge with Video and Voice Data Analysis in News Broadcasts, , and , in: Proceedings of the Sixth ACM International Conference on Knowledge Discovery and Data Mining, ACM, Boston, MA, USA, 2000 |
|
Incorporating Domain Knowledge with Video and Voice Data Analysis in News Broadcasts, , and , Idiap-RR-14-2000 |
|
Incorporation of Liquid-Crystal Light Valve Non-Linearities in Optical Multilayer Neural Networks, , and , in: Applied Optics, 35(26), 1996 |
|
Increasing Speech Recognition Noise Robustness with HMM2, , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 02), 2002 |
|
Increasing Speech Recognition Noise Robustness with HMM2, , and , Idiap-RR-36-2001 |
|
Incremental Enrollment of Speech Recognizers, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'99,',','), Phoenix, Arizona, USA, 1999 |
Incremental Learning for Place Recognition in Dynamic Environments, , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS07), 2007 |
|
Incremental Learning for Place Recognition in Dynamic Environments, , , and , Idiap-RR-52-2006 |
|
INCREMENTAL SEMI-SUPERVISED LEARNING FOR MULTI-GENRE SPEECH RECOGNITION, , , , , and , in: Proceedings of ICASSP 2020, 2020 |
|
Incremental Syllable-Context Phonetic Vocoding, , , , and , Idiap-RR-05-2015 |
|
Incremental Syllable-Context Phonetic Vocoding, , , , and , in: IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 23(6), 2015 |
[URL] |
INCREMENTAL TRANSFER LEARNING IN TWO-PASS INFORMATION BOTTLENECK BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, , , and , in: Proceedings of ICASSP 2019, pages 6291-6295, 2019 |
Indexation de Documents Manuscrits, , in: Proceedings du Colloque International Francophone sur l'Ecrit et le Document (CIFED06), 2006 |
|
Indexation de Documents Manuscrits, , Idiap-RR-31-2006 |
|
Indexing Audio Documents by using Latent Semantic Analysis and SOM, , in: Kohonen Maps, Elsevier, 1999 |
|
Indexing Audio Documents by using Latent Semantic Analysis and SOM, , Idiap-RR-13-1999 |
|
Indexing Protected Deep Face Templates by Frequent Binary Patterns, , , , and , in: Proceedings of the 2022 International Joint Conference on Biometrics (IJCB), Abu Dhabi, United Arab Emirates (UAE), IEEE, 2022 |
[DOI] [URL] |
Indexing spoken audio by LSA and SOMs, , in: Proceedings of the European Signal Processing Conference EUSIPCO'2000, 2000 |
Indexing spoken audio by LSA and SOMs, , Idiap-RR-06-2000 |
|
Indoor Place Recognition using Online Independent Support Vector Machines, , , , and , in: 18th British Machine Vision Conference (BMVC07), 2007 |
|
Indoor Radon Risk Assessment with Geostatistics and Artificial Neural Networks, , , , , , and , in: Geostatistical congress 2000, 2000 |
Indoor Scene Recognition using Task and Saliency-driven Feature Pooling, and , in: Proceedings of the British Machine Vision Conference, Guildford, UK, 2012 |
|
Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention, and , in: Transactions on Machine Learning Research, 2023 |
[URL] |
Inference from Real-World Sparse Measurements, , and , in: TMLR, 2024 |
|
Inference in Switching Linear Dynamical Systems Applied to Noise Robust Speech Recognition of Isolated Digits, , Idiap-RR-35-2008 |
|
Inference in switching linear dynamical systems applied to noise robust speech recognition of isolated digits, , Ecole Polytechnique Fédérale de Lausanne, 2008 |
|
Inferring competitive role patterns in reality TV show through nonverbal analysis, and , in: Multimedia Tools and Applications, Special issue on Social Media, 2010 |
|
Inferring Document Similarity from Hyper-links, and , Idiap-RR-21-2005 |
|
Inferring Document Similarity from Hyperlinks, and , in: ACM Conference on Information and Knowledge Management, 2005 |
|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 |