All publications sorted by title
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |
M
Multimodal Neural Machine Translation System for English to Bengali, , , , , , and , Idiap-RR-13-2021 |
|
Multimodal Neural Machine Translation System for English to Bengali, , , , , , and , in: Proceedings of the First Workshop on Multimodal Machine Translation for Low Resource Languages (MMTLRL 2021), Online (Virtual Mode), pages 31--39, INCOMA Ltd., 2021 |
[URL] |
Multimodal Person Recognition in Audio-Visual Streams, , EPFL, 2019 |
[DOI] |
Multimodal Reranking of Content-based Recommendations for Hyperlinking Video Snippets, , , and , in: ACM International Conference on Multimedia Retrieval, 2014 |
|
Multimodal Signal Processing for Meetings: an Introduction, and , in: Multimodal Signal Processing: Human Interactions in Meetings, pages 1-11, Cambridge University Press, 2012 |
|
Multimodal Signal Processing: Human Interactions in Meetings, , , and , Cambridge University Press, 2012 |
[URL] |
Multimodal Signal Processing: Methods and Techniques to Build Multimodal Interactive Systems, , and , Academic Press, 2009 |
Multimodal Speech Processing Using Asynchronous Hidden Markov Models, , in: Information Fusion, 5(2), 2004 |
|
Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers, , , and , in: International Conference on Language Resources and Evaluation (LREC 2022), 2022 |
|
Multiple Hypotheses Video OCR, and , in: Proceedings of the 4th International Workshop on Document Analysis System, 2000 |
|
Multiple Hypotheses Video OCR, and , Idiap-RR-28-2000 |
|
Multiple Object Tracking using Flow Linear Programming, , and , Idiap-RR-10-2009 |
|
Multiple Object Tracking using K-Shortest Paths Optimization, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011 |
Multiple Timescale Feature Combination towards Robust Speech Recognition, , in: KONVENS 2000 / Sprachkommunikation, 2000 |
|
Multiple Timescale Feature Combination towards Robust Speech Recognition, , Idiap-RR-29-2000 |
|
Multispectral Deep Embeddings As a Countermeasure To Custom Silicone Mask Presentation Attacks, , and , in: IEEE Transactions on Biometrics, Behavior, and Identity Science, 2019 |
|
Multistream Speaker Diarization beyond Two Acoustic Feature Streams, , and , in: International Conference on Acoustics, Speech, and Signal Processing, 2010 |
|
Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features, , and , in: Speech Communication, 54(1), 2012 |
[DOI] |
MULTISTREAM SPEAKER DIARIZATION THROUGH INFORMATION BOTTLENECK SYSTEM OUTPUTS COMBINATION, , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2011 |
|
Multitask adaptation with Lattice-Free MMI for multi-genre speech recognition of low resource languages, , and , in: Proceedings of Interspeech 2021, 2021 |
|
Multitask Learning to Improve Articulatory Feature Estimation and Phoneme Recognition, and , Idiap-RR-21-2011 |
|
Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Republic of Korea, pages 12592-12596, IEEE, 2024 |
[DOI] [URL] |
Multiview Face Detection, , and , Idiap-RR-49-2005 |
|
Mutliscale Facial Expression Recognition using Convolutional Neural Networks, , in: Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP 02), 2002 |
|
Mutliscale Facial Expression Recognition using Convolutional Neural Networks, , Idiap-RR-52-2002 |
|
MUTUAL INFORMATION BASED CHANNEL SELECTION FOR SPEAKER DIARIZATION OF MEETINGS DATA, , and , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009 |
|
Mutual Information based Channel Selection for Speaker Diarization of Meetings Data, , and , in: Proceedings of International conference on acoustics speech and signal processing, 2009 |
My Own Private Nightlife: Understanding Youth Personal Spaces from Crowdsourced Video, , and , in: Proc. ACM Hum.-Comput. Interact, 3(189), 2019 |
|
N
N-gram-Based Low-Dimensional Representation for Document Classification, and , in: International Conference on Learning Representations, 2015 |
[URL] |
NASAL SPEECH SOUNDS DETECTION USING CONNECTIONIST TEMPORAL, and , in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2018 |
|
NASAL SPEECH SOUNDS DETECTION USING CONNECTIONIST TEMPORAL CLASSIFICATION, and , Idiap-RR-28-2017 |
|
Natural Language Inference over Tables: Enabling Explainable Data Exploration on Data Lakes, , , and , in: 18th Extended Semantic Web Conference (ESWC), 2021 |
[URL] |
Natural Language Processing (Almost) from Scratch, , , , , and , in: Journal of Machine Learning Research, 12:2493-2537, 2011 |
|
Natural Language Processing in Healthcare, , , , and , Taylor & Francis Groups, 2022 |
[DOI] [URL] |
Natural Language Understanding for Navigation of Service Robots in Low-Resource Domains and Languages: Scenarios in Spanish and Nahuatl, , , , and , in: Mathematics, 12(8), 2024 |
[DOI] [URL] |
Natural Scene Image Modeling using Color and Texture Visterms., and , in: Conference on Image and Video Retrieval CIVR, 2006 |
|
Natural Scene Image Modeling using Color and Texture Visterms., and , Idiap-RR-17-2006 |
|
Nearly optimal exploration-exploitation decision thresholds, , in: Int. Conf. on Artificial Neural Networks (ICANN), 2006 |
|
Nearly optimal exploration-exploitation decision thresholds, , Idiap-RR-12-2006 |
|
Nested Mini-Batch K-Means, and , in: Proceedings of NIPS, 2016 |
Neural conditional random fields, and , in: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Chia Laguna, Sardinia, Italy, JMLR: W&CP, 2010 |
|
Neural nets approaches to Speaker Verification: comparison with Second Order Statistical Measure, and , in: ICASSP, 1995 |
Neural Network Adaptation and Data Augmentation for Multi-Speaker Direction-of-Arrival Estimation, , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29:1303-1317, 2021 |
[DOI] [URL] |
Neural Network Adaptations to Hardware Implementations, and , in: Handbook of Neural Computation, Institute of Physics Publishing and Oxford University Publishing, 1997 |
|
Neural Network Adaptations to Hardware Implementations, and , Idiap-RR-17-1997 |
|
Neural Network based End-to-End Query by Example Spoken Term Detection, , and , in: IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2020 |
|
Neural Network based Regression for Robust Overlapping Speech Recognition using Microphone Arrays, , , and , Idiap-RR-09-2008 |
|
Neural Network Classification and Formalization, , in: Computer Standards & Interfaces, 16(03), 1994 |
Neural Network Formalization, , Idiap-RR-01-1992 |
|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |