All publications sorted by author
O
A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, , , and , Idiap-RR-10-2012 |
|
A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, , , and , in: 20th European Signal Processing Conference, 2012 |
|
A Context-Aware Speech Recognition and Understanding System for Air Traffic Control Domain, , , , , and , in: Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, Okinawa, Japan, 2017 |
|
A Probabilistic Framework for Multiple Speaker Localization, , , and , in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013 |
|
Joint Detection and Localization of Multiple Speakers using a Probabilistic Interpretation of the Steered Response Power, , , and , in: Statistical and Perceptual Audition Workshop, 2012 |
|
A Probabilistic Framework for Multiple Speaker Localization, , , and , Idiap-RR-37-2012 |
|
A Large-Scale Database of Images and Captions for Automatic Face Naming, , , and , Idiap-RR-26-2011 |
|
A Large-Scale Database of Images and Captions for Automatic Face Naming, , , and , in: Proceedings of the 22nd British Machine Vision Conference, 2011 |
|
P
Probabilistic models for music, , Ecole Polytechnique Fédérale de Lausanne, 2008 |
[URL] |
Probabilistic Models for Melodic Prediction, , and , Idiap-RR-50-2008 |
|
A Probabilistic Model for Chord Progressions, , and , in: Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR), 2005 |
|
Chord Representations for Probabilistic Models, , and , Idiap-RR-58-2005 |
|
A Probabilistic Model for Chord Progressions, , and , Idiap-RR-57-2005 |
|
A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, , , and , in: Proceedings of the 22nd International Conference on Machine Learning, 2005 |
|
A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, , , and , Idiap-RR-33-2005 |
|
Predictive Models for Music, , and , Idiap-RR-51-2008 |
|
A Distance Model for Rhythms, , , and , in: 25th International Conference on Machine Learning (ICML), 2008 |
|
A Distance Model for Rhythms, , , and , Idiap-RR-33-2008 |
|
A Generative Model for Rhythms, , , and , in: NIPS Workshop on Brain, Music and Cognition, 2007 |
|
A Generative Model for Rhythms, , , and , Idiap-RR-70-2007 |
|
Evaluation of 1-Year in-Home Monitoring Technology by Home-Dwelling Older Adults, Family Caregivers, and Nurses, , , , , , , and , in: Frontiers in Public Health, 8:9, 2020 |
[DOI] [URL] |
Towards End-to-End Speech Recognition, , Ecole Polytechnique Fédérale de Lausanne, 2016 |
[DOI] |
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, , and , Idiap-RR-13-2013 |
|
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, , and , in: Proceedings of Interspeech, 2013 |
|
End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks, , and , Idiap-RR-40-2013 |
|
End-to-End Acoustic Modeling using Convolutional Neural Networks for HMM-based Automatic Speech Recognition, , and , in: Speech Communication, 108:15--32, 2019 |
[DOI] |
End-to-End Acoustic Modeling using Convolutional Neural Networks for Automatic Speech Recognition, , and , Idiap-RR-18-2016 |
|
Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, , and , in: International Conference on Acoustics, Speech and Signal Processing, IEEE, South Brisbane, QLD, pages 4295 - 4299, IEEE, 2015 |
|
Analysis of CNN-based Speech Recognition System using Raw Speech as Input, , and , Idiap-RR-23-2015 |
|
Learning linearly separable features for speech recognition using convolutional neural networks, , and , Idiap-RR-24-2015 |
[URL] |
Analysis of CNN-based Speech Recognition System using Raw Speech as Input, , and , in: Proceedings of Interspeech, ISCA, Dresden, pages 11-15, ISCA, 2015 |
|
Raw Speech Signal-based Continuous Speech Recognition using Convolutional Neural Networks, , and , Idiap-RR-15-2014 |
|
Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, , and , Idiap-RR-18-2014 |
|
Joint Phoneme Segmentation Inference and Classification using CRFs, , and , in: Global Conference on Signal and Information Processing, Atlanta, GA, pages 587 - 591, IEEE, 2014 |
[DOI] |
Perspectives and limitations of visible-thermal image pair synthesis via generative adversarial networks, , , , and , in: Security + Defence, Target and Background Signatures VII, Proc. of SPIE, online only, pages 1186509-1--1186509-8, SPIE, 2021 |
[DOI] [URL] |
Predicting the Causal Effect Relationship Between COPD and Cardio Vascular Diseases, , , and , in: Informatica, 44(4), 2020 |
[DOI] [URL] |
Extreme Learning Machines with feature selection using GA for effective prediction of fetal heart disease: A Novel Approach, , , and , in: Informatica, 45(3), 2021 |
[DOI] [URL] |
Sparse multi-view hand-object reconstruction for unseen environments, , and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024 |
[URL] |
Extending Capabilities of Attention-based Models, , EDIC - EPFL, 2024 |
|
σ-GPTs: A New Approach to Autoregressive Models, , and , in: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024 |
|
Inference from Real-World Sparse Measurements, , and , in: TMLR, 2024 |
|
Efficient Wind Speed Nowcasting with GPU-Accelerated Nearest Neighbors Algorithm, , and , Idiap-RR-05-2022 |
|
Efficient Wind Speed Nowcasting with GPU-Accelerated Nearest Neighbors Algorithm, , and , in: Proceedings of SIAM Data Mining, Virginia US and Virtual, 2022 |
Learning, Generating and Adapting Wave Gestures for Expressive Human-Robot Interaction, , and , in: Proc. ACM/IEEE Intl Conf. on Human-Robot Interaction (HRI), pages 386-388, 2020 |
[DOI] [URL] |
Social Signal Processing: The Research Agenda, , , , , , , , and , in: "Visual Analysis of Humans" by T.B.Moeslund, A.Hilton, V.Krueger and L.Sigal (eds.), pages 511-538, Springer Verlag, 2011 |
Implicit Human Centered Tagging, and , in: IEEE Signal Processing Magazine, 26, 2009 |
|
A memory of motion for visual predictive control tasks, , and , in: International Conference on Robotics and Automation, 2020 |
|
Enhancing user acceptance in automated systems with human-centric lighting: the role of visual comfort, personality, and preference, , , , , , , and , in: Journal of Physics: Conference Series, IOP Publishing Ltd, 2023 |
[DOI] [URL] |
Learning Explainable User Sentiment and Preferences for Information Filtering, , École Polytechnique Fédérale de Lausanne, 2016 |
[DOI] |
GILE: A Generalized Input-Label Embedding for Text Classification, and , in: Transactions of the Association for Computational Linguistics (TACL), 2019 |
|