Idiap Research Institute
All publications sorted by author


O

A Context-Aware Speech Recognition and Understanding System for Air Traffic Control Domain, Youssef Oualil, Dietrich Klakow, Gyorgy Szaszak, Ajay Srinivasamurthy, Hartmut Helmke and Petr Motlicek, in: Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, Okinawa, Japan, 2017
A Probabilistic Framework for Multiple Speaker Localization, Youssef Oualil, Mathew Magimai.-Doss, Friedrich Faubel and Dietrich Klakow, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013
A Large-Scale Database of Images and Captions for Automatic Face Naming, Mert Ozcan, Jie Luo, Vittorio Ferrari and Barbara Caputo, in: Proceedings of the 22nd British Machine Vision Conference, 2011

P

Probabilistic models for music, Jean-François Paiement, Ecole Polytechnique Fédérale de Lausanne, 2008
A Probabilistic Model for Chord Progressions, Jean-François Paiement, Douglas Eck and Samy Bengio, in: Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR), 2005
A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, Jean-François Paiement, Douglas Eck, Samy Bengio and David Barber, in: Proceedings of the 22nd International Conference on Machine Learning, 2005
A Distance Model for Rhythms, Jean-François Paiement, Yves Grandvalet, Samy Bengio and Douglas Eck, in: 25th International Conference on Machine Learning (ICML), 2008
A Generative Model for Rhythms, Jean-François Paiement, Yves Grandvalet, Samy Bengio and Douglas Eck, in: NIPS Workshop on Brain, Music and Cognition, 2007
Towards End-to-End Speech Recognition, Dimitri Palaz, Ecole Polytechnique Fédérale de Lausanne, 2016
Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, Dimitri Palaz, Mathew Magimai.-Doss and Ronan Collobert, in: International Conference on Acoustics, Speech and Signal Processing, South Brisbane, QLD, pages 4295-4299, IEEE, 2015
Analysis of CNN-based Speech Recognition System using Raw Speech as Input, Dimitri Palaz, Mathew Magimai.-Doss and Ronan Collobert, in: Proceedings of Interspeech, Dresden, pages 11-15, ISCA, 2015
Joint Phoneme Segmentation Inference and Classification using CRFs, Dimitri Palaz, Mathew Magimai.-Doss and Ronan Collobert, in: Global Conference on Signal and Information Processing, Atlanta, GA, pages 587-591, IEEE, 2014
Perspectives and limitations of visible-thermal image pair synthesis via generative adversarial networks, Danick Panchard, François Marelli, Edouard De Moura Presa, Peter Wellig and Michael Liebling, in: Security + Defence, Target and Background Signatures VII, Proc. of SPIE, online only, pages 1186509-1 to 1186509-8, SPIE, 2021
Sparse multi-view hand-object reconstruction for unseen environments, Yik Lung Pang, Changjae Oh and Andrea Cavallaro, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024
σ-GPTs: A New Approach to Autoregressive Models, Arnaud Pannatier, Evann Courdier and Francois Fleuret, in: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024
Learning, Generating and Adapting Wave Gestures for Expressive Human-Robot Interaction, M. Panteris, S. Manschitz and Sylvain Calinon, in: Proc. ACM/IEEE Intl Conf. on Human-Robot Interaction (HRI), pages 386-388, 2020
Social Signal Processing: The Research Agenda, Maja Pantic, R. Cowie, F. D'Errico, Dirk Heylen, M. Mehu, C. Pelachaud, I. Poggi, M. Schroeder and Alessandro Vinciarelli, in: "Visual Analysis of Humans" by T.B. Moeslund, A. Hilton, V. Krueger and L. Sigal (eds.), pages 511-538, Springer Verlag, 2011
Implicit Human Centered Tagging, Maja Pantic and Alessandro Vinciarelli, in: IEEE Signal Processing Magazine, 26, 2009
A memory of motion for visual predictive control tasks, Antonio Paolillo, Teguh Santoso Lembono and Sylvain Calinon, in: International Conference on Robotics and Automation, 2020
GILE: A Generalized Input-Label Embedding for Text Classification, Nikolaos Pappas and James Henderson, in: Transactions of the Association for Computational Linguistics (TACL), 2019