Publication list - Idiap Publications

Update cookies preferences

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 |

Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60--67, 2012

attachment

Progress report of a project in very low bit-rate speech coding, Milos Cernak, Philip N. Garner and Petr Motlicek, Idiap-RR-08-2012

attachment

From Nonverbal Cues to Perception: Personality and Social Attractiveness, Alessandro Vinciarelli, Hugues Salamin, Anna Polychroniou, Gelareh Mohammadi and Antonio Origlia, in: LNCS Proceedings on COGNITIVE BEHAVIOURAL SYSTEMS, Springer, 2012

Automatic Attribution of Personality Traits Based on Prosodic Features, Gelareh Mohammadi and Alessandro Vinciarelli, in: IEEE Transactions on Affective Computing, 2012

attachment

Translation Error Spotting from a User's Point of View, Thomas Meyer, Idiap-RR-31-2012

attachment

Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

attachment

Decision tree clustering for KL-HMM, David Imseng and John Dines, Idiap-Com-01-2012

attachment

Improving Control of Dexterous Hand Prostheses Using Adaptive Learning, Tatiana Tommasi, Francesco Orabona, Claudio Castellini and Barbara Caputo, Idiap-RR-07-2012

attachment

Improving Control of Dexterous Hand Prostheses Using Adaptive Learning, Tatiana Tommasi, Francesco Orabona, Claudio Castellini and Barbara Caputo, in: IEEE TRANSACTIONS ON ROBOTICS, 2012

attachment

[DOI]

Transfer Learning of Visual Concepts across Robots: a Discriminative Approach, Sriram Prasath Elango, Tatiana Tommasi and Barbara Caputo, Idiap-RR-06-2012

attachment

The INTERSPEECH 2012 Speaker Trait Challenge, Björn Schuller, Stefan Steidl, Anton Batliner, Elmar Nöth, Alessandro Vinciarelli, Felix Burkhardt, Rob Van Son, felix Weninger, Florian Eyben, Tobias Bocklet, Gelareh Mohammadi and Benjamin Weiss, in: in Proceedings of INTERSPEECH, 2012

The ICSI RT-09 Speaker Diarization System, Gerald Friedland, Adam Janin, David Imseng, Xavier Anguera, Luke Gottlieb, Marijn Huijbregts, Mary Tai Knox and Oriol Vinyals, in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):371--381, 2012

[DOI]

The Kaldi Speech Recognition Toolkit, Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer and Karel Vesely, in: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, Hilton Waikoloa Village, Big Island, Hawaii, US, IEEE Signal Processing Society, 2011

attachment

A tree-based distance between distributions: application to classification of neurons, Riwal Lefort and Francois Fleuret, in: ICASSP 2012 : IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

Morphodynamic profiling to explore spatio-temporal signaling networks regulating neurite outgrowth, L. Fusco, Kevin C. Smith, F. Benmansour, Riwal Lefort, Francois Fleuret, Pascal Fua and O. Pertz, in: 1st International SystemsX.ch Conference on Systems Biology, 2011

Automated Quantification of Morphodynamics for High-Throughput Live Cell Imaging Datasets, German Gonzalez, L. Fusco, Riwal Lefort, F. Benmansour, Pascal Fua and Kevin C. Smith, in: 1st International SystemsX.ch Conference on Systems Biology, 2011

Machine learning techniques to analyse complex, computer vision-extracted, dynamic cellular phenotypes, Riwal Lefort, L. Fusco, F. Benmansour, Kevin C. Smith, O. Pertz and Francois Fleuret, in: 1st International SystemsX.ch Conference on Systems Biology, 2011

Computational Methods For Structured Sparse Component Analysis of Convolutive Speech Mixtures, Afsaneh Asaei, Michael E. Davies, Hervé Bourlard and Volkan Cevher, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

attachment

Cross-pollination of normalisation techniques from speaker to face authentication using Gaussian mixture models, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 7(2):553 -- 562, 2012

attachment

Cross-pollination of normalisation techniques from speaker to face authentication using Gaussian mixture models, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, Idiap-RR-03-2012

attachment

Using KL-divergence and multilingual information to improve ASR for under-resourced languages, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869--4872, 2012

attachment

Hierarchical Tandem Features for ASR in Mandarin, Joel Pinto, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, 2011

Look at who's talking, M. Cristani, A. Pesarin, Alessandro Vinciarelli, M. Crocco and V. Murino, in: Proceedings of International Conference on Ambient Intelligence, pages 68-76, 2011

Recent Developments in Social Signal Processing, Albert Ali Salah, Maja Pantic and Alessandro Vinciarelli, in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 380-385, 2011

Understanding Social Signals in Multi-party Conversations: Automatic Recognition of Socio-Emotional Roles in the AMI Meeting Corpus, Fabio Valente, Alessandro Vinciarelli, Sree Harsha Yella and A. Sapru, in: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 374-379, 2011

Towards Computational Proxemics: Inferring Social Relations from Interpersonal Distances, M. Cristani, G. Paggetti, Alessandro Vinciarelli, L. Bazzani, G. Menegaz and V. Murino, in: Proceedings of the IEEE International Conference on Social Computing, pages 290-297, 2011

Conversation Analysis at Work: Detection of Conflict in Competitive Discussions through Automatic Turn-Organization Analysis, A. Pesarin, M. Cristani, V. Murino and Alessandro Vinciarelli, in: Cognitive Processing, 2012

Bridging the Gap Between Social Animal and Unsocial Machine: A Survey of Social Signal Processing, Alessandro Vinciarelli, Maja Pantic, Dirk Heylen, C. Pelachaud, I. Poggi, F. D'Errico and M. Schroeder, in: IEEE Transactions on Affective Computing, 2012

Automatic Role Recognition in Multiparty Conversations: an Approach Based on Turn Organization, Prosody and Conditional Random Fields, Hugues Salamin and Alessandro Vinciarelli, in: IEEE Transactions on Multimedia, 2012

Introduction to Sequence Analysis for Human Behavior Understanding, Hugues Salamin and Alessandro Vinciarelli, in: "Computer Analysis of Human Behavior" by A.Salah and T.Gevers (eds.), pages 21-40, Springer Verlag, 2011

Social Signal Processing: The Research Agenda, Maja Pantic, R. Cowie, F. D'Errico, Dirk Heylen, M. Mehu, C. Pelachaud, I. Poggi, M. Schroeder and Alessandro Vinciarelli, in: "Visual Analysis of Humans" by T.B.Moeslund, A.Hilton, V.Krueger and L.Sigal (eds.), pages 511-538, Springer Verlag, 2011

Analysis of Verbal and Nonverbal Communication and Enactment: The Processing Issues, A. Esposito, Alessandro Vinciarelli, K. Vicsi, C. Pelachaud and A. Nijholt, Springer Verlag, 2011

Open-ended Learning of Visual and Multi-modal Patterns, Jie Luo, Ecole polytechnique fédérale de Lausanne, 2011

attachment

A Bimodal Sound Source Model for Vehicle Tracking in Traffic Monitoring, Patrick Marmaroli, Jean-Marc Odobez, Xavier Falourd and Hervé Lissek, in: European Signal Processing Conference, 2011

attachment

Torch7: A Matlab-like Environment for Machine Learning, Ronan Collobert, Koray Kavukcuoglu and Clément Farabet, in: BigLearn, NIPS Workshop, 2011

attachment

Learning Structured Embeddings of Knowledge Bases, Antoine Bordes, Jason Weston, Ronan Collobert and Yoshua Bengio, in: Conference on Artificial Intelligence, 2011

attachment

Deep Learning for Efficient Discriminative Parsing, Ronan Collobert, in: International Conference on Artificial Intelligence and Statistics, 2011

attachment

Natural Language Processing (Almost) from Scratch, Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu and Pavel Kuksa, in: Journal of Machine Learning Research, 12:2493-2537, 2011

attachment

Fast Human Detection from Joint Appearance and Foreground Feature Subset Covariances, Jian Yao and Jean-Marc Odobez, in: Computer Vision and Image Understanding, 115(10):1414-1426, 2011

attachment

Evaluation of Meeting Support Technology, Simon Tucker and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 237-252, Cambridge University Press, 2012

User Requirements for Meeting Support Technology, Denis Lalanne and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 210-221, Cambridge University Press, 2012

Multimodal Signal Processing for Meetings: an Introduction, Andrei Popescu-Belis and Jean Carletta, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 1-11, Cambridge University Press, 2012

attachment

BROADBAND BEAMPATTERN FOR MULTI-CHANNEL SPEECH ACQUISITION AND DISTANT SPEECH RECOGNITION, Mohammad J. Taghizadeh, Philip N. Garner and Hervé Bourlard, Idiap-RR-39-2011

attachment

Finding Audio-Visual Events in Informal Social Gatherings, Xavier Alameda-Pineda, Vasil Khalidov, Radu Horaud and Florence Forbes, in: IEEE/ACM 13th International Conference on Multimodal Interaction, 2011

attachment

Engagement-based Multi-party Dialog with a Humanoid Robot, David Klotz, Johannes Wienke, Julia Peltason, Britta Wrede, Sebastian Wrede, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 341-343, 2011

attachment

Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-38-2011

attachment

OM-2: An Online Multi-class Multi-kernel Learning Algorithm, Jie Luo, Francesco Orabona, Marco Fornoni, Barbara Caputo and Nicolo Cesa-Bianchi, in: In Proceeding of CVPR 2010, Online Learning for Computer Vision Workshop, 2010

attachment

Hand Gesture Analysis, Cem Keskin, Oya Aran and Lale Akarun, in: Computer Analysis of Human Behavior,, pages 125-149, Springer London, 2011

Analysis of Group Conversations: Modeling Social Verticality, Oya Aran and Daniel Gatica-Perez, in: Computer Analysis of Human Behavior, pages 293-322, Springer London, 2011

A Nonverbal Behavior Approach to Identify Emergent Leaders in Small Groups, Dairazalia Sanchez-Cortes, Oya Aran, Marianne Schmid Mast and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 14(3-2):816-832, 2012

attachment

[DOI]

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 |

processing time: 0.0004 seconds.