Idiap Research Institute
All publications sorted by journal and type


Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2011)

Speaker Diarization of Meetings based on Speaker Role N-gram Models, Fabio Valente, Deepu Vijayasenan and Petr Motlicek, in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011

International Conference on Computer Vision (2011)

Tasting Families of Features for Image Classification, Charles Dubout and Francois Fleuret, in: International Conference on Computer Vision, 2011

IEEE 2011 Workshop on Automatic Speech Recognition and Understanding (2011)

The Kaldi Speech Recognition Toolkit, Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer and Karel Vesely, in: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, Hilton Waikoloa Village, Big Island, Hawaii, US, IEEE Signal Processing Society, 2011

European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (2011)

The MASH Project, Francois Fleuret, Philip Abbet, Charles Dubout and Leonidas Lefakis, in: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2011

International Conference on Signal Acquisition and Processing (2011)

The TA2 Database - A Multi-Modal Database from Home Entertainment, Stefan Duffner, Petr Motlicek and Danil Korchagin, in: International Conference on Signal Acquisition and Processing, Singapore, 2011

BigLearn, NIPS Workshop (2011)


Proceedings of the IEEE International Conference on Social Computing (2011)

Towards Computational Proxemics: Inferring Social Relations from Interpersonal Distances, M. Cristani, G. Paggetti, Alessandro Vinciarelli, L. Bazzani, G. Menegaz and V. Murino, in: Proceedings of the IEEE International Conference on Social Computing, pages 290-297, 2011

IEEE International Conference on Robotics and Automation (2011)

Towards semi-supervised learning of semantic spatial concepts, Jesus Martinez-Gomez and Barbara Caputo, in: IEEE International Conference on Robotics and Automation, 2011

Proceedings of the IEEE International Conference on Computer Vision (2011)

Tracking Multiple Objects under Global Appearance Constraints, Horesh Ben Shitrit, Jerome Berclaz, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE International Conference on Computer Vision, 2011

Visual Surveillance Workshop at ICCV (2011)


Proceedings of the 28th International Conference on Machine Learning (2011)

Ultra-Fast Optimization Algorithm for Sparse Multi Kernel Learning, Francesco Orabona and Jie Luo, in: Proceedings of the 28th International Conference on Machine Learning, 2011

Proceedings of the IEEE International Conference on Systems, Man and Cybernetics (2011)


Graph-based Methods for Natural Language Processing (2011)


International Symposium on Wearable Computing (2011)


Proceedings of AAAI International Conference on Weblogs and Social Media (2011)

You Are Known by How You Vlog: Personality Impressions and Nonverbal Behavior in YouTube, Joan-Isaac Biel, Oya Aran and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, Barcelona, 2011

Proceedings of Interspeech, Japan (2010)


Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (2010)

A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, Hui Liang, John Dines and Lakshmi Saheer, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010

CLEF 2010 Notebook Papers/LABs/Workshops (2010)


LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010 (2010)

A Multimodal Corpus for Studying Dominance in Small Group Conversations, Oya Aran, Hayley Hung and Daniel Gatica-Perez, in: LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010, 2010

Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010), Carnegie Mellon University, Pittsburgh, PA, USA (2010)

A Random Walk Framework to Compute Textual Semantic Similarity: a Unified Model for Three Benchmark Tasks, Majid Yazdani and Andrei Popescu-Belis, in: Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010), Carnegie Mellon University, Pittsburgh, PA, USA, 2010

NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions (2010)

A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010

Proceedings of Interspeech (2010)


Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2010)

An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, David Imseng and Gerald Friedland, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, pages 4946-4949, 2010
An Alternative Scanning Strategy to Detect Faces, Venkatesh Bala Subburaman and Sébastien Marcel, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

Proceedings of Interspeech (2010)


2010 IEEE International Conference on Acoustics, Speech and Signal Processing (2010)

Application of Out-Of-Language Detection To Spoken-Term Detection, Petr Motlicek and Fabio Valente, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

IEEE International Conference on Acoustics, Speech and Signal Processing (2010)

Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game, Gokul Chittaranjan and Hayley Hung, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

Proceedings of the 33rd Annual ACM SIGIR Conference (2010)


Proceedings of the ACM International Conference on Multimedia (2010)


Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2010)

Automatic Temporal Alignment of AV Data with Confidence Estimation, Danil Korchagin, Philip N. Garner and John Dines, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

2010 IEEE International Conference on Acoustics, Speech and Signal Processing (2010)

Boosted Binary Features for Noise-Robust Speaker Verification, Anindya Roy, Mathew Magimai.-Doss and Sébastien Marcel, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010

The 9th International Conference on Mobile and Ubiquitous Multimedia (2010)

By their apps you shall understand them: mining large-scale patterns of mobile phone usage, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: The 9th International Conference on Mobile and Ubiquitous Multimedia, 2010

20th International Conference on Pattern Recognition, Istanbul, Turkey (2010)

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, Anindya Roy and Sébastien Marcel, in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, International Association for Pattern Recognition (IAPR), 2010

Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (2010)

Delineating Trees in Noisy 2D Images and 3D Image Stacks, German Gonzalez, Engin Turetken, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2799–2806, 2010

Proceedings of 5th International Symposium on Telecommunications (2010)


Proceedings of 9th International Conference on Mobile and Ubiquitous Multimedia (MUM), Limassol, Cyprus (2010)

Discovering Human Places of Interest from Multimodal Mobile Phone Data, Raul Montoliu and Daniel Gatica-Perez, in: Proceedings of 9th International Conference on Mobile and Ubiquitous Multimedia (MUM), Limassol, Cyprus, 2010

Proceedings of Interspeech, Makuhari, Japan, 2010 (2010)

English Spoken Term Detection in Multilingual Recordings, Petr Motlicek, Fabio Valente and Philip N. Garner, in: Proceedings of Interspeech, Makuhari, Japan, ISCA, 2010

ICASSP 2010 (2010)


NIPS workshop on Learning and Planning from Batch Time Series Data (2010)

Extracting Motifs from Time Series Generated by Concurrent Activities, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: NIPS workshop on Learning and Planning from Batch Time Series Data, 2010

ECCV, Workshop on Face Detection: Where we are, and what next? (2010)

Fast Bounding Box Estimation based Face Detection, Venkatesh Bala Subburaman and Sébastien Marcel, in: ECCV, Workshop on Face Detection: Where we are, and what next?, 2010

International Conference on Speech and Language Processing, Interspeech (2010)

Floor Holder Detection and End of Speaker Turn Prediction in Meetings, Alfred Dielmann, Giulia Garau and Hervé Bourlard, in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, ISCA, 2010

20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010 (2010)

Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, Oya Aran and Daniel Gatica-Perez, in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010

Proceedings of Interspeech (2010)

Hands Free Audio Analysis from Home Entertainment, Danil Korchagin, Philip N. Garner and Petr Motlicek, in: Proceedings of Interspeech, Makuhari, Japan, 2010
Hierarchical Multilayer Perceptron based Language Identification, David Imseng, Mathew Magimai.-Doss and Hervé Bourlard, in: Proceedings of Interspeech, Makuhari, Japan, pages 2722-2725, 2010

Proc. ICMI-MLMI '10 International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction (2010)

Identifying Emergent Leadership in Small Groups using Nonverbal Communicative Cues, Dairazalia Sanchez-Cortes, Oya Aran, Marianne Schmid Mast and Daniel Gatica-Perez, in: Proc. ICMI-MLMI '10 International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction, Beijing, ACM, New York, NY, USA, 2010

Proceedings of ISCA Speech Synthesis Workshop (2010)

Implementation of VTLN for Statistical Speech Synthesis, Lakshmi Saheer, John Dines, Philip N. Garner and Hui Liang, in: Proceedings of ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010

Proceedings of ACM Multimedia Workshop on Social Signal Processing (2010)


IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems (2010)

Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010

Proceedings of the Neural Information Processing Systems Conference (2010)

Joint Cascade Optimization Using a Product Of Boosted Classifiers, Leonidas Lefakis and Francois Fleuret, in: Proceedings of the Neural Information Processing Systems Conference, pages 1315–1323, 2010