Publication list - Idiap Publications

Update cookies preferences

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 |

Alternative search techniques for face detection using location estimation and binary features, Venkatesh Bala Subburaman, ECOLE POLYTECHNIQUE FEDERALE DE LAUSANNE, 2012

attachment

Automatic Speaker Role Labeling in AMI Meetings: Recognition of Formal and Social Roles, A. Sapru and Fabio Valente, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012, 2012

attachment

A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, Youssef Oualil, Friedrich Faubel, Mathew Magimai-Doss and Dietrich Klakow, Idiap-RR-10-2012

attachment

Extracting Mobile Behavioral Patterns with the Distant N-Gram Topic Model, Katayoun Farrahi and Daniel Gatica-Perez, in: Proceedings of the IEEE International Symposium on Wearable Computers, Newcastle, 2012

attachment

Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies, Bruno Cartoni and Thomas Meyer, in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), Istanbul, TR, pages 6, 2012

attachment

Using Sense-labeled Discourse Connectives for Statistical Machine Translation, Thomas Meyer and Andrei Popescu-Belis, in: Proceedings of the EACL2012 Workshop on Hybrid Approaches to Machine Translation (HyTra), Avignon, FR, pages 129--138, 2012

attachment

A Multiple Hypothesis Gaussian Mixture Filter for Acoustic Source Localization and Tracking, Youssef Oualil, Friedrich Faubel and Dietrich Klakow, Idiap-RR-09-2012

attachment

Automatic detection of conflicts in spoken conversations: ratings and analysis of broadcast political debates, Samuel Kim, Fabio Valente and Alessandro Vinciarelli, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012

attachment

Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60--67, 2012

attachment

Progress report of a project in very low bit-rate speech coding, Milos Cernak, Philip N. Garner and Petr Motlicek, Idiap-RR-08-2012

attachment

From Nonverbal Cues to Perception: Personality and Social Attractiveness, Alessandro Vinciarelli, Hugues Salamin, Anna Polychroniou, Gelareh Mohammadi and Antonio Origlia, in: LNCS Proceedings on COGNITIVE BEHAVIOURAL SYSTEMS, Springer, 2012

Automatic Attribution of Personality Traits Based on Prosodic Features, Gelareh Mohammadi and Alessandro Vinciarelli, in: IEEE Transactions on Affective Computing, 2012

attachment

Translation Error Spotting from a User's Point of View, Thomas Meyer, Idiap-RR-31-2012

attachment

Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

attachment

Decision tree clustering for KL-HMM, David Imseng and John Dines, Idiap-Com-01-2012

attachment

Improving Control of Dexterous Hand Prostheses Using Adaptive Learning, Tatiana Tommasi, Francesco Orabona, Claudio Castellini and Barbara Caputo, Idiap-RR-07-2012

attachment

Improving Control of Dexterous Hand Prostheses Using Adaptive Learning, Tatiana Tommasi, Francesco Orabona, Claudio Castellini and Barbara Caputo, in: IEEE TRANSACTIONS ON ROBOTICS, 2012

attachment

[DOI]

Transfer Learning of Visual Concepts across Robots: a Discriminative Approach, Sriram Prasath Elango, Tatiana Tommasi and Barbara Caputo, Idiap-RR-06-2012

attachment

The INTERSPEECH 2012 Speaker Trait Challenge, Björn Schuller, Stefan Steidl, Anton Batliner, Elmar Nöth, Alessandro Vinciarelli, Felix Burkhardt, Rob Van Son, felix Weninger, Florian Eyben, Tobias Bocklet, Gelareh Mohammadi and Benjamin Weiss, in: in Proceedings of INTERSPEECH, 2012

The ICSI RT-09 Speaker Diarization System, Gerald Friedland, Adam Janin, David Imseng, Xavier Anguera, Luke Gottlieb, Marijn Huijbregts, Mary Tai Knox and Oriol Vinyals, in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):371--381, 2012

[DOI]

A tree-based distance between distributions: application to classification of neurons, Riwal Lefort and Francois Fleuret, in: ICASSP 2012 : IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

Computational Methods For Structured Sparse Component Analysis of Convolutive Speech Mixtures, Afsaneh Asaei, Michael E. Davies, Hervé Bourlard and Volkan Cevher, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

attachment

Cross-pollination of normalisation techniques from speaker to face authentication using Gaussian mixture models, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 7(2):553 -- 562, 2012

attachment

Cross-pollination of normalisation techniques from speaker to face authentication using Gaussian mixture models, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, Idiap-RR-03-2012

attachment

Using KL-divergence and multilingual information to improve ASR for under-resourced languages, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869--4872, 2012

attachment

Conversation Analysis at Work: Detection of Conflict in Competitive Discussions through Automatic Turn-Organization Analysis, A. Pesarin, M. Cristani, V. Murino and Alessandro Vinciarelli, in: Cognitive Processing, 2012

Bridging the Gap Between Social Animal and Unsocial Machine: A Survey of Social Signal Processing, Alessandro Vinciarelli, Maja Pantic, Dirk Heylen, C. Pelachaud, I. Poggi, F. D'Errico and M. Schroeder, in: IEEE Transactions on Affective Computing, 2012

Automatic Role Recognition in Multiparty Conversations: an Approach Based on Turn Organization, Prosody and Conditional Random Fields, Hugues Salamin and Alessandro Vinciarelli, in: IEEE Transactions on Multimedia, 2012

Evaluation of Meeting Support Technology, Simon Tucker and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 237-252, Cambridge University Press, 2012

User Requirements for Meeting Support Technology, Denis Lalanne and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 210-221, Cambridge University Press, 2012

Multimodal Signal Processing for Meetings: an Introduction, Andrei Popescu-Belis and Jean Carletta, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 1-11, Cambridge University Press, 2012

attachment

A Nonverbal Behavior Approach to Identify Emergent Leaders in Small Groups, Dairazalia Sanchez-Cortes, Oya Aran, Marianne Schmid Mast and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 14(3-2):816-832, 2012

attachment

[DOI]

Speaker Diarization, Fabio Valente and Gerald Friedland, in: Multimodal Signal Processing: Human Interactions in Meetings, Cambridge University Press, 2012

[URL]

Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Speech Communication, 54(1), 2012

[DOI]

Transcribing meetings with the AMIDA systems, Thomas Hain, Lukas Burget, John Dines, Philip N. Garner, Frantisek Grezl, Asmaa El Hannani, Marijn Huijbregts, Martin Karafiat, Mike Lincoln and Vincent Wan, in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):486--498, 2012

attachment

[DOI]
[URL]

VTLN-Based Rapid Cross-Lingual Adaptation for Statistical Parametric Speech Synthesis, Lakshmi Saheer, Hui Liang, John Dines and Philip N. Garner, Idiap-RR-12-2012

attachment

Combining Vocal Tract Length Normalization with Linear Transformations in a Bayesian Framework, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, Idiap-RR-11-2012

attachment

Human Interaction Discovery in Smartphone Proximity Networks, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: Personal and Ubiquitous Computing, 2012

attachment

Mining Large-Scale Smartphone Data for Personality Studies, Gokul Chittaranjan, Jan Blom and Daniel Gatica-Perez, in: Personal and Ubiquitous Computing, 2012

attachment

Multimodal Cue Detection Engine for Orchestrated Entertainment, Danil Korchagin, Stefan Duffner, Petr Motlicek and Carl Scheffler, in: Proceedings International Conference on MultiMedia Modeling, Klagenfurt, Austria, 2012

attachment

Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, David Imseng, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-01-2012

attachment

A Fast Parts-based Approach to Speaker Verification using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 7(1):241-254, 2012

attachment

The Kaldi Speech Recognition Toolkit, Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer and Karel Vesely, Idiap-RR-04-2012

attachment

Multimodal Signal Processing: Human Interactions in Meetings, Steve Renals, Hervé Bourlard, Jean Carletta and Andrei Popescu-Belis, Cambridge University Press, 2012

[URL]

Finding Information in Multimedia Records of Meetings, Andrei Popescu-Belis, Denis Lalanne and Hervé Bourlard, in: IEEE Multimedia, 19(2):48-57, 2012

[DOI]
[URL]

A real-time deformable detector., Karim Ali, Francois Fleuret, David Hasler and Pascal Fua, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012

attachment

Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-28-2012

attachment

Face detection using boosted Jaccard distance-based regression, Cosmin Atanasoaei, Chris McCool and Sébastien Marcel, Idiap-RR-02-2012

attachment

Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, Gelareh Mohammadi and Alessandro Vinciarelli, Idiap-RR-05-2012

attachment

Bayesian Approaches to Uncertainty in Speech Processing, Philip N. Garner, School of Computing Sciences, University of East Anglia, 2011

attachment

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 |

processing time: 0.0005 seconds.