Publication list - Idiap Publications

Update cookies preferences

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 |

Supervised and unsupervised Web-based language model domain adaptation, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012

attachment

Structured Sparsity Models for Reverberant Speech Separation, Afsaneh Asaei, Mohammad Golbabaee, Hervé Bourlard and Volkan Cevher, in: IEEE/ACM Transaction on Audio, Speech and Language Processing, 2014

attachment

Structured Sparse Coding for Microphone Array Location Calibration, Afsaneh Asaei, Bhiksha Raj, Hervé Bourlard and Volkan Cevher, in: SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition, 2012

attachment

Using self-context for multimodal detection of head nods in face-to-face interactions, Laurent Son Nguyen, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-27-2012

attachment

On the Effectiveness of Local Binary Patterns in Face Anti-spoofing, Ivana Chingovska, André Anjos and Sébastien Marcel, Idiap-RR-19-2012

attachment

Computing Text Semantic Relatedness using the Contents and Links of a Hypertext Encyclopedia, Majid Yazdani and Andrei Popescu-Belis, in: Artificial Intelligence Journal, 194:176–202, 2013

attachment

[DOI]

Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, Weifeng Li and Hervé Bourlard, in: Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech), Portland, Oregon, 2012

attachment

Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, Weifeng Li and Hervé Bourlard, Idiap-RR-16-2012

attachment

Integrating Language Identification to improve Multilingual Speech Recognition, Holger Caesar, Idiap-RR-24-2012

attachment

Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings of Interspeech, Portland, Oregon, 2012

attachment

Emergent leaders through looking and speaking: from audio-visual data to multimodal recognition, Dairazalia Sanchez-Cortes, Oya Aran, Dinesh Babu Jayagopi, Marianne Schmid Mast and Daniel Gatica-Perez, in: Journal on Multimodal User Interfaces, 2012

attachment

Automatic detection of conflict escalation in spoken conversations, Samuel Kim, Sree Harsha Yella and Fabio Valente, in: INTERSPEECH, ISCA, Portland, Oregon, USA., 2012

attachment

Speaker diarization of overlapping speech based on silence distribution in meeting recordings, Sree Harsha Yella and Fabio Valente, in: INTERSPEECH, Portland, Oregon, USA, 2012

attachment

Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, Maryam Habibi and Andrei Popescu-Belis, Idiap-RR-14-2012

attachment

Extracting Informative Textual Parts from Web Pages Containing User-Generated Content, Nikolaos Pappas, Georgios Katsimpras and Efstathios Stamatatos, in: 12th International Conference on Knowledge Management and Knowledge Technologies, ACM ICPS, Graz, Austria, pages 4:1--4:8, ACM, 2012

attachment

[URL]

Robust triphone mapping for acoustic modeling, Milos Cernak, David Imseng and Hervé Bourlard, Idiap-RR-02-2013

attachment

Robust triphone mapping for acoustic modeling, Milos Cernak, David Imseng and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012

attachment

Comparing different acoustic modeling techniques for multilingual boosting, David Imseng, John Dines, Petr Motlicek, Philip N. Garner and Hervé Bourlard, Idiap-RR-01-2013

attachment

Synthetic References for Template-based ASR using Posterior Features, Serena Soldo, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, USA, 2012

attachment

Comparing different acoustic modeling techniques for multilingual boosting, David Imseng, John Dines, Petr Motlicek, Philip N. Garner and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012

attachment

Phase AutoCorrelation (PAC) features for noise robust speech recognition, Shajith Ikbal, Hemant Misra, Hynek Hermansky and Mathew Magimai-Doss, in: Speech Communication, 54(7):867–880, 2012

[DOI]

A Survey on Language Modeling using Neural Networks, Nikolaos Pappas and Thomas Meyer, Idiap-RR-32-2012

attachment

Notes on Probabilistic Linear Discriminant Analysis, Chris McCool and Laurent El Shafey, Idiap-Com-03-2013

attachment

Bob: a free signal processing and machine learning toolbox for researchers, André Anjos, Laurent El Shafey, Roy Wallace, Manuel Günther, Chris McCool and Sébastien Marcel, Idiap-RR-25-2012

attachment

Assessing Sparse Coding Methods for Contextual Shape Indexing of Maya Hieroglyphs, Edgar Roman-Rangel, Jean-Marc Odobez and Daniel Gatica-Perez, in: Journal of Multimedia, 7(2):179--192, 2012

attachment

Multivariate Boosting with Look-up Tables for Face Processing, Cosmin Atanasoaei, EPFL, 2012

attachment

ScoreToolKit Documentation, André Anjos and Sébastien Marcel, Idiap-Com-02-2012

attachment

Bridging the Past, Present and Future: Modeling Scene Activities From Event Relationships and Global Rules, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: IEEE Conference on Computer Vision and Pattern Recognition, 2012, Providence, Rhode Island, USA, 2012

Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, Gwénolé Lecorvé and Petr Motlicek, Idiap-RR-21-2012

attachment

Supervised and unsupervised Web-based language model domain adaptation, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, Idiap-RR-22-2012

attachment

Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, in: Actes de la conference conjointe JEP-TALN-RECITAL 2012, Grenoble, France, pages 193-200, ATALA/AFCP, 2012

attachment

Audiovisual Diarization Of People In Video Content, Elie Khoury, Christine Sénac and Philippe Joly, in: Multimedia Tools and Applications, 2012

attachment

Combining transcription-based and acoustic-based speaker identifications for broadcast news, Elie Khoury, Antoine Laurent, Sylvain Meignier and Simon Petitrenaud, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012

attachment

Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, Chris McCool, Sébastien Marcel, Abdenour Hadid, Matti Pietikainen, Pavel Matejka, Jan Cernocky, Norman Poh, J. Kittler, Anthony Larcher, Christophe Levy, Driss Matrouf, Jean-François Bonastre, Phil Tresadern and Timothy Cootes, in: IEEE ICME Workshop on Hot Topics in Mobile Multimedia, 2012

attachment

Session Variability Modelling for Face Authentication, Chris McCool, Roy Wallace, Mitchell McLaren, Laurent El Shafey and Sébastien Marcel, Idiap-RR-17-2013

attachment

Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, Chris McCool, Sébastien Marcel, Abdenour Hadid, Matti Pietikainen, Pavel Matejka, Jan Cernocky, Norman Poh, J. Kittler, Anthony Larcher, Christophe Levy, Driss Matrouf, Jean-François Bonastre, Phil Tresadern and Timothy Cootes, Idiap-RR-13-2012

attachment

Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns, Andrei Popescu-Belis, Thomas Meyer, Jeevanthi Liyanapathirana, Bruno Cartoni and Sandrine Zufferey, in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), pages 5, 2012

attachment

Vocal Tract Length Normalization for Statistical Parametric Speech Synthesis, Lakshmi Saheer, John Dines and Philip N. Garner, in: IEEE Transactions on Audio, Speech and Language Processing, 2012

attachment

COMBINING VOCAL TRACT LENGTH NORMALIZATION WITH HIERARCHIAL LINEAR TRANSFORMATIONS, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, in: Proceedings in International conference on Speech and Signal processing, Kyoto, Japan, pages 4493-4496, IEEE SPS (ICASSP), 2012

attachment

On the Challenge of Classifying 52 Hand Movements from Surface Electromyography, Ilja Kuzborskij, Arjan Gijsberts and Barbara Caputo, in: 34th Annual Conference of the IEEE Engineering in Medicine & Biology Society, 2012

attachment

Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, Idiap-RR-15-2012

attachment

Alternative search techniques for face detection using location estimation and binary features, Venkatesh Bala Subburaman, ECOLE POLYTECHNIQUE FEDERALE DE LAUSANNE, 2012

attachment

Automatic Speaker Role Labeling in AMI Meetings: Recognition of Formal and Social Roles, A. Sapru and Fabio Valente, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012, 2012

attachment

A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, Youssef Oualil, Friedrich Faubel, Mathew Magimai-Doss and Dietrich Klakow, Idiap-RR-10-2012

attachment

Extracting Mobile Behavioral Patterns with the Distant N-Gram Topic Model, Katayoun Farrahi and Daniel Gatica-Perez, in: Proceedings of the IEEE International Symposium on Wearable Computers, Newcastle, 2012

attachment

Bayesian Approaches to Uncertainty in Speech Processing, Philip N. Garner, School of Computing Sciences, University of East Anglia, 2011

attachment

Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies, Bruno Cartoni and Thomas Meyer, in: Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), Istanbul, TR, pages 6, 2012

attachment

Using Sense-labeled Discourse Connectives for Statistical Machine Translation, Thomas Meyer and Andrei Popescu-Belis, in: Proceedings of the EACL2012 Workshop on Hybrid Approaches to Machine Translation (HyTra), Avignon, FR, pages 129--138, 2012

attachment

A Multiple Hypothesis Gaussian Mixture Filter for Acoustic Source Localization and Tracking, Youssef Oualil, Friedrich Faubel and Dietrich Klakow, Idiap-RR-09-2012

attachment

Automatic detection of conflicts in spoken conversations: ratings and analysis of broadcast political debates, Samuel Kim, Fabio Valente and Alessandro Vinciarelli, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012

attachment

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 |

processing time: 0.0005 seconds.