Publication list - Idiap Publications

Update cookies preferences

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 | 87 |

Combination of Sparse Classification and Multilayer Perceptron for Noise Robust ASR, Yang Sun, Mathew Magimai-Doss, Jort F. Gemmeke, B. Cranen, Louis ten Bosch and Lou Boves, in: Proceedings of Interspeech, 2012

attachment

Baseline System for Automatic Speech Recognition with French GlobalPhone Database, Sandrine Revaz and Milos Cernak, Idiap-RR-26-2012

attachment

Reading Companion: The Technical and Social Design of an Automated Reading Tutor, Arthur Kantor, Milos Cernak, Jiri Havelka, Sean Huber, Jan Kleindienst and Doris B. Gonzalez, in: Workshop on Child, Computer and Interaction, Portland, Oregon, U.S.A., 2012

attachment

Discovering Places of Interest in Everyday Life from Smartphone Data, R. Montoliu, Jan Blom and Daniel Gatica-Perez, in: Multimedia Tools and Applications, 2012

attachment

The Mobile Data Challenge: Big Data for Mobile Computing Research, J. K. Laurila, Daniel Gatica-Perez, I. Aad, Blom J., Olivier Bornet, Trinh-Minh-Tri Do, O. Dousse, J. Eberle and M. Miettinen, in: Pervasive Computing, Newcastle, 2012

attachment

The Vernissage Corpus: A Multimodal Human-Robot-Interaction Dataset, Dinesh Babu Jayagopi, Samira Sheikhi, David Klotz, Johannes Wienke, Jean-Marc Odobez, Sebastian Wrede, Vasil Khalidov, Laurent Son Nguyen, Britta Wrede and Daniel Gatica-Perez, Idiap-RR-33-2012

attachment

Boosting localized binary features for speech recognition, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: Symposium on Machine Learning in Speech and Language Processing (MLSLP), 2012

attachment

Indoor Scene Recognition using Task and Saliency-driven Feature Pooling, Marco Fornoni and Barbara Caputo, in: Proceedings of the British Machine Vision Conference, Guildford, UK, 2012

attachment

Leveraging over prior knowledge for online learning of visual categories, Tatiana Tommasi, Francesco Orabona, Mohsen Kaboli and Barbara Caputo, in: Proceedings of the British Machine Vision Conference, 2012

attachment

Contextual Conditional Models for Smartphone-based Human Mobility Prediction, Trinh-Minh-Tri Do and Daniel Gatica-Perez, in: Proceedings of the 14th ACM International Conference on Ubiquitous Computing, 2012

attachment

Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, Maryam Habibi and Andrei Popescu-Belis, in: RecSys, Recommendation Utility Evaluation (RUE 2012), Dublin, Ireland, pages 15-20, 2012

attachment

Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, Idiap-RR-23-2012

attachment

Application of Subspace Gaussian Mixture Models in Contrastive Acoustic Scenarios, Petr Motlicek, Philip N. Garner, David Imseng and Fabio Valente, Idiap-RR-20-2012

attachment

Building the NinaPro Database: a Resource for the Biorobotics Community, Manfredo Atzori, Arjan Gijsberts, Simone Heynen, Anne-Gabrielle Mittaz Hager, Olivier Deriaz, Patrick van der Smagt, Claudio Castellini, Barbara Caputo and Henning Müller, in: Proceedings of the Fourth IEEE RAS/EMBS International Conference on Biomedical Robotics and Biomechatronics, 2012

attachment

Who Wants To Be A Millionaire?, Huseyn Gasimov, Aleksei Triastcyn, Petr Motlicek and Hervé Bourlard, Idiap-Com-03-2012

attachment

A Multiple Hypothesis Gaussian Mixture Filter for Acoustic Source Localization and Tracking, Youssef Oualil, Friedrich Faubel and Dietrich Klakow, in: 13th International Workshop on Acoustic Signal Enhancement, pages 233-236, 2012

attachment

Joint Detection and Localization of Multiple Speakers using a Probabilistic Interpretation of the Steered Response Power, Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel and Dietrich Klakow, in: Statistical and Perceptual Audition Workshop, 2012

attachment

A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, Youssef Oualil, Friedrich Faubel, Mathew Magimai-Doss and Dietrich Klakow, in: 20th European Signal Processing Conference, 2012

attachment

Integrating Posterior Features and Self-Organizing Maps for Isolated Word Recognition without Dynamic Programming, Serena Soldo and Mathew Magimai-Doss, Idiap-RR-17-2012

attachment

The Good, the Bad, and the Angry: Analyzing Crowdsourced Impressions of Vloggers, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, 2012

attachment

From Speech to Personality: Mapping Voice Quality and Intonation into Personality Differences, Gelareh Mohammadi, Antonio Origlia, Maurizio Pili and Alessandro Vinciarelli, in: in Proceedings of ACM Multimedia 2012, 2012

attachment

Microphone Array Beampattern Characterization for Hands-free Speech Applications, Mohammad J. Taghizadeh, Philip N. Garner and Hervé Bourlard, in: IEEE 7th Sensor Array and Multichannel Signal Processing Workshop(SAM), Hoboken, NJ, USA, pages 473-476, 2012

attachment

Sparsity in Topic Models, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: Practical Applications of Sparse Modeling: Biology, Signal Processing and Beyond, MIT Press, 2012

attachment

Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, Petr Motlicek, Laurent El Shafey, Roy Wallace, Chris McCool and Sébastien Marcel, Idiap-RR-18-2012

attachment

Template-based ASR using Posterior features and synthetic references: comparing different TTS systems, Serena Soldo, Mathew Magimai-Doss and Hervé Bourlard, in: SAPA-SCALE Conference, International Speech Communication Association, 2012

attachment

Gaze Estimation From Multimodal Kinect Data, Kenneth Alberto Funes Mora and Jean-Marc Odobez, in: IEEE Conference in Computer Vision and Pattern Recognition, Workshop on Gesture Recognition, Providence, RI, USA, 2012

attachment

[DOI]

On Speaker-Independent Personality Perception and Prediction from Speech, Polzehl Tim, Schoenenberg Katrin, Moller Sebastian, Metze Florian, Gelareh Mohammadi and Alessandro Vinciarelli, in: in Proceedings of INTERSPEECH 2012, 2012

attachment

Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, Gwénolé Lecorvé and Petr Motlicek, in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012

attachment

Supervised and unsupervised Web-based language model domain adaptation, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012

attachment

Structured Sparse Coding for Microphone Array Location Calibration, Afsaneh Asaei, Bhiksha Raj, Hervé Bourlard and Volkan Cevher, in: SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition, 2012

attachment

Using self-context for multimodal detection of head nods in face-to-face interactions, Laurent Son Nguyen, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-27-2012

attachment

On the Effectiveness of Local Binary Patterns in Face Anti-spoofing, Ivana Chingovska, André Anjos and Sébastien Marcel, Idiap-RR-19-2012

attachment

Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, Weifeng Li and Hervé Bourlard, in: Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech), Portland, Oregon, 2012

attachment

Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, Weifeng Li and Hervé Bourlard, Idiap-RR-16-2012

attachment

Integrating Language Identification to improve Multilingual Speech Recognition, Holger Caesar, Idiap-RR-24-2012

attachment

Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings of Interspeech, Portland, Oregon, 2012

attachment

Emergent leaders through looking and speaking: from audio-visual data to multimodal recognition, Dairazalia Sanchez-Cortes, Oya Aran, Dinesh Babu Jayagopi, Marianne Schmid Mast and Daniel Gatica-Perez, in: Journal on Multimodal User Interfaces, 2012

attachment

Automatic detection of conflict escalation in spoken conversations, Samuel Kim, Sree Harsha Yella and Fabio Valente, in: INTERSPEECH, ISCA, Portland, Oregon, USA., 2012

attachment

Speaker diarization of overlapping speech based on silence distribution in meeting recordings, Sree Harsha Yella and Fabio Valente, in: INTERSPEECH, Portland, Oregon, USA, 2012

attachment

Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, Maryam Habibi and Andrei Popescu-Belis, Idiap-RR-14-2012

attachment

Extracting Informative Textual Parts from Web Pages Containing User-Generated Content, Nikolaos Pappas, Georgios Katsimpras and Efstathios Stamatatos, in: 12th International Conference on Knowledge Management and Knowledge Technologies, ACM ICPS, Graz, Austria, pages 4:1--4:8, ACM, 2012

attachment

[URL]

Robust triphone mapping for acoustic modeling, Milos Cernak, David Imseng and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012

attachment

Synthetic References for Template-based ASR using Posterior Features, Serena Soldo, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, USA, 2012

attachment

Comparing different acoustic modeling techniques for multilingual boosting, David Imseng, John Dines, Petr Motlicek, Philip N. Garner and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012

attachment

Phase AutoCorrelation (PAC) features for noise robust speech recognition, Shajith Ikbal, Hemant Misra, Hynek Hermansky and Mathew Magimai-Doss, in: Speech Communication, 54(7):867–880, 2012

[DOI]

A Survey on Language Modeling using Neural Networks, Nikolaos Pappas and Thomas Meyer, Idiap-RR-32-2012

attachment

Bob: a free signal processing and machine learning toolbox for researchers, André Anjos, Laurent El Shafey, Roy Wallace, Manuel Günther, Chris McCool and Sébastien Marcel, Idiap-RR-25-2012

attachment

Assessing Sparse Coding Methods for Contextual Shape Indexing of Maya Hieroglyphs, Edgar Roman-Rangel, Jean-Marc Odobez and Daniel Gatica-Perez, in: Journal of Multimedia, 7(2):179--192, 2012

attachment

Multivariate Boosting with Look-up Tables for Face Processing, Cosmin Atanasoaei, EPFL, 2012

attachment

ScoreToolKit Documentation, André Anjos and Sébastien Marcel, Idiap-Com-02-2012

attachment

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 | 87 |

processing time: 0.0005 seconds.