Research reports list - Idiap Publications

Syllable-Level Features for Speech Pathology Detection: A Case Study of Parkinson’s Disease, Sevada Hovsepyan and Mathew Magimai-Doss, Idiap-RR-02-2026

Alleviating Forgetfulness of Linear Attention by Hybrid Sparse Attention and Contextualized Learnable Token Eviction, Mutian He and Philip N. Garner, Idiap-RR-01-2026

[URL]

IDIAP SUBMISSION TO NIST LRE22 LANGUAGE RECOGNITION EVALUATION, Amrutha Prasad, Driss Khalil, Srikanth Madikeri and Petr Motlicek, Idiap-RR-11-2025

TEAM SWITZERLAND SUBMISSION TO NIST SRE24 SPEAKER RECOGNITION EVALUATION, Amrutha Prasad, Hatef Otroshi Shahreza, Andrés Carofilis, Aref Farhadipour, Shiran Liu, Srikanth Madikeri, Anjith George, Petr Motlicek, Sébastien Marcel, Masoumeh Chapariniya, Valeriia Perepelytsia, Teodora Vukovic and Volker Dellwo, Idiap-RR-10-2025

Enhancing Speaker Diarization using Correlation-Based Clustering Initialization, Pradeep Rangappa, Amrutha Prasad, Srikanth Madikeri and Petr Motlicek, Idiap-RR-09-2025

EdgeDoc: Hybrid CNN-Transformer Model for Accurate Forgery Detection and Localization in ID Documents, Anjith George and Sébastien Marcel, Idiap-RR-08-2025

Tokenwise Contrastive Speech and Text Pre-Training for Speech Emotion Recognition, Eklavya Sarkar and Neha Tarigopula, Idiap-RR-07-2025

Leveraging Sequential Structure in Animal Vocalizations, Eklavya Sarkar and Mathew Magimai-Doss, Idiap-RR-06-2025

Adaptation of Speech and Bioacoustics Models, Eklavya Sarkar, Amir Mohammadi and Mathew Magimai-Doss, Idiap-RR-05-2025

Improving ASR and Callsign Detection in Air Traffic Control Speech using Whisper Prompting, Jehan Joachim Daniel Piaget, Amrutha Prasad and Petr Motlicek, Idiap-RR-04-2025

TRACY Canvas: A Criminal Network Visualization Tool, Alejandra Sanchez Lara, Petr Motlicek, Dairazalia Sanchez-Cortes, Pradeep Rangappa, Srikanth Madikeri and Driss Khalil, Idiap-RR-03-2025

Speech power spectra: a window into neural oscillations in Parkinson's disease, Sevada Hovsepyan and Mathew Magimai-Doss, Idiap-RR-02-2025

Review of Demographic Bias in Face Recognition, Ketan Kotwal and Sébastien Marcel, Idiap-RR-01-2025

Investigating Semantic Segmentation Models to Assist Visually Impaired People, Michael Villamizar, Olivier Canévet and Jean-Marc Odobez, Idiap-RR-13-2024

Estimating Breathing Pattern from Raw Speech Waveform and Short-term Speech Spectrum using Neural Networks, Zohreh Mostaani, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, Idiap-RR-12-2024

Posterior-based analysis of spatio-temporal features for Sign Language Assessment, Neha Tarigopula, Sandrine Tornay, Ozge Mercanoglu Sincan, Richard Bowden and Mathew Magimai-Doss, Idiap-RR-11-2024

Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, Thorbecke Iuliia, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Shashi Kumar, Pradeep Rangappa, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, Idiap-RR-10-2024

Towards Dynamic Skeleton-based Handshape Subunits for Sign Language Assessment, Sandrine Tornay and Mathew Magimai-Doss, Idiap-RR-09-2024

XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Nigmatulina Iuliia, Petr Motlicek, Manjunath K E and Aravind Ganapathiraju, Idiap-RR-08-2024

[URL]

TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, Idiap-RR-07-2024

[URL]

Feature Representations for Automatic Meerkat Vocalization Classification, Imen Ben Mahmoud, Eklavya Sarkar, Marta Manser and Mathew Magimai-Doss, Idiap-RR-06-2024

Sentiment Analysis using pretrained LLMs, Alexandre Huou, Petr Motlicek and Esaú Villatoro-Tello, Idiap-RR-05-2024

Assessing the Reliability of Biometric Authentication on Virtual Reality Devices, Ketan Kotwal, Gökhan Özbulak and Sébastien Marcel, Idiap-RR-04-2024

VRBiom: A New Periocular Dataset for Biometric Applications of HMD, Ketan Kotwal, Gökhan Özbulak and Sébastien Marcel, Idiap-RR-03-2024

Deep Privacy Funnel Model: From a Discriminative to a Generative Approach with an Application to Face Recognition, Behrooz Razeghi, Parsa Rahimi and Sébastien Marcel, Idiap-RR-02-2024

EdgeFace: Efficient Face Recognition Model for Edge Devices, Anjith George, Christophe Ecabert, Hatef Otroshi Shahreza, Ketan Kotwal and Sébastien Marcel, Idiap-RR-01-2024

Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, Anjith George and Sébastien Marcel, Idiap-RR-09-2023

Attacking Face Recognition with T-shirts: Database, Vulnerability Assessment and Detection, Anjith George and Sébastien Marcel, Idiap-RR-08-2023

Approximating Optimal Morphing Attacks using Template Inversion, Laurent Colbois, Hatef Otroshi Shahreza and Sébastien Marcel, Idiap-RR-07-2023

When Differential Privacy Meets Graph Neural Networks, Sina Sajadmanesh and Daniel Gatica-Perez, Idiap-RR-06-2023

Idiap Scientific Report 2022, Hervé Bourlard, Daniel Gatica-Perez, Jean-Marc Odobez, Philip N. Garner, Petr Motlicek, Mathew Magimai-Doss, Sylvain Calinon, Sébastien Marcel, Jérôme Kämpf, Raphaelle Luisier, Michael Liebling, Lonneke van der Plas, Damien Teney, Ina Kodrasi, Emmanuel Senft, James Henderson, Andre Freitas and André Anjos, Idiap-RR-05-2023

VP-STO: Via-point-based Stochastic Trajectory Optimization for Reactive Robot Behavior, Julius Jankowski, Lara Brudermuller, Nick Hawes and Sylvain Calinon, Idiap-RR-04-2023

Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, Sergio Burdisso, Esaú Villatoro-Tello, Srikanth Madikeri and Petr Motlicek, Idiap-RR-03-2023

Implementing contextual biasing in GPU decoder for online ASR, Nigmatulina Iuliia, Srikanth Madikeri, Esaú Villatoro-Tello, Petr Motlicek, Juan Zuluaga-Gomez, Karthik Pandia D S and Aravind Ganapathiraju, Idiap-RR-02-2023

Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, Mael Fabien, Seyyed Saeed Sarfjoo, Srikanth Madikeri and Petr Motlicek, Idiap-RR-01-2023

[URL]

IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, Sergio Burdisso, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Martin Fajcik, Muskaan Singh, Pavel Smrz and Petr Motlicek, Idiap-RR-13-2022

IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, Martin Fajcik, Muskaan Singh, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek and Pavel Smrz, Idiap-RR-12-2022

SPEECH MODELING USING SPARSE AUTOENCODERS, Selen Hande Kabil and Hervé Bourlard, Idiap-RR-11-2022

SPARSE AUTOENCODERS TO ENHANCE SPEECH RECOGNITION, Selen Hande Kabil and Hervé Bourlard, Idiap-RR-10-2022

Eight Years of Face Recognition Research: Reproducibility, Achievements and Open Issues, Tiago de Freitas Pereira, Dominic Schmidli, Yu Linghu, Xinyi Zhang, Sébastien Marcel and Manuel Günther, Idiap-RR-09-2022

[URL]

An anomaly detection approach for backdoored neural networks: face recognition as a case study, Alexander Unnervik and Sébastien Marcel, Idiap-RR-08-2022

[URL]

On the detection of morphing attacks generated by GANs, Laurent Colbois and Sébastien Marcel, Idiap-RR-07-2022

Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, Esaú Villatoro-Tello, Srikanth Madikeri, Petr Motlicek, Aravind Ganapathiraju and Alexei V. Ivanov, Idiap-RR-06-2022

Efficient Wind Speed Nowcasting with GPU-Accelerated Nearest Neighbors Algorithm, Arnaud Pannatier, Ricardo Picatoste and Francois Fleuret, Idiap-RR-05-2022

End-to-end Accented Speech Recognition, Thibault Viglino, Petr Motlicek and Milos Cernak, Idiap-RR-04-2022

Robust Face Presentation Attack Detection with Multi-channel Neural Networks, Anjith George and Sébastien Marcel, Idiap-RR-03-2022

A Comprehensive Evaluation on Multi-channel Biometric Face Presentation Attack Detection, Anjith George, David Geissbuhler and Sébastien Marcel, Idiap-RR-02-2022

Applying Attention Based Models for Detecting Cognitive Processes and Mental Health Conditions, Esaú Villatoro-Tello, Shantipriya Parida, Sajit Kumar and Petr Motlicek, Idiap-RR-01-2022

Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR, Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Oliver Ohneiser, Hartmut Helmke, Seyyed Saeed Sarfjoo and Nigmatulina Iuliia, Idiap-RR-22-2021

Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation, Florian Mai and James Henderson, Idiap-RR-21-2021

Improving callsign recognition with air-surveillance data in air-traffic communication, Nigmatulina Iuliia, Rudolf Braun, Juan Zuluaga-Gomez and Petr Motlicek, Idiap-RR-20-2021

[URL]

Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, Esaú Villatoro-Tello, Gabriela Ramírez-de-la-Rosa, Héctor Jiménez-Salazar, Daniel Gatica-Perez and Mathew Magimai-Doss, Idiap-RR-19-2021

Test time Adaptation through Perturbation Robustness, Prabhu Teja Sivaprasad and Francois Fleuret, Idiap-RR-17-2021

BertOdia: BERT pre-training for low resource Odia language, Shantipriya Parida, Satya Prakash Biswal, Biranchi Narayan Nayak, Mael Fabien, Esaú Villatoro-Tello and Petr Motlicek, Idiap-RR-16-2021

BERTraffic: A Robust BERT-Based Approach for Speaker Change Detection and Role Identification of Air-Traffic Communications, Juan Zuluaga-Gomez, Seyyed Saeed Sarfjoo, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek, Oliver Ohneiser and Hartmut Helmke, Idiap-RR-15-2021

Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Amrutha Prasad, Petr Motlicek, Karel Vesely, Martin Kocour and Igor Szoke, Idiap-RR-14-2021

[URL]

Multimodal Neural Machine Translation System for English to Bengali, Shantipriya Parida, Subhadarshi Panda, Satya Prakash Biswal, Ketan Kotwal, Arghyadeep Sen, Satya Ranjan Dash and Petr Motlicek, Idiap-RR-13-2021

Adjustable Deterministic Pseudonymization of Speech, S. Pavankumar Dubagunta, Rob Van Son and Mathew Magimai-Doss, Idiap-RR-12-2021

Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, S. Pavankumar Dubagunta, Edoardo Moneta, Eleni Theocharopoulos and Mathew Magimai-Doss, Idiap-RR-11-2021

NLPHut’s Participation at WAT2021, Shantipriya Parida, Subhadarshi Panda, Ketan Kotwal, Amulya Ratna Dash, Satya Ranjan Dash, Yashvardhan Sharma, Petr Motlicek and Ondrej Bojar, Idiap-RR-10-2021

Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, Esaú Villatoro-Tello, S. Pavankumar Dubagunta, Julian Fritsch, Gabriela Ramírez-de-la-Rosa, Petr Motlicek and Mathew Magimai-Doss, Idiap-RR-09-2021

Supervised Speech Representation Learning for Parkinson's Disease Classification, Parvaneh Janbakhshi and Ina Kodrasi, Idiap-RR-08-2021

Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution), Shantipriya Parida, Subhadarshi Panda, Amulya Ratna Dash, Esaú Villatoro-Tello, A. Seza Dogruöz, Rosa M. Ortega-Mendoza, Amadeo Hernández, Yashvardhan Sharma and Petr Motlicek, Idiap-RR-07-2021

Broadcast Media Content Categorization Using Low-Resolution Concepts, Esaú Villatoro-Tello, Shantipriya Parida, Petr Motlicek, Subhadeep Dey and Qingran Zhan, Idiap-RR-06-2021

Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features, Qingran Zhan, Shixuan Du, Petr Motlicek, Yahui Shan and Xiang Xie, Idiap-RR-05-2021

[URL]

Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, Idiap-RR-04-2021

An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, Marco Ewerton, Sylvain Calinon and Jean-Marc Odobez, Idiap-RR-03-2021

CHALLENGES IN BROADCAST MEDIA CONTENT CATEGORIZATION, Shantipriya Parida, Esaú Villatoro-Tello and Petr Motlicek, Idiap-RR-02-2021

Probabilistic Symbol Sequence Matching and its Application to Pathological Speech Intelligibility Assessment, Julian Fritsch, Guillem Quer and Mathew Magimai-Doss, Idiap-RR-01-2021

LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, Idiap-RR-40-2020

[URL]

Vulnerability Analysis of Face Morphing Attacks from Landmarks and Generative Adversarial Networks, Eklavya Sarkar, Pavel Korshunov, Laurent Colbois and Sébastien Marcel, Idiap-RR-38-2020

Deepfake detection: humans vs. machines, Pavel Korshunov and Sébastien Marcel, Idiap-RR-36-2020

COMPARISON OF SUBWORD SEGMENTATION METHODS FOR OPEN-VOCABULARYEND-TO-END SPEECH RECOGNITION, Abbas Khosravani, Claudiu Musat, Philip N. Garner and Alexandros Lazaridis, Idiap-RR-34-2020

AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, Idiap-RR-32-2020

On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing, Anjith George and Sébastien Marcel, Idiap-RR-30-2020

Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition System, Srikanth Madikeri, Banriskhem Khonglah, Sibo Tong, Petr Motlicek, Hervé Bourlard and Daniel Povey, Idiap-RR-28-2020

Novel Architectures for Unsupervised Information Bottleneck based Speaker Diarization of Meetings, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, Idiap-RR-26-2020

Plug and Play Autoencoders for Conditional Text Generation, Florian Mai, Nikolaos Pappas, Ivan Montero, Noah A. Smith and James Henderson, Idiap-RR-24-2020

The High-Quality Wide Multi-Channel Attack (HQ-WMCA) database, Zohreh Mostaani, Anjith George, Guillaume Heusch, David Geissbuhler and Sébastien Marcel, Idiap-RR-22-2020

Taming GANs with Lookahead, Tatjana Chavdarova, Matteo Pagliardini, Martin Jaggi and Francois Fleuret, Idiap-RR-20-2020

[URL]

Face Recognition Systems Under Spoofing Attacks, Ivana Chingovska, Nesli Erdogmus, André Anjos and Sébastien Marcel, Idiap-RR-18-2020

Smartphone Multi-modal Biometric Authentication: Database and Evaluation, Ramachandra Raghavendra, Martin Stokkenes, Amir Mohammadi, Sushma Venkatesh, Kiran B. Raja, Pankaj Wasnik, Eric Poiret, Sébastien Marcel and Christoph Busch, Idiap-RR-17-2020

[URL]

Learning One Class Representations for Presentation Attack Detection using Multi-channel Convolutional Neural Networks, Anjith George and Sébastien Marcel, Idiap-RR-15-2020

Gradient Alignment in Deep Neural Networks, Suraj Srinivas and Francois Fleuret, Idiap-RR-14-2020

Can Your Face Detector Do Anti-spoofing? Face Presentation Attack Detection with a Multi-Channel Face Detector, Anjith George and Sébastien Marcel, Idiap-RR-12-2020

Idiap Submission to Swiss-German Language Detection Shared Task, Shantipriya Parida, Esaú Villatoro-Tello, Sajit Kumar, Petr Motlicek and Qingran Zhan, Idiap-RR-11-2020

CNN Patch Pooling for Detecting 3D Mask Presentation Attacks in NIR, Ketan Kotwal and Sébastien Marcel, Idiap-RR-10-2020

German News Article Classification : A Multichannel CNN Approach, Shantipriya Parida, Petr Motlicek and Satya Ranjan Dash, Idiap-RR-09-2020

OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation, Shantipriya Parida, Satya Ranjan Dash, Ondrej Bojar, Petr Motlicek, Priyanka Pattnaik and Debasish Kumar Mallick, Idiap-RR-08-2020

A BAYESIAN APPROACH TO INTER-TASK FUSION FOR SPEAKER RECOGNITION, Srikanth Madikeri, Subhadeep Dey and Petr Motlicek, Idiap-RR-07-2020

Language model domain adaptation for automatic speech recognition, Amrutha Prasad, Petr Motlicek and Alexandre Nanchen, Idiap-RR-05-2020

Idiap NMT System for WAT 2019 Multimodal Translation Task, Shantipriya Parida and Petr Motlicek, Idiap-RR-04-2020

Idiap Abstract Text Summarization System for German Text Summarization Task, Shantipriya Parida and Petr Motlicek, Idiap-RR-03-2020

Extractive Odia Text Summarization System: An OCR based Approach, Shantipriya Parida, Idiap-RR-02-2020

AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, Petr Motlicek, Hynek Hermansky, Srikanth Madikeri, Amrutha Prasad and Sriram Ganapathy, Idiap-RR-01-2020

Comparison of Subword Segmentation Methods for Open-vocabulary ASR using a Difficulty Metric, Abbas Khosravani, Claudiu Musat, Philip N. Garner and Alexandros Lazaridis

Learning Entailment-Based Sentence Embeddings from Natural Language Inference, Rabeeh Karimi Mahabadi, Florian Mai and James Henderson, Idiap-RR-20-2019

[URL]

On the Tunability of Optimizers in Deep Learning, Prabhu Teja Sivaprasad, Florian Mai, Thijs Vogels, Martin Jaggi and Francois Fleuret, Idiap-RR-19-2019

[URL]

Reconstruction of image sequences from ungated and scanning-aberrated laser scanning microscopy images of the beating heart, Olivia Mariani, Alexander Ernst, Nadia Mercader and Michael Liebling, Idiap-RR-18-2019

Idiap submission to the NIST SRE 2018 Speaker Recognition Evaluation, Srikanth Madikeri, Seyyed Saeed Sarfjoo, Petr Motlicek and Sébastien Marcel, Idiap-RR-17-2019

TOWARDS MULTILINGUAL SIGN LANGUAGE RECOGNITION, Sandrine Tornay, Marzieh Razavi and Mathew Magimai-Doss, Idiap-RR-16-2019

Idiap submission to the NIST SRE 2019 Speaker Recognition Evaluation, Seyyed Saeed Sarfjoo, Srikanth Madikeri, Mahdi Hajibabaei, Petr Motlicek and Sébastien Marcel, Idiap-RR-15-2019

The Speed Submission to DIHARD II: Contributions & Lessons Learned, Md Sahidullah, Jose Patino, Samuele Cornell, Ruiqing Yin, Sunit Sivasankaran, Herve Bredin, Pavel Korshunov, Alessio Brutti, Romain Serizel, Emmanuel Vincent, Nicholas Evans, Sébastien Marcel, Stefano Squartini and Claude Barras, Idiap-RR-14-2019

[UNK]: https://arxiv.org/abs/1911.02388

INVESTIGATING TIME DELAY NEURAL NETWORK (TDNN) FOR LANGUAGE MODELING IN LOW RESOURCE AUTOMATIC SPEECH RECOGNITION, Banriskhem Khonglah, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, Idiap-RR-13-2019

STACKED NEURAL NETWORKS WITH PARAMETER SHARING FOR MULTILINGUAL LANGUAGE MODELING, Banriskhem Khonglah, Srikanth Madikeri, Navid Rekabsaz, Nikolaos Pappas, Petr Motlicek and Hervé Bourlard, Idiap-RR-12-2019

Understanding Raw Waveform based CNN through Low-rank Spectro-Temporal Decoupling, Vinayak Abrol, S. Pavankumar Dubagunta and Mathew Magimai-Doss, Idiap-RR-11-2019

Domain Adaptation and Investigation of Robustness of DNN-based Embeddings for Text-Independent Speaker Verification Using Dilated Residual Networks, Seyyed Saeed Sarfjoo, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-10-2019

A Comprehensive Experimental and Reproducible Study on Selfie Biometrics in Multistream and Heterogeneous Settings, Guillaume Heusch, Tiago de Freitas Pereira and Sébastien Marcel, Idiap-RR-09-2019

SPOKEN LANGUAGE IDENTIFICATION USING LANGUAGE BOTTLENECK FEATURES, Grisard Malo, Petr Motlicek, Wissem Allouchi, Michael Baeriswyl, Alexandros Lazaridis and Qingran Zhan, Idiap-RR-08-2019

Processing Megapixel Images with Deep Attention-Sampling Models, Angelos Katharopoulos and Francois Fleuret, Idiap-RR-07-2019

[URL]

Estimating The Degree of Sleepiness by Integrating Articulatory Feature Knowledge In Raw Waveform Based CNNs, Julian Fritsch, S. Pavankumar Dubagunta and Mathew Magimai-Doss, Idiap-RR-06-2020

CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, Florian Mai, Lukas Galke and Ansgar Scherp, Idiap-RR-06-2019

[URL]

AN END-TO-END NETWORK TO SYNTHESIZE INTONATION USING A GENERALIZED COMMAND RESPONSE MODEL, François Marelli, Bastian Schnell, Hervé Bourlard, T. Dutoit and Philip N. Garner, Idiap-RR-05-2019

Virtual High-Framerate Microscopy of the Beating Heart via Sorting of Still Images, Olivia Mariani, Kevin G. Chan, Alexander Ernst, Nadia Mercader and Michael Liebling, Idiap-RR-04-2019

Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis, Yu Yu, Gang Liu and Jean-Marc Odobez, Idiap-RR-03-2019

Data-Driven Movement Subunit Extraction from Skeleton Information for Modeling Signs and Gestures, Sandrine Tornay, Marzieh Razavi and Mathew Magimai-Doss, Idiap-RR-02-2019

EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, Alexandre Nanchen and Philip N. Garner, Idiap-RR-01-2019

DeepFakes: a New Threat to Face Recognition? Assessment and Detection, Pavel Korshunov and Sébastien Marcel, Idiap-RR-18-2018

Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network, Weipeng He, Petr Motlicek and Jean-Marc Odobez, Idiap-RR-17-2018

Designing second order recurrent neural networks for prosody modelling, François Marelli, Idiap-RR-16-2018

Analysis of Posterior Estimation Approaches to I-vector Extraction for Speaker Recognition, Srikanth Madikeri, Petr Motlicek, Marc Ferras and Subhadeep Dey, Idiap-RR-15-2018

Combining the SNR Spectrum with a Cochlear Model, Philip N. Garner, Idiap-RR-14-2018

Modelling glottal source information for depression detection, D S Pavan Kumar, Bogdan Vlasenko and Mathew Magimai-Doss, Idiap-RR-13-2018

Not All Samples Are Created Equal: Deep Learning with Importance Sampling, Angelos Katharopoulos and Francois Fleuret, Idiap-RR-12-2018

Gradient-based spectral visualization of CNNs using raw waveforms, Hannah Muckenhirn, Vinayak Abrol, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-11-2018

A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, Bastian Schnell and Philip N. Garner, Idiap-RR-10-2018

Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, Idiap-RR-09-2018

Semi-blind spatially-variant deconvolution in optical microscopy with local Point Spread Function estimation by use of Convolutional Neural Networks, Adrian Shajkofci and Michael Liebling, Idiap-RR-07-2018

DNN based speaker embedding using content information for text-dependent speaker verification, Subhadeep Dey, Takafumi Koshinaka, Petr Motlicek and Srikanth Madikeri, Idiap-RR-06-2018

Knowledge Transfer with Jacobian Matching, Suraj Srinivas and Francois Fleuret, Idiap-RR-04-2018

[URL]

Implémentation d'un algorithme de réduction de taille des réseaux de neurones, François Marelli, Idiap-RR-03-2018

Deep Neural Networks for Multiple Speaker Detection and Localization, Weipeng He, Petr Motlicek and Jean-Marc Odobez, Idiap-RR-02-2018

Multilingual Training and Cross-lingual Adaptation on CTC-based Acoustic Model, Sibo Tong, Philip N. Garner and Hervé Bourlard, Idiap-RR-01-2018

Template-matching for Text-dependent Speaker Verification, Subhadeep Dey, Petr Motlicek, Srikanth Madikeri and Marc Ferras, Idiap-RR-32-2017

CONTENT NORMALIZATION FOR TEXT-INDEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Marc Ferras, Petr Motlicek and Srikanth Madikeri, Idiap-RR-31-2017

Towards directly modeling raw speech signal for speaker verification using CNNs, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-30-2017

Towards a breakthrough speaker identification approach for law enforcement agencies, Khaled Khelif, yann Mombrun, Petr Motlicek, Gerhard Backfried, Damien Kelly, Farhan Sahito, Gideon Hazzani, Luca Scarpatto, Emmanouil Chatzigavriil and Srikanth Madikeri, Idiap-RR-29-2017

NASAL SPEECH SOUNDS DETECTION USING CONNECTIONIST TEMPORAL CLASSIFICATION, Milos Cernak and Sibo Tong, Idiap-RR-28-2017

Evaluating Attention Networks for Anaphora Resolution, Jonathan Pilault, Nikolaos Pappas, Lesly Miculicich and Andrei Popescu-Belis, Idiap-RR-27-2017

Cross-lingual Transfer for News Article Labeling: Benchmarking Statistical and Neural Models, Khalil Mrini, Nikolaos Pappas and Andrei Popescu-Belis, Idiap-RR-26-2017

Towards Document-Level Neural Machine Translation, Lesly Miculicich, Idiap-RR-25-2017

Supervised Gaze Bias Correction for Gaze Coding in Interactions, Remy Siegfried and Jean-Marc Odobez, Idiap-RR-23-2017

Semi-supervised Learning with Semantic Knowledge Extraction for Improved Speech Recognition in Air Traffic Control, Ajay Srinivasamurthy, Petr Motlicek, Ivan Himawan, Gyorgy Szaszak, Youssef Oualil and Hartmut Helmke, Idiap-RR-21-2017

Perceptual Information Loss due to Impaired Speech Production, Afsaneh Asaei, Milos Cernak and Hervé Bourlard, Idiap-RR-20-2017

A Sub-Quadratic Exact Medoid Algorithm, James Newling and Francois Fleuret, Idiap-RR-19-2017

Comparative Study on Sentence Boundary Prediction for German and English Broadcast News, Yang Wang, Alexandre Nanchen, Alexandros Lazaridis, David Imseng and Philip N. Garner, Idiap-RR-18-2017

Multilingual Hierarchical Attention Networks for Document Classification, Nikolaos Pappas and Andrei Popescu-Belis, Idiap-RR-17-2017

[URL]

Characterisation of voice quality of Parkinson's disease using differential phonological posterior features, Milos Cernak, Juan Rafael Orozco-Arroyave, Frank Rudzicz, Heidi Christensen, Juan Camilo Vasquez-Correa and Elmar Nöth, Idiap-RR-16-2017

Towards Weakly Supervised Acoustic Subword Unit Discovery and Lexicon Development Using Hidden Markov Models, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-15-2017

BEAT: An Open-Source Web-Based Open-Science Platform, André Anjos, Laurent El Shafey and Sébastien Marcel, Idiap-RR-14-2017

2D Face Recognition: An Experimental and Reproducible Research Survey, Manuel Günther, Laurent El Shafey and Sébastien Marcel, Idiap-RR-13-2017

From Research to Reality: Evaluation of a Single-Computer Real-Time LVCSR System for Speech-Based Retrieval, Andrei Popescu-Belis, Maryam Habibi, Philip N. Garner and Nan Li, Idiap-RR-12-2017

Long Term Spectral Statistics for Voice Presentation Attack Detection, Hannah Muckenhirn, Pavel Korshunov, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-11-2017

Topic and Sentiment in Phrase-Based Statistical Machine Translation, Maryam Habibi, Nikolaos Pappas and Andrei Popescu-Belis, Idiap-RR-10-2017

Robust and Accurate 3D Head Pose Estimation through 3DMM and Online Head Model Reconstruction, Yu Yu, Kenneth Alberto Funes Mora and Jean-Marc Odobez, Idiap-RR-09-2017

Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, Xiao Pu, Laura Mascarell and Andrei Popescu-Belis, Idiap-RR-08-2017

Using Coreference Links to Improve Spanish-to-English Machine Translation, Lesly Miculicich and Andrei Popescu-Belis, Idiap-RR-07-2017

Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, Ngoc-Quang Luong and Andrei Popescu-Belis, Idiap-RR-06-2017

INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION, Srikanth Madikeri, Marc Ferras, Petr Motlicek and Subhadeep Dey, Idiap-RR-05-2017

EXPLOITING SEQUENCE INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Petr Motlicek, Srikanth Madikeri and Marc Ferras, Idiap-RR-04-2017

The SIWIS French Speech Synthesis Database – Design and recording of a high quality French database for speech synthesis, Pierre-Edouard Honnet, Alexandros Lazaridis, Philip N. Garner and Junichi Yamagishi, Idiap-RR-03-2017

Real-time Multiple Head Tracking Using Texture and Colour Cues, Vasil Khalidov and Jean-Marc Odobez, Idiap-RR-02-2017

Maya Codical Glyph Segmentation: A Crowdsourcing Approach, Gulcan Can, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-01-2017

IDIAP SUBMISSION TO THE NIST SRE 2016 SPEAKER RECOGNITION EVALUATION, Srikanth Madikeri, Subhadeep Dey, Marc Ferras, Petr Motlicek and Ivan Himawan, Idiap-RR-32-2016

Redundant Hash Addressing for Large-Scale Query by Example Spoken Query Detection, Afsaneh Asaei, Dhananjay Ram and Hervé Bourlard, Idiap-RR-31-2016

Information Theoretic Analysis of Production-Perception Efficiency: Case Study of Speech Pathology, Afsaneh Asaei, Milos Cernak and Hervé Bourlard, Idiap-RR-30-2016

Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT), Lesly Miculicich and Andrei Popescu-Belis, Idiap-RR-29-2016

On the impact of non-modal phonation on phonological features, Milos Cernak, Elmar Nöth, Frank Rudzicz, Heidi Christensen, Juan Rafael Orozco-Arroyave, Raman Arora, Tobias Bocklet, Hamidreza Chinaei, Julius Hannink, Phani Sankar Nidadavolu, Juan Camilo Vasquez, Maria Yancheva, Alyssa Vann and Nikolai Vogler, Idiap-RR-28-2016

Cognitive speech coding, Milos Cernak and Afsaneh Asaei, Idiap-RR-27-2016

Implementation of the Standard I-vector System for the Kaldi Speech Recognition Toolkit, Srikanth Madikeri, Subhadeep Dey, Petr Motlicek and Marc Ferras, Idiap-RR-26-2016

Joint Operation of Voice Biometrics and Presentation Attack Detection, Pavel Korshunov and Sébastien Marcel, Idiap-RR-25-2016

[URL]

Overview of BTAS 2016 Speaker Anti-spoofing Competition, Pavel Korshunov, Sébastien Marcel, Hannah Muckenhirn, A. R. Gonçalves, A. G. Souza Mello, R. P. Velloso Violato, Flávio Simões, Mário Uliani Neto, Marcus de Assis Angeloni, J. A. Stuchi, H. Dinkel, N. Chen, Yanmin Qian, D. Paul, G. Saha and Md Sahidullah, Idiap-RR-24-2016

[URL]

Cross-database evaluation of audio-based spoofing detection systems, Pavel Korshunov and Sébastien Marcel, Idiap-RR-23-2016

[URL]

Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, Alexandros Lazaridis, Milos Cernak, Pierre-Edouard Honnet and Philip N. Garner, Idiap-RR-22-2016

Explicit Suggestion of Query Terms for News Search using Topic Models and Word Embeddings, Parvaz Mahdabi and Andrei Popescu-Belis, Idiap-RR-21-2016

Feature mapping using far-field microphones for distant speech recognition, Ivan Himawan, Petr Motlicek, David Imseng and Sridha Sridharan, Idiap-RR-20-2016

Sparse Hidden Markov Models for Exemplar-based Speech Recognition Using Deep Neural Network Posterior Features, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-19-2016

End-to-End Acoustic Modeling using Convolutional Neural Networks for Automatic Speech Recognition, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-18-2016

Fast K-Means with Accurate Bounds, James Newling and Francois Fleuret, Idiap-RR-17-2016

Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information, Maryam Habibi, Parvaz Mahdabi and Andrei Popescu-Belis, Idiap-RR-16-2016

Twitter Sentiment Analysis (Almost) from Scratch, Rémi Lebret, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-15-2016

Intonation atom based emphasis transfer, Pierre-Edouard Honnet and Philip N. Garner, Idiap-RR-14-2016

The SIWIS database: a multilingual speech database with acted emphasis, Jean-Philippe Goldman, Pierre-Edouard Honnet, Rob Clark, Philip N. Garner, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Tiago Macedo, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli and Junichi Yamagishi, Idiap-RR-13-2016

Probabilistic Amplitude Demodulation features in Speech Synthesis for Improving Prosody, Alexandros Lazaridis, Milos Cernak and Philip N. Garner, Idiap-RR-12-2016

Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, Milos Cernak, Alexandros Lazaridis, Afsaneh Asaei and Philip N. Garner, Idiap-RR-11-2016

Phonetic and Phonological Posterior Search Space Hashing Exploiting Class-Specific Sparsity Structures, Afsaneh Asaei, Gil Luyet, Milos Cernak and Hervé Bourlard, Idiap-RR-10-2016

INFORMATION THEORETIC CLUSTERING FOR UNSUPERVISED DOMAIN-ADAPTATION, Subhadeep Dey, Srikanth Madikeri and Petr Motlicek, Idiap-RR-09-2016

DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Srikanth Madikeri, Marc Ferras and Petr Motlicek, Idiap-RR-08-2016

On Structured Sparsity of Phonological Posteriors for Linguistic Parsing, Milos Cernak, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-07-2016

[URL]

Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, Dhananjay Ram, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-06-2016

Low-Rank Representation For Enhanced Deep Neural Network Acoustic Models, Gil Luyet, Idiap-RR-05-2016

Low-Rank Representation of Nearest Neighbor Phone Posterior Probabilities to Enhance DNN Acoustic Modeling, Gil Luyet, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-04-2016

Sound Pattern Matching for Automatic Prosodic Event Detection, Milos Cernak, Afsaneh Asaei, Pierre-Edouard Honnet, Philip N. Garner and Hervé Bourlard, Idiap-RR-03-2016

An Analysis of Rhythmic Staccato-Vocalization Based on Frequency Demodulation for Laughter Detection in Conversational Meetings, Sucheta Ghosh, Milos Cernak, Sarbani Palit and B. B. Chaudhuri, Idiap-RR-02-2016

Sparse Subspace Modeling for Query by Example Spoken Term Detection, Dhananjay Ram, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-01-2016

A New Identity for the Least-square Solution of Overdetermined Set of Linear Equations, Saeid Haghighatshoar, Mohammad J. Taghizadeh and Afsaneh Asaei, Idiap-RR-35-2015

Towards Multiple Pronunciation Generation in Acoustic G2P Conversion Framework, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-34-2015

Posterior-Based Multi-Stream Formulation To Combine Multiple Grapheme-to-Phoneme Conversion Techniques, Marzieh Razavi and Mathew Magimai-Doss, Idiap-RR-33-2015

HMM-based Non-native Accent Assessment using Posterior Features, Ramya Rasipuram, Milos Cernak and Mathew Magimai-Doss, Idiap-RR-32-2015

Syllabic Pitch Tuning for Neutral-to-Emotional Voice Conversion, Lakshmi Saheer, Xingyu Na and Milos Cernak, Idiap-RR-31-2015

Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition; Comparison with the Envelope-Variance Measure, Ivan Himawan, Petr Motlicek, Sridha Sridharan, David Dean and Dian Tjondronegoro, Idiap-RR-30-2015

Joint Similarity Learning for Predicting Links in Networks with Multiple-type Links, Majid Yazdani and Andrei Popescu-Belis, Idiap-RR-29-2015

Exploiting foreign resources for DNN-based ASR, Petr Motlicek, David Imseng, Blaise Potard, Philip N. Garner and Ivan Himawan, Idiap-RR-27-2015

Transfer Learning through Greedy Subset Selection, Ilja Kuzborskij, Francesco Orabona and Barbara Caputo, Idiap-RR-26-2015

Syntactic Parsing of Morphologically Rich Languages Using Deep Neural Networks, Joël Legrand and Ronan Collobert, Idiap-RR-25-2015

Learning linearly separable features for speech recognition using convolutional neural networks, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-24-2015

[URL]

Analysis of CNN-based Speech Recognition System using Raw Speech as Input, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-23-2015

Simple Image Description Generator via a Linear Phrase-based Model, Rémi Lebret, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-22-2015

"The Sum of Its Parts": Joint Learning of Word and Phrase Representations with Autoencoders, Rémi Lebret and Ronan Collobert, Idiap-RR-21-2015

Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, Srikanth Madikeri, Ivan Himawan, Petr Motlicek and Marc Ferras, Idiap-RR-20-2015

KL-HMM BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, Srikanth Madikeri and Hervé Bourlard, Idiap-RR-19-2015

Improving Real Time Factor of Information Bottleneck-based Speaker Diarization System, Srikanth Madikeri, David Imseng and Hervé Bourlard, Idiap-RR-18-2015

COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, Idiap-RR-17-2015

EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, Petr Motlicek, Subhadeep Dey, Srikanth Madikeri and Lukas Burget, Idiap-RR-16-2015

Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, Alexandre Hyafil and Milos Cernak, Idiap-RR-14-2015

On the Application of Automatic Subword Unit Derivation and Pronunciation Generation for Under-Resourced Language ASR: A Study on Scottish Gaelic, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-13-2015

Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, Ramya Rasipuram, Milos Cernak, Alexandre Nanchen and Mathew Magimai-Doss, Idiap-RR-12-2015

An Empirical Model of Emphatic Word Detection, Milos Cernak and Pierre-Edouard Honnet, Idiap-RR-11-2015

Acoustic Data-Driven Grapheme-to-Phoneme Conversion in the Probabilistic Lexical Modeling Framework, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-10-2015

Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German, Xiao Pu, Laura Mascarell, Andrei Popescu-Belis, Mark Fishel, Ngoc-Quang Luong and Martin Volk, Idiap-RR-09-2015

Phrase-based Image Captioning, Rémi Lebret, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-08-2015

Speech vocoding for laboratory phonology, Milos Cernak, Štefan Beňuš and Alexandros Lazaridis, Idiap-RR-07-2015

Objective Intelligibility Assessment of Text-to-Speech Systems Through Utterance Verification, Raphael Ullmann, Ramya Rasipuram, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-06-2015

Incremental Syllable-Context Phonetic Vocoding, Milos Cernak, Philip N. Garner, Alexandros Lazaridis, Petr Motlicek and Xingyu Na, Idiap-RR-05-2015

Phonological vocoding using artificial neural networks, Milos Cernak, Blaise Potard and Philip N. Garner, Idiap-RR-04-2015

A simple continuous excitation model for parametric vocoding, Philip N. Garner, Milos Cernak and Blaise Potard, Idiap-RR-03-2015

Preliminary Work on Speaker Adaptation for DNN-Based Speech Synthesis, Blaise Potard, Petr Motlicek and David Imseng, Idiap-RR-02-2015

LETHA: Learning from High Quality Inputs for 3D Pose Estimation in Low Quality Images., Adrian Penate-Sanchez, Francesc Moreno-Noguer, Juan Andrade-Cetto and Francois Fleuret, Idiap-RR-22-2014

Development of Bilingual ASR System for MediaParl Corpus, Petr Motlicek, David Imseng, Milos Cernak and Namhoon Kim, Idiap-RR-21-2014

Theoretical Analysis of Euclidean Distance Matrix Completion for Ad hoc Microphone Array Calibration, Mohammad J. Taghizadeh, Idiap-RR-20-2014

Articulatory Feature based Continuous Speech Recognition using Probabilistic Lexical Modeling, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-19-2014

Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-18-2014

Feature Mapping of Multiple Beamformed Sources for Robust Overlapping Speech Recognition Using a Microphone Array, Weifeng Li, Longbiao Wang, Yicong Zhou, John Dines, Mathew Magimai-Doss, Hervé Bourlard and Qingmin Liao, Idiap-RR-17-2014

Objective Speech Intelligibility Assessment through Comparison of Phoneme Class Conditional Probability Sequences, Raphael Ullmann, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-16-2014

Raw Speech Signal-based Continuous Speech Recognition using Convolutional Neural Networks, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-15-2014

Weakly Supervised Object Segmentation with Convolutional Neural Networks, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-13-2014

Biometrics Evaluation under Spoofing Attacks, Ivana Chingovska, André Anjos and Sébastien Marcel, Idiap-RR-12-2014

Exemplar-based Sparse Representation for Posterior Features, Sara Bahaadini, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-11-2014

Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, Milos Cernak, Alexandros Lazaridis, Philip N. Garner and Petr Motlicek, Idiap-RR-10-2014

Topic-Level Extractive Summarization of Lectures and Meetings Using a Snippet Similarity Graph, Chidansh A. Bhatt and Andrei Popescu-Belis, Idiap-RR-09-2014

EYEDIAP Database: Data Description and Gaze Tracking Evaluation Benchmarks, Kenneth Alberto Funes Mora, Florent Monay and Jean-Marc Odobez, Idiap-RR-08-2014

Sparse Gammatone Signal Model Predicts Perceived Noise Intrusiveness, Raphael Ullmann and Hervé Bourlard, Idiap-RR-07-2014

Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, Alexandre Heili, Adolfo Lopez Mendez and Jean-Marc Odobez, Idiap-RR-06-2014

Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, Alexandre Heili, Adolfo Lopez-Mendez and Jean-Marc Odobez, Idiap-RR-05-2014

Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, Pierre-Edouard Honnet, Alexandros Lazaridis, Jean-Philippe Goldman and Philip N. Garner, Idiap-RR-04-2014

SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, Alexandros Lazaridis, Pierre-Edouard Honnet and Philip N. Garner, Idiap-RR-03-2014

Acoustic and Lexical Resource Constrained ASR using Language-Independent Acoustic Model and Language-Dependent Probabilistic Lexical Model, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-02-2014

Score Calibration in Face Recognition, Miranti I. Mantasari, Manuel Günther, Roy Wallace, Rahim Saedi, Sébastien Marcel and David Van Leeuwen, Idiap-RR-01-2014

Is Deep Learning Really Necessary for Word Embeddings?, Rémi Lebret, Joël Legrand and Ronan Collobert, Idiap-RR-44-2013

On Modeling Context-dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-43-2013

Spoofing Attacks To 2D Face Recognition Systems With 3D Masks, Nesli Erdogmus and Sébastien Marcel, Idiap-RR-42-2013

Recurrent Convolutional Neural Networks for Scene Labeling, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-41-2013

End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks, Dimitri Palaz, Ronan Collobert and Mathew Magimai-Doss, Idiap-RR-40-2013

Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, Idiap-RR-39-2013

ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, Petr Motlicek, Philip N. Garner, Namhoon Kim and Jeongmi Cho, Idiap-RR-38-2013

FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, Petr Motlicek, Daniel Povey and Martin Karafiat, Idiap-RR-37-2013

The 2013 Face Recognition Evaluation in Mobile Environment, Manuel Günther, Artur Costa-Pazo, Changxing Ding, Elhocine Boutellaa, Giovani Chiachia, Honglei Zhang, Marcus de Assis Angeloni, Vitomir Struc, Elie Khoury, Esteban Vazquez-Fernandez, Dacheng Tao, Messaoud Bengherabi, David Cox, Serkan Kiranyaz, Tiago de Freitas Pereira, Jerneja Zganec-Gros, Enrique Argones-Rúa, Nicolas Pinto, Moncef Gabbouj, Flávio Simões, Simon Dobrisek, Daniel González-Jiménez, Anderson Rocha, Mário Uliani Neto, Nikola Pavesic, Alexandre Falcão, Ricardo Violato and Sébastien Marcel, Idiap-RR-36-2013

On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, Elie Khoury, Manuel Günther, Laurent El Shafey and Sébastien Marcel, Idiap-RR-35-2013

I4U Submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification, Rahim Saedi, Kong Aik Lee, Tomi Kinnunen, Tawfik Hasan, Benoit Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo Luis Sordo Martinez, Jia Min Karen Kua, Changhuai You, Hanwu Sun, Anthony Larcher, Padmanabhan Rajan, Ville Hautamäki, Cemal Hanilci, Billy Braithwaite, Gonzalez-Hautamäki Rosa, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, Navid Shokouhi, Driss Matrouf, Laurent El Shafey, Pejman Mowlaee, Julien Epps, Tharmarajah Thiruvaran, David Van Leeuwen, Bin Ma, Haizhou Li, John Hansen, Jean-François Bonastre, Sébastien Marcel, John Mason and Eliathamby Ambikairajah, Idiap-RR-34-2013

An Open-source State-of-the-art Toolbox for Broadcast News Diarization, Mickael Rouvier, Gregor Dupuy, Paul Gay, Elie Khoury, Teva Merlin and Sylvain Meignier, Idiap-RR-33-2013

The 2013 Speaker Recognition Evaluation in Mobile Environment, Elie Khoury, Bostjan Vesnicer, Javier Franco-Pedroso, Ricardo Violato, Zenelabidine Boulkenafet, Luis-Miguel Mazaira Fernandez, Mireia Diez, Justina Kosmala, Houssemeddine Khemiri, Tomas Cipr, Rahim Saedi, Manuel Günther, Jerneja Zganec-Gros, Ruben Zazo Candil, Flávio Simões, Messaoud Bengherabi, Augustin Alvarez Marquina, Mikel Penagarikano, Alberto Abad, Mehdi Boulayemen, Petr Schwarz, David Van Leeuwen, Javier Gonzalez-Domınguez, Mário Uliani Neto, Elhocine Boutellaa, Pedro Gomez Vilda, Amparo Varona, Dijana Petrovska-Delacretaz, Pavel Matejka, Joaquin Gonzalez-Rodrıguez, Tiago de Freitas Pereira, Farid Harizi, Luis Javier Rodriguez-Fuentes, Laurent El Shafey, Marcus de Assis Angeloni, German Bordel, Gérard Chollet and Sébastien Marcel, Idiap-RR-32-2013

Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, Elie Khoury, Paul Gay and Jean-Marc Odobez, Idiap-RR-31-2013

Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, Elie Khoury, Laurent El Shafey, Chris McCool, Manuel Günther and Sébastien Marcel, Idiap-RR-30-2013

Word Embeddings through Hellinger PCA, Rémi Lebret and Ronan Collobert, Idiap-RR-29-2013

Understanding Factors in Emotion Perception, Lakshmi Saheer and Blaise Potard, Idiap-RR-28-2013

Spoofing in 2D Face Recognition with 3D Masks and Anti-spoofing with Kinect, Nesli Erdogmus and Sébastien Marcel, Idiap-RR-27-2013

Investigating time-sensitive topic model approaches for action recognition, Romain Tavenard, Remi Emonet and Jean-Marc Odobez, Idiap-RR-26-2013

Automatic Speech Indexing System of Bilingual Video Parliament Interventions, Gyorgy Szaszak, Milos Cernak, Philip N. Garner, Petr Motlicek, Alexandre Nanchen and Flavio Tarsetti, Idiap-RR-25-2013

Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, Milos Cernak, Xingyu Na and Philip N. Garner, Idiap-RR-24-2013

Using Phonological Phrase Segmentation to Improve Automatic Keyword Spotting for the Highly Agglutinating Hungarian Language, Gyorgy Szaszak and Andras Beke, Idiap-RR-23-2013

Recurrent Convolutional Neural Networks for Scene Parsing, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-22-2013

Unsupervised Methods for Activity Analysis and Detection of Abnormal Events, Remi Emonet and Jean-Marc Odobez, Idiap-RR-21-2013

Analyse non supervisée d'activités en vidéo surveillance pour l'analyse de scène et la détection d'événements anormaux, Remi Emonet and Jean-Marc Odobez, Idiap-RR-20-2013

[URL]

Anti-spoofing in action: joint operation with a verification system, Ivana Chingovska, André Anjos and Sébastien Marcel, Idiap-RR-19-2013

The 2nd Competition on Counter Measures to 2D Face Spoofing Attacks, Ivana Chingovska, André Anjos and Sébastien Marcel, Idiap-RR-18-2013

Session Variability Modelling for Face Authentication, Chris McCool, Roy Wallace, Mitchell McLaren, Laurent El Shafey and Sébastien Marcel, Idiap-RR-17-2013

Learning Categories from Few Examples with Multi Model Knowledge Transfer, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, Idiap-RR-16-2013

Probabilistic Lexical Modeling and Grapheme-based Automatic Speech Recognition, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-15-2013

Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-14-2013

Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, Dimitri Palaz, Ronan Collobert and Mathew Magimai-Doss, Idiap-RR-13-2013

Bias Adaptation for Vocal Tract Length Normalization, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, Idiap-RR-12-2013

Statistical models for HMM/ANN hybrids, Philip N. Garner and David Imseng, Idiap-RR-11-2013

Adaptation Experiments on French MediaParl ASR, Gyorgy Szaszak, Idiap-RR-10-2013

Using out-of-language data to improve an under-resourced speech recognizer, David Imseng, Petr Motlicek, Hervé Bourlard and Philip N. Garner, Idiap-RR-09-2013

Enhancing State Mapping-Based Cross-Lingual Speaker Adaptation using Phonological Knowledge in a Data-Driven Manner, Hui Liang and John Dines, Idiap-RR-08-2013

A Scalable Formulation of Probabilistic Linear Discriminant Analysis: Applied to Face Recognition, Laurent El Shafey, Chris McCool, Roy Wallace and Sébastien Marcel, Idiap-RR-07-2013

[URL]

ON THE (UN)IMPORTANCE OF THE CONTEXTUAL FACTORS IN HMM-BASED SPEECH SYNTHESIS AND CODING, Milos Cernak, Petr Motlicek and Philip N. Garner, Idiap-RR-06-2013

Convolutional Pitch Target Approximation Model for Speech Synthesis, Xingyu Na and Philip N. Garner, Idiap-RR-05-2013

KL-HMM and Probabilistic Lexical Modeling, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-04-2013

MediaParl: Bilingual mixed language accented speech database, David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorvé and Alexandre Nanchen, Idiap-RR-03-2013

Robust triphone mapping for acoustic modeling, Milos Cernak, David Imseng and Hervé Bourlard, Idiap-RR-02-2013

Comparing different acoustic modeling techniques for multilingual boosting, David Imseng, John Dines, Petr Motlicek, Philip N. Garner and Hervé Bourlard, Idiap-RR-01-2013

Data-Driven Enhancement of State Mapping-Based Cross-Lingual Speaker Adaptation, Hui Liang, Idiap-RR-38-2012

A Probabilistic Framework for Multiple Speaker Localization, Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel and Dietrich Klakow, Idiap-RR-37-2012

IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, Petr Motlicek, Fabio Valente and Igor Szoke, Idiap-RR-36-2012

Automatic Social Role Recognition In Professional Meetings, A. Sapru and Hervé Bourlard, Idiap-RR-35-2012

Grapheme and Multilingual Posterior Features For Under-Resource Speech Recognition: A Study on Scottish Gaelic, Ramya Rasipuram, Peter Bell and Mathew Magimai-Doss, Idiap-RR-34-2012

The Vernissage Corpus: A Multimodal Human-Robot-Interaction Dataset, Dinesh Babu Jayagopi, Samira Sheikhi, David Klotz, Johannes Wienke, Jean-Marc Odobez, Sebastian Wrede, Vasil Khalidov, Laurent Son Nguyen, Britta Wrede and Daniel Gatica-Perez, Idiap-RR-33-2012

A Survey on Language Modeling using Neural Networks, Nikolaos Pappas and Thomas Meyer, Idiap-RR-32-2012

Translation Error Spotting from a User's Point of View, Thomas Meyer, Idiap-RR-31-2012

Improving Object Classification using Pose Information, Hugo Penedones, Ronan Collobert, Francois Fleuret and David Grangier, Idiap-RR-30-2012

An Open Source Framework for Standardized Comparisons of Face Recognition Algorithms, Manuel Günther, Roy Wallace and Sébastien Marcel, Idiap-RR-29-2012

Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-28-2012

Using self-context for multimodal detection of head nods in face-to-face interactions, Laurent Son Nguyen, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-27-2012

Baseline System for Automatic Speech Recognition with French GlobalPhone Database, Sandrine Revaz and Milos Cernak, Idiap-RR-26-2012

Bob: a free signal processing and machine learning toolbox for researchers, André Anjos, Laurent El Shafey, Roy Wallace, Manuel Günther, Chris McCool and Sébastien Marcel, Idiap-RR-25-2012

Integrating Language Identification to improve Multilingual Speech Recognition, Holger Caesar, Idiap-RR-24-2012

Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, Idiap-RR-23-2012

Supervised and unsupervised Web-based language model domain adaptation, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, Idiap-RR-22-2012

Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, Gwénolé Lecorvé and Petr Motlicek, Idiap-RR-21-2012

Application of Subspace Gaussian Mixture Models in Contrastive Acoustic Scenarios, Petr Motlicek, Philip N. Garner, David Imseng and Fabio Valente, Idiap-RR-20-2012

On the Effectiveness of Local Binary Patterns in Face Anti-spoofing, Ivana Chingovska, André Anjos and Sébastien Marcel, Idiap-RR-19-2012

Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, Petr Motlicek, Laurent El Shafey, Roy Wallace, Chris McCool and Sébastien Marcel, Idiap-RR-18-2012

Integrating Posterior Features and Self-Organizing Maps for Isolated Word Recognition without Dynamic Programming, Serena Soldo and Mathew Magimai-Doss, Idiap-RR-17-2012

Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, Weifeng Li and Hervé Bourlard, Idiap-RR-16-2012

Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, Idiap-RR-15-2012

Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, Maryam Habibi and Andrei Popescu-Belis, Idiap-RR-14-2012

Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, Chris McCool, Sébastien Marcel, Abdenour Hadid, Matti Pietikainen, Pavel Matejka, Jan Cernocky, Norman Poh, J. Kittler, Anthony Larcher, Christophe Levy, Driss Matrouf, Jean-François Bonastre, Phil Tresadern and Timothy Cootes, Idiap-RR-13-2012

VTLN-Based Rapid Cross-Lingual Adaptation for Statistical Parametric Speech Synthesis, Lakshmi Saheer, Hui Liang, John Dines and Philip N. Garner, Idiap-RR-12-2012

Combining Vocal Tract Length Normalization with Linear Transformations in a Bayesian Framework, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, Idiap-RR-11-2012

A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, Youssef Oualil, Friedrich Faubel, Mathew Magimai-Doss and Dietrich Klakow, Idiap-RR-10-2012

A Multiple Hypothesis Gaussian Mixture Filter for Acoustic Source Localization and Tracking, Youssef Oualil, Friedrich Faubel and Dietrich Klakow, Idiap-RR-09-2012

Progress report of a project in very low bit-rate speech coding, Milos Cernak, Philip N. Garner and Petr Motlicek, Idiap-RR-08-2012

Improving Control of Dexterous Hand Prostheses Using Adaptive Learning, Tatiana Tommasi, Francesco Orabona, Claudio Castellini and Barbara Caputo, Idiap-RR-07-2012

Transfer Learning of Visual Concepts across Robots: a Discriminative Approach, Sriram Prasath Elango, Tatiana Tommasi and Barbara Caputo, Idiap-RR-06-2012

Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, Gelareh Mohammadi and Alessandro Vinciarelli, Idiap-RR-05-2012

The Kaldi Speech Recognition Toolkit, Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer and Karel Vesely, Idiap-RR-04-2012

Cross-pollination of normalisation techniques from speaker to face authentication using Gaussian mixture models, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, Idiap-RR-03-2012

Face detection using boosted Jaccard distance-based regression, Cosmin Atanasoaei, Chris McCool and Sébastien Marcel, Idiap-RR-02-2012

Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, David Imseng, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-01-2012

IMPROVING MICROPHONE ARRAY SPEECH RECOGNITION WITH COCHLEAR IMPLANT-LIKE SPECTRALLY REDUCED SPEECH, Cong-Thanh Do, Mohammad J. Taghizadeh and Philip N. Garner, Idiap-RR-40-2011

BROADBAND BEAMPATTERN FOR MULTI-CHANNEL SPEECH ACQUISITION AND DISTANT SPEECH RECOGNITION, Mohammad J. Taghizadeh, Philip N. Garner and Hervé Bourlard, Idiap-RR-39-2011

Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-38-2011

Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, Laurent El Shafey, Roy Wallace and Sébastien Marcel, Idiap-RR-37-2011

Robustness of Group Delay Representations for Noisy Speech Signals, Sree Hari Krishnan Parthasarathi, Padmanabhan Rajan and Hema A Murthy, Idiap-RR-36-2011

Continuous Speech Recognition using Boosted Binary Features, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-35-2011

Multimodal Cue Detection Engine for Orchestrated Entertainment, Danil Korchagin, Stefan Duffner, Petr Motlicek and Carl Scheffler, Idiap-RR-34-2011

HEAT: Iterative Relevance Feedback with One Million Images, Nicolae Suditu and Francois Fleuret, Idiap-RR-33-2011

Finding Information in Multimedia Records of Meetings, Andrei Popescu-Belis, Denis Lalanne and Hervé Bourlard, Idiap-RR-32-2011

A Speech-based Just-in-Time Retrieval System using Semantic Search, Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen and Philip N. Garner, Idiap-RR-31-2011

Learning from Images with Captions Using the Maximum Margin Set Algorithm, Jie Luo, Francesco Orabona, Barbara Caputo and Vittorio Ferrari, Idiap-RR-30-2011

Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, Idiap-RR-28-2011

Learning from Candidate Labeling Sets, Jie Luo and Francesco Orabona, Idiap-RR-27-2011

A Large-Scale Database of Images and Captions for Automatic Face Naming, Mert Ozcan, Jie Luo, Vittorio Ferrari and Barbara Caputo, Idiap-RR-26-2011

Multiclass Transfer Learning from Unconstrained Priors, Jie Luo, Tatiana Tommasi and Barbara Caputo, Idiap-RR-25-2011

Speech Enhancement using Beta-order MMSE Spectral Amplitude Estimator with Laplacian Prior, Hamid Reza Abutalebi, Mehdi Rashidinejad, Hervé Bourlard and Ali Akbar Tadaion, Idiap-RR-24-2011

Intuitive Recipes for Uncertainty Decoding with SNR Features for Noise Robust ASR, Georgios Skoumas and Philip N. Garner, Idiap-RR-23-2011

Multi-party Speech Recovery Exploiting Structured Sparsity Models, Afsaneh Asaei, Mohammad J. Taghizadeh, Hervé Bourlard and Volkan Cevher, Idiap-RR-22-2011

Multitask Learning to Improve Articulatory Feature Estimation and Phoneme Recognition, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-21-2011

Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, Danil Korchagin, Idiap-RR-20-2011

Improving non-native ASR through stochastic multilingual phoneme space transformations, David Imseng, Hervé Bourlard, John Dines, Philip N. Garner and Mathew Magimai-Doss, Idiap-RR-19-2011

Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech, Mirjam Wester and Hui Liang, Idiap-RR-18-2011

Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation, Hui Liang and John Dines, Idiap-RR-17-2011

AN INTEGRATED FRAMEWORK FOR MULTI-CHANNEL MULTI-SOURCE LOCALIZATION AND VOICE ACTIVITY DETECTION, Mohammad J. Taghizadeh, Philip N. Garner, Hervé Bourlard, Hamid Reza Abutalebi and Afsaneh Asaei, Idiap-RR-16-2011

Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition., Philip N. Garner, Idiap-RR-15-2011

LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-14-2011

Language dependent universal phoneme posterior estimation for mixed language speech recognition, David Imseng, Hervé Bourlard, Mathew Magimai-Doss and John Dines, Idiap-RR-13-2011

Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-12-2011

Ultra-Fast Optimization Algorithm for Sparse Multi Kernel Learning, Francesco Orabona and Jie Luo, Idiap-RR-11-2011

Just-in-Time Multimodal Association and Fusion from Home Entertainment, Danil Korchagin, Petr Motlicek, Stefan Duffner and Hervé Bourlard, Idiap-RR-10-2011

Social Focus of Attention as a Time Function Derived from Multimodal Signals, Danil Korchagin and Hamid Reza Abutalebi, Idiap-RR-09-2011

Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, Danil Korchagin, Idiap-RR-08-2011

On-line unsupervised adaptation for face verification using Gaussian Mixture Models with multiple user models, Niklas Johansson, Chris McCool and Sébastien Marcel, Idiap-RR-07-2011

Parts-Based Face Verification using Local Frequency Bands, Chris McCool and Sébastien Marcel, Idiap-RR-06-2011

When Users Meet Technology: The Meeting Browser Development Helix, Andrei Popescu-Belis, Denis Lalanne and Hervé Bourlard, Idiap-RR-05-2011

Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition, Afsaneh Asaei, Hervé Bourlard and Volkan Cevher, Idiap-RR-04-2011

Towards semi-supervised learning of semantic spatial concepts, Jesus Martinez-Gomez and Barbara Caputo, Idiap-RR-03-2011

Integrating Articulatory Features using Kullback-Leibler Divergence based Acoustic Model for Phoneme Recognition, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-02-2011

Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, Stefan Duffner and Jean-Marc Odobez, Idiap-RR-01-2011

On Improving Face Detection Performance by Modelling Contextual Information, Cosmin Atanasoaei, Chris McCool and Sébastien Marcel, Idiap-RR-43-2010

Automatic Time Skew Detection and Correction, Danil Korchagin, Idiap-RR-42-2010

The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, Joern Anemueller, Joerg-Henrik Back, Barbara Caputo, michal havlena, Jie Luo, Hendrik Kayser, Bastian Leibe, Petr Motlicek, Tomas Pajdla, Misha Pavel, Akihiko Torii, Luc Van Gool, Hynek Hermansky and Alon Zweig, Idiap-RR-41-2010

Towards Robust Place Recognition for Robot Localization, Muhammad Muneeb Ullah, Andrzej Pronobis, Barbara Caputo, Jie Luo, Patric Jensfelt and Henrik I. Christensen, Idiap-RR-40-2010

Hierarchical Tandem Features for ASR in Mandarin, Joel Praveen Pinto, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-39-2010

Fast Bounding Box Estimation based Face Detection, Venkatesh Bala Subburaman and Sébastien Marcel, Idiap-RR-38-2010

The TA2 Database - A Multi-Modal Database from Home Entertainment, Stefan Duffner, Petr Motlicek and Danil Korchagin, Idiap-RR-37-2010

A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, Idiap-RR-36-2010

Tuning-Robust Initialization Methods for Speaker Diarization, David Imseng and Gerald Friedland, Idiap-RR-35-2010

Measuring the gap between HMM-based ASR and TTS, John Dines, Junichi Yamagishi and Simon King, Idiap-RR-34-2010

Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, Idiap-RR-33-2010

Implementation of VTLN for Statistical Speech Synthesis, Lakshmi Saheer, John Dines, Philip N. Garner and Hui Liang, Idiap-RR-32-2010

MOBIO: Mobile Biometric Face and Speaker Authentication, Sébastien Marcel, Chris McCool, Cosmin Atanasoaei, Flavio Tarsetti, Jan Pesan, Pavel Matejka, Jan Cernocky, Mika Helistekangas and Markus Turtinen, Idiap-RR-31-2010

On the Results of the First Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation, Sébastien Marcel, Chris McCool, Pavel Matejka, Timo Ahonen, Jan Cernocky and al, Idiap-RR-30-2010

Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, Idiap-RR-29-2010

Mining Human Location-Routines using a Multi-Level Topic Model, Katayoun Farrahi and Daniel Gatica-Perez, Idiap-RR-28-2010

Hands Free Audio Analysis from Home Entertainment, Danil Korchagin, Philip N. Garner and Petr Motlicek, Idiap-RR-27-2010

The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites, Andrei Popescu-Belis, Jonathan Kilgour, Alexandre Nanchen and Peter Poller, Idiap-RR-26-2010

Study of Jacobian Normalization for VTLN, Lakshmi Saheer, Philip N. Garner and John Dines, Idiap-RR-25-2010

KL Realignment for Speaker Diarization with Multiple Feature Streams, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-24-2010

Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-23-2010

An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-22-2010

English Spoken Term Detection in Multilingual Recordings, Petr Motlicek, Fabio Valente and Philip N. Garner, Idiap-RR-21-2010

Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, Radu-Andrei Negoescu, Alexander Loui and Daniel Gatica-Perez, Idiap-RR-20-2010

Modeling and Understanding Flickr Communities through Topic-based Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-19-2010

Flickr Groups: Multimedia Communities for Multimedia Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-18-2010

Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, Oya Aran and Daniel Gatica-Perez, Idiap-RR-17-2010

An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, Hui Liang and John Dines, Idiap-RR-16-2010

Towards mixed language speech recognition systems, David Imseng, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-15-2010

Hierarchical Multilayer Perceptron based Language Identification, David Imseng, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-14-2010

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, Anindya Roy and Sébastien Marcel, Idiap-RR-13-2010

Estimating Cohesion in Small Groups using Audio-Visual Nonverbal Behavior, Hayley Hung and Daniel Gatica-Perez, Idiap-RR-12-2010

Investigation of kNN Classifier on Posterior Features Towards Application in Automatic Speech Recognition, Afsaneh Asaei, Hervé Bourlard and Benjamin Picart, Idiap-RR-11-2010

Tracter: A Lightweight Dataflow Framework, Philip N. Garner and John Dines, Idiap-RR-10-2010

Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation, Sébastien Marcel, Chris McCool, Pavel Matejka, Timo Ahonen and Jan Cernocky, Idiap-RR-09-2010

The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, Andrzej Pronobis, Jie Luo and Barbara Caputo, Idiap-RR-08-2010

Online-Batch Strongly Convex Multi Kernel Learning, Francesco Orabona, Jie Luo and Barbara Caputo, Idiap-RR-07-2010

OM-2: An Online Multi-class Multi-kernel Learning Algorithm, Jie Luo, Francesco Orabona, Marco Fornoni, Barbara Caputo and Nicolo Cesa-Bianchi, Idiap-RR-06-2010

A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, Hui Liang, John Dines and Lakshmi Saheer, Idiap-RR-05-2010

Application of Out-Of-Language Detection To Spoken-Term Detection, Petr Motlicek and Fabio Valente, Idiap-RR-04-2010

AMIDA/Klewel Mini-Project, Petr Motlicek, Philip N. Garner, Maël Guillemot and Vincent Bozzo, Idiap-RR-03-2010

An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, David Imseng and Gerald Friedland, Idiap-RR-02-2010

Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-01-2010

VTLN Adaptation for Statistical Speech Synthesis, Lakshmi Saheer, Philip N. Garner, John Dines and Hui Liang, Idiap-RR-41-2009

Automatic Temporal Alignment of AV Data with Confidence Estimation, Danil Korchagin, Philip N. Garner and John Dines, Idiap-RR-40-2009

Automatic Temporal Alignment of AV Data, Danil Korchagin, Philip N. Garner and John Dines, Idiap-RR-39-2009

User Interface Design in a Just-in-time Retrieval System for Meetings, Andrei Popescu-Belis, Peter Poller, Jonathan Kilgour, Mike Flynn, Sebastian Germesin, Alexandre Nanchen and Majid Yazdani, Idiap-RR-38-2009

On MLP-based Posterior Features for Template-based ASR, Serena Soldo, Mathew Magimai-Doss, Joel Praveen Pinto and Hervé Bourlard, Idiap-RR-37-2009

Memoirs of Togetherness from Audio Logs, Danil Korchagin, Idiap-RR-36-2009

APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, Sriram Ganapathy, Samuel Thomas, Petr Motlicek and Hynek Hermansky, Idiap-RR-35-2009

MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-34-2009

Autoregressive Models of Amplitude Modulations in Audio Compression, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-33-2009

Wide-Band Audio Coding based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-32-2009

Out-of-Scene AV Data Detection, Danil Korchagin, Idiap-RR-31-2009

Analysis of F0 and Cepstral Features for Robust Automatic Gender Recognition, Marianna Pronobis and Mathew Magimai-Doss, Idiap-RR-30-2009

Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, Anindya Roy and Sébastien Marcel, Idiap-RR-29-2009

Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, Anindya Roy and Sébastien Marcel, Idiap-RR-28-2009

Bayesian Networks to Combine Intensity and Color Information in Face Recognition, Guillaume Heusch and Sébastien Marcel, Idiap-RR-27-2009

Robust Speaker Diarization for Short Speech Recordings, David Imseng and Gerald Friedland, Idiap-RR-26-2009

SNR Features for Automatic Speech Recognition, Philip N. Garner, Idiap-RR-25-2009

On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR, Mathew Magimai-Doss, Guillermo Aradilla and Hervé Bourlard, Idiap-RR-24-2009

Speaker Change Detection with Privacy-Preserving Audio Cues, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Daniel Gatica-Perez and Hervé Bourlard, Idiap-RR-23-2009

Co-occurrence Models for Image Annotation and Retrieval, Nikhil Garg, Idiap-RR-22-2009

Tagging and Retrieving Images with Co-Occurrence Models: from Corel to Flickr, Nikhil Garg and Daniel Gatica-Perez, Idiap-RR-21-2009

Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features, Hayley Hung and Silèye O. Ba, Idiap-RR-20-2009

Fast Human Detection in Videos using Joint Appearance and Foreground Learning from Covariances of Image Feature Subsets, Jian Yao and Jean-Marc Odobez, Idiap-RR-19-2009

Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity, Benjamin Picart, Idiap-RR-18-2009

Speech recognition with speech synthesis models by marginalising over decision tree leaves, John Dines, Lakshmi Saheer and Hui Liang, Idiap-RR-17-2009

Measuring the gap between HMM-based ASR and TTS, John Dines, Junichi Yamagishi and Simon King, Idiap-RR-16-2009

Real-Time ASR from Meetings, Philip N. Garner, John Dines, Thomas Hain, Asmaa El Hannani, Martin Karafiat, Danil Korchagin, Mike Lincoln, Vincent Wan and Le Zhang, Idiap-RR-15-2009

Robustness of Phase based Features for Speaker Recognition, Padmanabhan Rajan, Sree Hari Krishnan Parthasarathi and Hema A Murthy, Idiap-RR-14-2009

Automatic vs. human question answering over multimedia meeting recordings, Quoc Anh Le and Andrei Popescu-Belis, Idiap-RR-13-2009

Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-12-2009

Comparing meeting browsers using a task-based evaluation method, Andrei Popescu-Belis, Idiap-RR-11-2009

Multiple Object Tracking using Flow Linear Programming, Jerome Berclaz, Francois Fleuret and Pascal Fua, Idiap-RR-10-2009

ClusterRank: A Graph Based Method for Meeting Summarization, Nikhil Garg, Benoit Favre, Korbinian Reidhammer and Dilek Hakkani Tür, Idiap-RR-09-2009

A MAP Approach to Noise Compensation of Speech, Philip N. Garner, Idiap-RR-08-2009

Novel initialization methods for Speaker Diarization, David Imseng, Idiap-RR-07-2009

Automatic Out-of-Language Detection based on Confidence Measures derived from LVCSR Word and Phone Lattices, Petr Motlicek, Idiap-RR-06-2009

Model Adaptation with Least-Squares SVM for Adaptive Hand Prosthetics, Francesco Orabona, Claudio Castellini, Barbara Caputo, Angelo Emanuele Fiorilla and Giulio Sandini, Idiap-RR-05-2009

Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-04-2009

Parts-Based Face Verification using Local Frequency Bands, Chris McCool and Sébastien Marcel, Idiap-RR-03-2009

Visual activity context for focus of attention estimation in dynamic meetings, Silèye O. Ba, Hayley Hung and Jean-Marc Odobez, Idiap-RR-02-2009

Support Vector Machines with a Reject Option, Yves Grandvalet, Joseph Keshet, Alain Rakotomamonjy and Stéphane Canu, Idiap-RR-01-2009

CLEF2008 Image Annotation Task: an SVM Confidence-Based Approach, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, Idiap-RR-77-2008

Multi-layer Boosting for Pattern Recognition, Francois Fleuret, Idiap-RR-76-2008

Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-75-2008

MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-74-2008

Integrating audio and vision for robust automatic gender recognition, Marianna Pronobis and Mathew Magimai-Doss, Idiap-RR-73-2008

How does a dictation machine recognize speech?, T. Dutoit, L. Couvreur and Hervé Bourlard, Idiap-RR-72-2008

Entropy coding of Quantized Spectral Components in FDLP audio codec, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-71-2008

Modulation Frequency Features For Phoneme Recognition In Noisy Speech, Sriram Ganapathy, Samuel Thomas and Hynek Hermansky, Idiap-RR-70-2008

Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Hynek Hermansky and Mathew Magimai-Doss, Idiap-RR-69-2008

Kernel Based Text-Independnent Speaker Verification, Johnny Mariéthoz, Samy Bengio and Yves Grandvalet, Idiap-RR-68-2008

Acoustic Models for Posterior Features in Speech Recognition, Guillermo Aradilla, Idiap-RR-67-2008

Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, Hayley Hung, Yan Huang, Chuohao Yeo and Daniel Gatica-Perez, Idiap-RR-66-2008

Identifying Dominant People in Meetings from Audio-Visual Sensors, Hayley Hung and Daniel Gatica-Perez, Idiap-RR-65-2008

Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, Sarah Favre, Hugues Salamin and Alessandro Vinciarelli, Idiap-RR-64-2008

Calibration from statistical properties of the visual world, Etienne Grossmann, José António Gaspar and Francesco Orabona, Idiap-RR-63-2008

Topickr: Flickr Groups and Users Reloaded, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-61-2008

Composite Kernel Learning, Marie Szafranski, Yves Grandvalet and Alain Rakotomamonjy, Idiap-RR-59-2008

An Information Theoretic Approach to Speaker Diarization of Meeting Data, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-58-2008

Role Recognition for Meeting Participants: an Approach Based on Lexical Information and Social Network Analysis, N. P. Garg, Sarah Favre, Hugues Salamin, D. Hakkani Tür and Alessandro Vinciarelli, Idiap-RR-57-2008

Continuous Brain-Actuated Control of an Intelligent Wheelchair by Human EEG, Ferran Galán, Marnix Nuttin, Dirk Vanhooydonck, Eileen Lew, Pierre W. Ferrez, Johan Philips and José del R. Millán, Idiap-RR-53-2008

Recognition of Anticipatory Behavior from Human EEG, Gangadhar Garipelli, Ricardo Chavarriaga and José del R. Millán, Idiap-RR-52-2008

Predictive Models for Music, Jean-François Paiement, Yves Grandvalet and Samy Bengio, Idiap-RR-51-2008

Probabilistic Models for Melodic Prediction, Jean-François Paiement, Samy Bengio and Douglas Eck, Idiap-RR-50-2008

What Did You Do Today? Discovering Daily Routines from Large-Scale Mobile Data, Katayoun Farrahi and Daniel Gatica-Perez, Idiap-RR-49-2008

Analyzing Interactions Between Navigation Strategies Using a Computational Model of Action Selection, Laurent Dollé, Mehdi Khamassi, Benoît Girard, Agnès Guillot and Ricardo Chavarriaga, Idiap-RR-48-2008

Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-47-2008

Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, Nicolas Scaringella, Idiap-RR-46-2008

Fast Approximate Spoken Term Detection from Sequence of Phonemes, Joel Praveen Pinto, Igor Szoke, S. R. Mahadeva Prasanna and Hynek Hermansky, Idiap-RR-45-2008

Hilbert Envelope Based Features for Far-Field Speech Recognition, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-42-2008

Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-41-2008

Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, Idiap-RR-40-2008

Enhanced Phone Posteriors for Improving Speech Recognition Systems, Hamed Ketabdar and Hervé Bourlard, Idiap-RR-39-2008

understanding metro station usage using closed circuit television cameras analysis, C. Carincotte, M. Hick, Xavier Naturel, Jean-Marc Odobez, Jian Yao, A. Bastide and B. Corbucci, Idiap-RR-38-2008

Asynchronous detection and classification of oscillatory brain activity, Ricardo Chavarriaga, Ferran Galán and José del R. Millán, Idiap-RR-36-2008

Inference in Switching Linear Dynamical Systems Applied to Noise Robust Speech Recognition of Isolated Digits, Bertrand Mesot, Idiap-RR-35-2008

Machine Learning for Information Retrieval, David Grangier, Idiap-RR-34-2008

A Distance Model for Rhythms, Jean-François Paiement, Yves Grandvalet, Samy Bengio and Douglas Eck, Idiap-RR-33-2008

Discovering Human Routines from Cell Phone Data with Topic Models, Katayoun Farrahi and Daniel Gatica-Perez, Idiap-RR-32-2008

Discriminatove Keyword Spotting, Joseph Keshet, David Grangier and Samy Bengio, Idiap-RR-31-2008

The Projectron: a Bounded Kernel-Based Perceptron, Francesco Orabona, Joseph Keshet and Barbara Caputo, Idiap-RR-30-2008

Adaptive Beamforming with a Maximum Negentropy Criterion, Kenichi Kumatani, John McDonough, Barbara Rauch, Philip N. Garner, Weifeng Li and John Dines, Idiap-RR-29-2008

Characterizing the EEG Correlates of Exploratory Behavior, Nicolas Bourdaud, Ricardo Chavarriaga, Ferran Galán and José del R. Millán, Idiap-RR-28-2008

Recognition and Understanding of Meetings Overview of the European AMI and AMIDA Projects, Hervé Bourlard and Steve Renals, Idiap-RR-27-2008

Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-26-2008

Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, G. S. V. S. Sivaram and Hynek Hermansky, Idiap-RR-25-2008

Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, G. S. V. S. Sivaram and Hynek Hermansky, Idiap-RR-24-2008

A Data-driven Approach to Speech/Non-speech Detection, Sree Hari Krishnan Parthasarathi and Hynek Hermansky, Idiap-RR-23-2008

Exploiting contextual information for speech/non-speech detection, Sree Hari Krishnan Parthasarathi and Hynek Hermansky, Idiap-RR-22-2008

Exploiting temporal context for speech/non-speech detection, Sree Hari Krishnan Parthasarathi, Petr Motlicek and Hynek Hermansky, Idiap-RR-21-2008

Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, Joel Praveen Pinto and Hynek Hermansky, Idiap-RR-20-2008

Silence Models in Weighted Finite-State Transducers, Philip N. Garner, Idiap-RR-19-2008

Hilbert Envelope Based Specto-Temporal Features for Phoneme Recognition in Telephone Speech, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-18-2008

Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, Sriram Ganapathy, Samuel Thomas and Hynek Hermansky, Idiap-RR-17-2008

Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, Idiap-RR-16-2008

Posterior Features Applied to Speech Recognition Tasks with Limited Training Data, Guillermo Aradilla, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-15-2008

Using KL-based Acoustic Models in a Large Vocabulary Recognition Task, Guillermo Aradilla, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-14-2008

Reverse Correlation for analyzing MLP Posterior Features in ASR, Joel Praveen Pinto, G. S. V. S. Sivaram and Hynek Hermansky, Idiap-RR-13-2008

On the Combination of Auditory and Modulation Frequency Channels for ASR applications, Fabio Valente and Hynek Hermansky, Idiap-RR-12-2008

A Neural Network based Regression Approach for Recognizing Simultaneous Speech, Weifeng Li, Kenichi Kumatani, John Dines, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-10-2008

Neural Network based Regression for Robust Overlapping Speech Recognition using Microphone Arrays, Weifeng Li, John Dines, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-09-2008

Predicting the dominant clique in meetings through fusion of nonverbal cues, Dinesh Babu Jayagopi, Hayley Hung, Chuohao Yeo and Daniel Gatica-Perez, Idiap-RR-08-2008

Maximum Negentropy Beamforming, Kenichi Kumatani, John McDonough, Dietrich Klakow, Philip N. Garner and Weifeng Li, Idiap-RR-07-2008

Adaptive Beamforming with a Maximum Negentropy Criterion, Kenichi Kumatani, John McDonough, Dietrich Klakow, Philip N. Garner and Weifeng Li, Idiap-RR-06-2008

Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-05-2008

Detecting queues at vending machines: a statistical layered approach, Xavier Naturel and Jean-Marc Odobez, Idiap-RR-04-2008

Analyzing Flickr Groups, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-03-2008

Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition, Kenichi Kumatani, John McDonough, Stefan Schacht, Dietrich Klakow, Philip N. Garner and Weifeng Li, Idiap-RR-02-2008

A Comparative Psychophysical and EEG Study of Different Feedback Modalities for HRI, Xavier Perrin, Ricardo Chavarriaga, Céline Ray, Roland Siegwart and José del R. Millán, Idiap-RR-78-2007

Filter Bank Design based on Minimization of Individual Aliasing Terms for Minimum Mutual Information Subband Adaptive Beamforming, Kenichi Kumatani, John McDonough, Stefan Schacht, Dietrich Klakow, Philip N. Garner and Weifeng Li, Idiap-RR-77-2007

Hierarchical Penalization, Marie Szafranski, Yves Grandvalet and Pierre Morizet-Mahoudeaux, Idiap-RR-76-2007

Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-75-2007

Adaptive Beamforming with a Minimum Mutual Information Criterion, Kenichi Kumatani, Uwe Mayer, Tobias Gehrig, Emilian Stoimenov, John McDonough and Matthias Wölfel, Idiap-RR-74-2007

Minimum Mutual Information Beamforming for Simultaneous Active Speakers, Kenichi Kumatani, Uwe Mayer, Tobias Gehrig, Emilian Stoimenov, John McDonough and Matthias Wölfel, Idiap-RR-73-2007

Effective post-processing for single-channel frequency-domain speech enhancement, Weifeng Li, Idiap-RR-71-2007

A Generative Model for Rhythms, Jean-François Paiement, Yves Grandvalet, Samy Bengio and Douglas Eck, Idiap-RR-70-2007

Classifying Materials in the Real World, Barbara Caputo, Eric Hayman, Mario Fritz and Jan-Olof Eklhund, Idiap-RR-69-2007

Fast Human Detection from Videos Using Covariance Features, Jian Yao and Jean-Marc Odobez, Idiap-RR-68-2007

Multi-Layer Background Subtraction Based on Color and Texture, Jian Yao and Jean-Marc Odobez, Idiap-RR-67-2007

LP-TRAPs in all senses, Petr Motlicek, Idiap-RR-66-2007

Exploiting Contextual Information for Improved Phoneme Recognition, Joel Praveen Pinto, B. Yegnanarayana, Hynek Hermansky and Mathew Magimai-Doss, Idiap-RR-65-2007

Discriminative Cue Integration for Medical Image Annotation, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, Idiap-RR-64-2007

On-line Independent Support Vector Machines for Cognitive Systems, Francesco Orabona, Claudio Castellini, Barbara Caputo, Jie Luo and Giulio Sandini, Idiap-RR-63-2007

Daily Routine Classification from Mobile Phone Data, Katayoun Farrahi and Daniel Gatica-Perez, Idiap-RR-62-2007

The use of brain-computer interfacing for ambient intelligence, Gangadhar Garipelli, Ferran Galán, Ricardo Chavarriaga, Pierre W. Ferrez, Eileen Lew and José del R. Millán, Idiap-RR-61-2007

ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIONS USING SPEAKER DIARIZATION STRATEGIES, Hayley Hung, Daniel Gatica-Perez, Yan Huang and Gerald Friedland, Idiap-RR-60-2007

Object Category Detection using Audio-visual Cues, Jie Luo, Barbara Caputo, Alon Zweig, Joerg-Henrik Back and Joern Anemueller, Idiap-RR-58-2007

Human-Centered Computing: Toward a Human Revolution, Alejandro Jaimes, Daniel Gatica-Perez, Nicu Sebe and Thomas S. Huang, Idiap-RR-57-2007

Stationary Features and Cat Detection, Francois Fleuret and Donald Geman, Idiap-RR-56-2007

Robust overlapping speech recognition based on neural networks, Weifeng Li, John Dines and Mathew Magimai-Doss, Idiap-RR-55-2007

MLP-based Log Spectral Energy Mapping for Robust Overlapping Speech Recognition, Weifeng Li, Mathew Magimai-Doss, John Dines and Hervé Bourlard, Idiap-RR-54-2007

Non-linear Spectral Contrast Stretching for In-car Speech Recognition, Weifeng Li and Hervé Bourlard, Idiap-RR-53-2007

A Bayesian Switching Linear Dynamical System for Scale-Invariant robust speech extraction, Bertrand Mesot and David Barber, Idiap-RR-52-2007

COMBINATION OF AGGLOMERATIVE AND SEQUENTIAL CLUSTERING FOR SPEAKER DIARIZATION, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-51-2007

Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-50-2007

The COLD Database, Muhammad Muneeb Ullah, Andrzej Pronobis, Barbara Caputo, Jie Luo and Patric Jensfelt, Idiap-RR-49-2007

Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, Idiap-RR-48-2007

Unsupervised Learning for Information Distillation, Kamand Kamangar, Idiap-RR-47-2007

Recognition and Understanding of Meetings The AMI and AMIDA Projects, Steve Renals, Thomas Hain and Hervé Bourlard, Idiap-RR-46-2007

Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, Fabio Valente and Hynek Hermansky, Idiap-RR-45-2007

Theoretical Foundations for Large-Margin Kernel-Based Continuous Speech Recognition, Joseph Keshet, Idiap-RR-44-2007

Non-uniform QMF Decomposition for Wide-band Audio Coding based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky and Harinath Garudadri, Idiap-RR-43-2007

Detection and Recognition of Number Sequences in Spoken Utterances, Guillermo Aradilla and Jitendra Ajmera, Idiap-RR-42-2007

Posterior-Based Features and Distances in Template Matching for Speech Recognition, Guillermo Aradilla and Hervé Bourlard, Idiap-RR-41-2007

Role Recognition in Radio Programs using Social Affiliation Networks and Mixtures of Discrete Distributions: an Approach Inspired by Social Cognition, Alessandro Vinciarelli and Sarah Favre, Idiap-RR-40-2007

A Novel Statistical Generative Model Dedicated To Face Recognition, Guillaume Heusch and Sébastien Marcel, Idiap-RR-39-2007

A Discriminative Kernel-based Model to Rank Images from Text Queries, David Grangier and Samy Bengio, Idiap-RR-38-2007

To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, Ricardo Chavarriaga, Pierre W. Ferrez and José del R. Millán, Idiap-RR-37-2007

Mapping Nonverbal Communication into Social Status: Automatic Recognition of Journalists and Non-journalists in Radio News, Alessandro Vinciarelli, Idiap-RR-33-2007

Comparing Different Word Lattice Rescoring Approaches Towards Keyword Spotting, Joel Praveen Pinto, Hervé Bourlard, Zacharie De Greve and Hynek Hermansky, Idiap-RR-32-2007

AGGLOMERATIVE INFORMATION BOTTLENECK FOR SPEAKER DIARIZATION OF MEETINGS DATA, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-31-2007

Broadcast News Story Segmentation Using Social Network Analysis and Hidden Markov Models, Alessandro Vinciarelli and Sarah Favre, Idiap-RR-30-2007

Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, Hayley Hung, Dinesh Babu Jayagopi, Chuohao Yeo, Gerald Friedland, Silèye O. Ba, Jean-Marc Odobez, Kannan Ramchandran, Nikki Mirghafori and Daniel Gatica-Perez, Idiap-RR-29-2007

Significance of Contextual Information in Phoneme Recognition, Joel Praveen Pinto, S. R. Mahadeva Prasanna, B. Yegnanarayana and Hynek Hermansky, Idiap-RR-28-2007

Analysis of Confusion Matrix to Combine Evidence for Phoneme Recognition, S. R. Mahadeva Prasanna, B. Yegnanarayana, Joel Praveen Pinto and Hynek Hermansky, Idiap-RR-27-2007

Bayesian Controller for a Novel Semi-Autonomous Navigation Concept, Xavier Perrin, Ricardo Chavarriaga, Roland Siegwart and José del R. Millán, Idiap-RR-26-2007

Feature Extraction for Multi-class BCI using Canonical Variates Analysis, Ferran Galán, Pierre W. Ferrez, Francesc Oliva, Joan Guàrdia and José del R. Millán, Idiap-RR-23-2007

Keyword Spotting on Word Lattices, De Greve Zacharie and Joel Praveen Pinto, Idiap-RR-22-2007

Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-21-2007

A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, Jean-Marc Odobez and Silèye O. Ba, Idiap-RR-20-2007

Sparse Probabilistic Classifiers, Romain Hérault and Yves Grandvalet, Idiap-RR-19-2007

More Efficiency in Multiple Kernel Learning, Alain Rakotomamonjy, Francis Bach, Stéphane Canu and Yves Grandvalet, Idiap-RR-18-2007

Confidence-based Cue Integration for Visual Place Recognition, Andrzej Pronobis and Barbara Caputo, Idiap-RR-17-2007

Scalable Wide-band Audio Codec based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky and Harinath Garudadri, Idiap-RR-16-2007

Learning the Inter-frame Distance for Discriminative Template-based Keyword Detection, David Grangier and Samy Bengio, Idiap-RR-15-2007

Joint Bi-Modal Face and Speaker Authentication using Explicit Polynomial Expansion, Sébastien Marcel, Idiap-RR-14-2007

Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics, John Dines and Jithendra Vepa, Idiap-RR-13-2007

A study of phoneme and grapheme based context-dependent ASR systems, John Dines and Mathew Magimai-Doss, Idiap-RR-12-2007

Exploiting Phoneme Similarities in Hybrid HMM-ANN Keyword Spotting, Joel Praveen Pinto, Andrew Lovitt and Hynek Hermansky, Idiap-RR-11-2007

On Confusions in a Phoneme Recognizer, Andrew Lovitt, Joel Praveen Pinto and Hynek Hermansky, Idiap-RR-10-2007

Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, Fabio Valente, Jithendra Vepa and Hynek Hermansky, Idiap-RR-09-2007

Hierarchical Neural Networks Feature Extraction for LVCSR system, Fabio Valente, Jithendra Vepa, Christian Plahl, Christian Gollan, Hynek Hermansky and Ralf Schlüter, Idiap-RR-08-2007

Learning the structure of image collections with latent aspect models, Florent Monay, Idiap-RR-06-2007

Truncation Confusion Patterns in Onset Consonants, Andrew Lovitt, Idiap-RR-05-2007

Face Authentication with Salient Local Features and Static Bayesian Network, Guillaume Heusch and Sébastien Marcel, Idiap-RR-04-2007

Biometric Person Authentication IS A Multiple Classifier Problem, Samy Bengio and Johnny Mariéthoz, Idiap-RR-03-2007

Dynamical Dirichlet Mixture Model, Le Chen, David Barber and Jean-Marc Odobez, Idiap-RR-02-2007

Face Detection and Verification using Local Binary Patterns, Yann Rodriguez, Idiap-RR-79-2006

Probabilistic Graphical Models for Human Interaction Analysis, Dong Zhang, Idiap-RR-78-2006

Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays, Guillaume Lathoud, Idiap-RR-77-2006

Machine Learning Approaches to Text Representation using Unlabeled Data, Mikaela Keller, Idiap-RR-76-2006

Semantic Segmentation of Radio Programs Using Social Network Analysis and Duration Distribution Modeling, Alessandro Vinciarelli, F. Fernàndez and Sarah Favre, Idiap-RR-75-2006

Observations on Multi-Band Asynchrony in Distant Speech Recordings, Guillaume Lathoud, Idiap-RR-74-2006

Two-Handed Gestures for Human-Computer Interaction, Agnès Just, Idiap-RR-73-2006

Discrmininant Models for Text-independent Speaker Verification, Johnny Mariéthoz, Idiap-RR-70-2006

Master Thesis: Integration of the Harmonic plus Noise Model (HNM) into the Hidden Markov Model-Based Speech Synthesis System (HTS), Coralie Hemptinne, Idiap-RR-69-2006

Identifying unexpected words using in-context and out-of-context phoneme posteriors, Hamed Ketabdar and Hynek Hermansky, Idiap-RR-68-2006

Posterior Based Keyword Spotting with A Priori Thresholds, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, Idiap-RR-67-2006

SVM-based Transfer of Visual Knowledge Across Robotic Platforms, Jie Luo, Andrzej Pronobis and Barbara Caputo, Idiap-RR-65-2006

Model Adaptation for Sentence Unit Segmentation from Speech, Sébastien Cuendet, Idiap-RR-64-2006

Analyzing Group Interactions in Conversations: a Review, Daniel Gatica-Perez, Idiap-RR-63-2006

A Generalized Dynamic Composition Algorithm of Weighted Finite State Transducers for Large Vocabulary Speech Recognition, Octavian Cheng, John Dines and Mathew Magimai-Doss, Idiap-RR-62-2006

Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, Fabio Valente and Hynek Hermansky, Idiap-RR-61-2006

An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, Idiap-RR-60-2006

Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, Petr Motlicek, Vijay Ullal and Hynek Hermansky, Idiap-RR-58-2006

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, Hari Krishna Maganti, Petr Motlicek and Daniel Gatica-Perez, Idiap-RR-57-2006

Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations, Artem Peregoudov, Alessandro Vinciarelli and Hervé Bourlard, Idiap-RR-56-2006

A Bayesian Alternative to Gain Adaptation in Autoregressive Hidden Markov Models, Bertrand Mesot and David Barber, Idiap-RR-55-2006

A supervised learning approach based on STDP and polychronization in spiking neuron networks, Hélène Paugam-Moisy, R. Martinez and Samy Bengio, Idiap-RR-54-2006

Melanoma Recognition using Kernel Classifiers, Elisabetta La Torre, Barbara Caputo and Tatiana Tommasi, Idiap-RR-53-2006

Incremental Learning for Place Recognition in Dynamic Environments, Jie Luo, Andrzej Pronobis, Barbara Caputo and Patric Jensfelt, Idiap-RR-52-2006

The more you learn, the less you store: memory\--controlled incremental SVM, Andrzej Pronobis and Barbara Caputo, Idiap-RR-51-2006

Unified Inference for Variational Bayesian Linear Gaussian State-Space Models, David Barber and Silvia Chiappa, Idiap-RR-50-2006

Detection and Application of Influence Rankings in Small Group Meetings, Rutger Rienks, Dong Zhang, Daniel Gatica-Perez and Wilfried Post, Idiap-RR-49-2006

Analysis and Classification of EEG Signals using Probabilistic Models for Brain Computer Interfaces, Silvia Chiappa, Idiap-RR-48-2006

[URL]

Robust-to-Illumination Face Localisation using Active Shape Models and Local Binary Patterns, Sébastien Marcel, Jean Keomany and Yann Rodriguez, Idiap-RR-47-2006

Audio Coding Based on Long Temporal Segments: Experiments With Quantization of Excitation Signal, Vijay Ullal and Petr Motlicek, Idiap-RR-46-2006

A Multitask Learning Approach to Document Representation using Unlabeled Data, Mikaela Keller and Samy Bengio, Idiap-RR-44-2006

Detecting Intentional Mental Transitions in an Asynchronous BCI, Ferran Galán, Francesc Oliva, Joan Guàrdia, Pierre W. Ferrez and José del R. Millán, Idiap-RR-43-2006

Recognizing People's Focus of Attention from Head Poses: a Study, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-42-2006

Exploring Contextual Information in a Layered Framework for Group Action Recognition, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, Idiap-RR-41-2006

Tracking Attention for Multiple People: Wandering Visual Focus of Attention Estimation, Kevin C. Smith, Silèye O. Ba, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-40-2006

Detecting Abandoned Luggage Items in a Public Space, Kevin C. Smith, Pedro Quelhas and Daniel Gatica-Perez, Idiap-RR-39-2006

Multi-Person Tracking in Meetings: A Comparative Study, Kevin C. Smith, Sascha Schreiber, Vítezslav Beran, Igor Potúcek, Gerhard Rigoll and Daniel Gatica-Perez, Idiap-RR-38-2006

2D Multi-Person Tracking: A Comparative Study in AMI Meetings, Kevin C. Smith, Sascha Schreiber, Vítezslav Beran, Igor Potúcek, Gerhard Rigoll and Daniel Gatica-Perez, Idiap-RR-37-2006

Investigating Lexical Substitution Scoring for Subtitle Generation, Oren Glickman, Ido Dagan, Mikaela Keller, Samy Bengio and Walter Daelemans, Idiap-RR-36-2006

Role Recognition in Broadcast News Using Social Network Analysis and Duration Distribution Modeling, Alessandro Vinciarelli, Idiap-RR-35-2006

On the Recent Use of Local Binary Patterns for Face Authentication, Sébastien Marcel, Yann Rodriguez and Guillaume Heusch, Idiap-RR-34-2006

A Neural Network to Retrieve Images from Text Queries, David Grangier and Samy Bengio, Idiap-RR-33-2006

Learning to Retrieve Images from Text Queries with a Discriminative Model, David Grangier, Florent Monay and Samy Bengio, Idiap-RR-32-2006

Indexation de Documents Manuscrits, Alessandro Vinciarelli, Idiap-RR-31-2006

Audio Coding Based on Long Temporal Contexts, Petr Motlicek, Hynek Hermansky, Harinath Garudadri and Naveen Srinivasamurthy, Idiap-RR-30-2006

Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, Hari Krishna Maganti and Daniel Gatica-Perez, Idiap-RR-29-2006

Multi-stream Processing for Noise Robust Speech Recognition, Hemant Misra, Idiap-RR-28-2006

Sociometry Based Multiparty Audio Recordings Summarization, Alessandro Vinciarelli, Idiap-RR-27-2006

Further Applications of Sector-Based Detection and Short-Term Clustering, Guillaume Lathoud, Idiap-RR-26-2006

Estimating the Confidence Interval of Expected Performance Curve in Biometric Authentication Using Joint Bootstrap, Norman Poh and Samy Bengio, Idiap-RR-25-2006

Speech Enhancement and Recognition in Meetings with an Audio-Visual Sensor Array, Hari Krishna Maganti, Daniel Gatica-Perez and Iain A. McCowan, Idiap-RR-24-2006

Using Posterior-Based Features in Template Matching for Speech Recognition, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, Idiap-RR-23-2006

The segmentation of multi-channel meeting recordings for automatic speech recognition, John Dines, Jithendra Vepa and Thomas Hain, Idiap-RR-22-2006

Juicer: A Weighted Finite-State Transducer speech decoder, Darren Moore, John Dines, Mathew Magimai-Doss, Jithendra Vepa, Octavian Cheng and Thomas Hain, Idiap-RR-21-2006

Discriminant linear processing of time-frequency plane, Fabio Valente and Hynek Hermansky, Idiap-RR-20-2006

Infinite Models for Speaker Clustering, Fabio Valente, Idiap-RR-19-2006

Bi-Modal Face and Speech Authentication: a BioLogin Demonstration System, Sébastien Marcel, Johnny Mariéthoz, Yann Rodriguez and Fabien Cardinaux, Idiap-RR-18-2006

Natural Scene Image Modeling using Color and Texture Visterms., Pedro Quelhas and Jean-Marc Odobez, Idiap-RR-17-2006

Online Classifier Adaptation in Brain-Computer Interfaces, Anna Buttfield and José del R. Millán, Idiap-RR-16-2006

A Discriminative Approach for the Retrieval of Images from Text Queries, David Grangier, Florent Monay and Samy Bengio, Idiap-RR-15-2006

Discriminative Kernel-Based Phoneme Sequence Recognition, Joseph Keshet, Samy Bengio, Dan Chazan, Shai Shalev-Shwartz and Yoram Singer, Idiap-RR-14-2006

Online statistical estimation for vehicle control, Christos Dimitrakakis, Idiap-RR-13-2006

Nearly optimal exploration-exploitation decision thresholds, Christos Dimitrakakis, Idiap-RR-12-2006

Spiking Neuron Networks A survey, Hélène Paugam-Moisy, Idiap-RR-11-2006

A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-10-2006

Unsupervised Spectral Subtraction for Noise-Robust ASR on Unknown Transmission Channels, Guillaume Lathoud, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-09-2006

Switching Linear Dynamical Systems for Noise Robust Speech Recognition, Bertrand Mesot and David Barber, Idiap-RR-08-2006

Active Shape Models Using Local Binary Patterns, Jean Keomany and Sébastien Marcel, Idiap-RR-07-2006

Face Authentication Using Adapted Local Binary Pattern Histograms, Yann Rodriguez and Sébastien Marcel, Idiap-RR-06-2006

Speech Coding based on Spectral Dynamics, Petr Motlicek, Hynek Hermansky, Harinath Garudadri and Naveen Srinivasamurthy, Idiap-RR-05-2006

Revisiting Doddington's Zoo: A Systematic Method to Assess User-dependent Variabilities, Norman Poh, Samy Bengio and Arun Ross, Idiap-RR-04-2006

Hand Posture Classification and Recognition using the Modified Census Transform, Agnès Just, Yann Rodriguez and Sébastien Marcel, Idiap-RR-02-2006

Towards using slide information to enhance speech transcription of meetings, Artem Peregoudov, Alessandro Vinciarelli and Hervé Bourlard, Idiap-RR-01-2006

Using more informative posterior probabilities for speech recognition, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, Idiap-RR-91-2005

Using Auxiliary Sources of Knowledge for Automatic Speech Recognition, Mathew Magimai-Doss, Idiap-RR-90-2005

A Generative Model for Music Transcription, A. T. Cemgil, B. Kappen and David Barber, Idiap-RR-89-2005

Optimal Spike-Timing Dependent Plasticity for Precise Action Potential Firing in Supervised Learing, J-P. Pfister, T. Toyoizumi, David Barber and W. Gerstner, Idiap-RR-88-2005

Efficient Kalman Smoothing for Harmonic State-Space Models, David Barber, Idiap-RR-87-2005

Probabilistic Tagging of Unstructured Genealogical Records, Mike Perrow and David Barber, Idiap-RR-86-2005

Face Authentication Based on Local Features and Generative Models, Fabien Cardinaux, Idiap-RR-85-2005

Bayesian Factorial Linear Gaussian State-Space Models for Biosignal Decomposition, Silvia Chiappa and David Barber, Idiap-RR-84-2005

The ami meeting corpus: a pre-announcement, Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Maël Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, Iain A. McCowan, Wilfried Post, Dennis Reidsma and Pierre Wellner, Idiap-RR-82-2005

Person Authentication using Brainwaves (EEG) and Maximum A Posteriori Model Adaptation, Sébastien Marcel and José del R. Millán, Idiap-RR-81-2005

Tracking the Multi Person Wandering Visual Focus of Attention, Kevin C. Smith, Silèye O. Ba, Daniel Gatica-Perez and Jean-Marc Odobez, Idiap-RR-80-2005

Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition, Francesco Camastra, Marco Spinetti and Alessandro Vinciarelli, Idiap-RR-79-2005

Sociometry Based Multiparty Audio Recordings Segmentation, Alessandro Vinciarelli, Idiap-RR-78-2005

A Kernel Trick For Sequences Applied to Text-Independent Speaker Verification Systems, Johnny Mariéthoz and Samy Bengio, Idiap-RR-77-2005

Local Binary Patterns as an Image Preprocessing for Face Authentication, Guillaume Heusch, Yann Rodriguez and Sébastien Marcel, Idiap-RR-76-2005

Kernelized Infomax Clustering, Felix Agakov and David Barber, Idiap-RR-73-2005

Stable Directed Belief Propagation in Gaussian DAGs using the auxiliary variable trick, David Barber and Peter Sollich, Idiap-RR-72-2005

Construction and comparison of approximations for switching linear gaussian state space models, David Barber, Idiap-RR-71-2005

Writer Identification for Smart Meeting Room Systems, Marcus Liwicki, Andreas Schlapbach, Horst Bunke, Samy Bengio, Johnny Mariéthoz and Jonas Richiardi, Idiap-RR-70-2005

The Multi-Channel Wall Street Journal Audio Visual Corpus (MC-WSJ-AV): Specification and Initial Experiments, Mike Lincoln, Iain A. McCowan, Jithendra Vepa and Hari Krishna Maganti, Idiap-RR-69-2005

Finding groups of people in Google news, Dhiraj Joshi and Daniel Gatica-Perez, Idiap-RR-68-2005

A Discriminative Decoder for the Recognition of Phoneme Sequences, David Grangier and Samy Bengio, Idiap-RR-67-2005

Improving Speech Recognition Using a Data-Driven Approach, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, Idiap-RR-66-2005

Using Pitch as Prior Knowledge in Template-Based Speech Recognition, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, Idiap-RR-65-2005

Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition, Petr Fousek and Hynek Hermansky, Idiap-RR-64-2005

The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), Hynek Hermansky, Petr Fousek and Mikko Lehtonen, Idiap-RR-63-2005

Multi-stream ASR: Oracle Test and Embedded Training, Hemant Misra, Jithendra Vepa and Hervé Bourlard, Idiap-RR-62-2005

Can a Professional Imitator Fool a GMM-Based Speaker Verification System?, Johnny Mariéthoz and Samy Bengio, Idiap-RR-61-2005

Performance Generalization in Biometric Authentication Using Joint User-Specific and Sample Bootstraps, Norman Poh, Alvin Martin and Samy Bengio, Idiap-RR-60-2005

Using Chimeric Users to Construct Fusion Classifiers in Biometric Authentication Tasks: An Investigation, Norman Poh and Samy Bengio, Idiap-RR-59-2005

Chord Representations for Probabilistic Models, Jean-François Paiement, Douglas Eck and Samy Bengio, Idiap-RR-58-2005

A Probabilistic Model for Chord Progressions, Jean-François Paiement, Douglas Eck and Samy Bengio, Idiap-RR-57-2005

Modeling semantic aspects for cross-media image indexing, Florent Monay and Daniel Gatica-Perez, Idiap-RR-56-2005

Measuring the Performance of Face Localization Systems, Yann Rodriguez, Fabien Cardinaux, Samy Bengio and Johnny Mariéthoz, Idiap-RR-53-2005

Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, Guillaume Lathoud, Mathew Magimai-Doss, Jean-Marc Odobez and Hervé Bourlard, Idiap-RR-52-2005

Modeling Interactions from Email Communication, Dong Zhang, Daniel Gatica-Perez, Deb Roy and Samy Bengio, Idiap-RR-51-2005

Extracting Information from Multimedia Meeting Collections, Daniel Gatica-Perez, Dong Zhang and Samy Bengio, Idiap-RR-50-2005

Multiview Face Detection, Tiffany Sauquet, Yann Rodriguez and Sébastien Marcel, Idiap-RR-49-2005

Learning influence among interacting Markov chains, Dong Zhang, Daniel Gatica-Perez, Samy Bengio and Deb Roy, Idiap-RR-48-2005

Continuous Microphone Array Speech Recognition on Wall Street Journal Corpus, Hari Krishna Maganti, Jithendra Vepa and Hervé Bourlard, Idiap-RR-47-2005

Efficient Diffusion-based Illumination Normalization for Face Verification, Guillaume Heusch, Fabien Cardinaux and Sébastien Marcel, Idiap-RR-46-2005

Spectral Entropy Feature in Multi-stream for Robust ASR, Hemant Misra and Hervé Bourlard, Idiap-RR-45-2005

Compensating User-Specific Information with User-Independent Information in Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-44-2005

Towards Explaining the Success (Or Failure) of Fusion in Biometric Authentication, Norman Poh and Samy Bengio, Idiap-RR-43-2005

Unsupervised Spectral Substraction for Noise-Robust ASR, Guillaume Lathoud, Mathew Magimai-Doss, Bertrand Mesot and Hervé Bourlard, Idiap-RR-42-2005

Hierarchical approach for spotting keywords, Mikko Lehtonen, Idiap-RR-41-2005

A Thousand Words in a Scene, Pedro Quelhas, Jean-Marc Odobez, Daniel Gatica-Perez and Tinne Tuytelaars, Idiap-RR-40-2005

Benchmarking Non-Parametric Statistical Tests, Mikaela Keller, Samy Bengio and Siew Yeung Wong, Idiap-RR-38-2005

Harmonic Plus Noise Model for Concatenative Speech Synthesis, D. Vandromme, Idiap-RR-37-2005

Application of Information Retrieval Technologies to Presentation Slides, Alessandro Vinciarelli and Jean-Marc Odobez, Idiap-RR-36-2005

A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-35-2005

Subjective Evaluation of Join Cost and Smoothing Methods for Unit Selection Speech Synthesis, Jithendra Vepa and Simon King, Idiap-RR-34-2005

A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, Jean-François Paiement, Douglas Eck, Samy Bengio and David Barber, Idiap-RR-33-2005

A Kernel Classifier for Distributions, Alexei Pozdnoukhov and Samy Bengio, Idiap-RR-32-2005

Multimodal Integration for Meeting Group Action Segmentation and Recognition, Marc Al-Hames, Alfred Dielmann, Daniel Gatica-Perez, Stephan Reiter, Steve Renals and Dong Zhang, Idiap-RR-31-2005

Integrating co-occurrence and spatial contexts on patch-based scene segmentation, Florent Monay, Pedro Quelhas, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-30-2005

Gradient estimates of return, Christos Dimitrakakis and Samy Bengio, Idiap-RR-29-2005

Joint Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba, Idiap-RR-28-2005

Audio-visual probabilistic tracking of multiple speakers in meetings, Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez and Iain A. McCowan, Idiap-RR-27-2005

A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification, Yves Grandvalet, Johnny Mariéthoz and Samy Bengio, Idiap-RR-26-2005

Hierarchical Multi-Stream Posterior Based Speech Recognition System, Hamed Ketabdar, Hervé Bourlard and Samy Bengio, Idiap-RR-25-2005

Two-Handed Gesture Recognition, Agnès Just and Sébastien Marcel, Idiap-RR-24-2005

Developing and Enhancing Posterior Based Speech Recognition Systems, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, Idiap-RR-23-2005

Joint Training of Multi-Stream HMMs, Samy Bengio, Idiap-RR-22-2005

Inferring Document Similarity from Hyper-links, David Grangier and Samy Bengio, Idiap-RR-21-2005

Can Chimeric Persons Be Used in Multimodal Biometric Authentication Experiments?, Norman Poh and Samy Bengio, Idiap-RR-20-2005

On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, Vivek Tyagi, Hervé Bourlard and Christian Wellekens, Idiap-RR-19-2005

Multi-resolution RASTA filtering for TANDEM-based ASR, Hynek Hermansky and Petr Fousek, Idiap-RR-18-2005

Local Features and 1D-HMMs for Fast and Robust Face Authentication, Fabien Cardinaux, Idiap-RR-17-2005

Improving Continuous Speech Recognition System Performance with Grapheme Modelling, Mathew Magimai-Doss, John Dines, Hervé Bourlard and Hynek Hermansky, Idiap-RR-16-2005

Semi-supervised Meeting Event Recognition with Adapted HMMs, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, Idiap-RR-15-2005

Constructing visual models with a latent space approach, Florent Monay, Pedro Quelhas, Daniel Gatica-Perez and Jean-Marc Odobez, Idiap-RR-14-2005

A Frequency-Domain Silence Noise Model, Guillaume Lathoud, Mathew Magimai-Doss and Bertrand Mesot, Idiap-RR-13-2005

A Neural Network for Text Representation, Mikaela Keller and Samy Bengio, Idiap-RR-12-2005

OCR Based Slide Retrieval, Nabil Daddaoua, Jean-Marc Odobez and Alessandro Vinciarelli, Idiap-RR-11-2005

Spectral Entropy Feature in Full-Combination Multi-stream for Robust ASR, Hemant Misra and Hervé Bourlard, Idiap-RR-10-2005

On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, Vivek Tyagi, Hervé Bourlard and Christian Wellekens, Idiap-RR-09-2005

Generative Temporal ICA for Classification in Asynchronous BCI Systems, Silvia Chiappa and David Barber, Idiap-RR-08-2005

Sports Event Recognition using Layered HMMs, Mark Barnard and Jean-Marc Odobez, Idiap-RR-07-2005

Construction and comparison of approximations for switching linear gaussian state space models, David Barber and Bertrand Mesot, Idiap-RR-06-2005

Evaluation of Multiple Cues Head Pose Tracking Algorithm in Indoor Environments, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-05-2005

Multi Channel Sequence Processing, Samy Bengio and Hervé Bourlard, Idiap-RR-04-2005

Speech Acquisition in Meetings with an Audio-Visual Sensor Array, Iain A. McCowan, Maganti Hari Krishna, Daniel Gatica-Perez, Darren Moore and Silèye O. Ba, Idiap-RR-03-2005

A Meeting Browser Evaluation Test, Pierre Wellner, Mike Flynn, Simon Tucker and Steve Whittaker, Idiap-RR-02-2005

EER of Fixed and Trainable Fusion Classifiers: A Theoretical Study with Application to Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-01-2005

A Stable Switching Kalman Smoother, David Barber, Idiap-RR-89-2004

Variational Information Maximization in Gaussian Channels, Felix Agakov and David Barber, Idiap-RR-88-2004

The Auxiliary Variable Trick for deriving Kalman Smoothers, David Barber, Idiap-RR-87-2004

An Auxiliary Variational Method, Felix Agakov and David Barber, Idiap-RR-86-2004

Variational Information Maximization for Population Coding, David Barber, Idiap-RR-85-2004

Stochastic techniques in deriving perceptual knowledge, Hynek Hermansky, Idiap-RR-84-2004

Effect of Segmentation Method on Video Retrieval Performance, David Grangier and Alessandro Vinciarelli, Idiap-RR-83-2004

Effect of Recognition Errors on Text Clustering, David Grangier and Alessandro Vinciarelli, Idiap-RR-82-2004

Semi-supervised Adapted HMMs for Unusual Event Detection, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, Idiap-RR-80-2004

Modeling Scenes with Local Descriptors and Latent Aspects, Pedro Quelhas, Florent Monay, Jean-Marc Odobez, Daniel Gatica-Perez, Tinne Tuytelaars and Luc Van Gool, Idiap-RR-79-2004

Face Authentication using Client-specific Matching Pursuit, Sébastien Marcel, P. Jost, P. Vandergheynst and Jean-Philippe Thiran, Idiap-RR-78-2004

EEG Classification using Generative Independent Component Analysis, Silvia Chiappa and David Barber, Idiap-RR-77-2004

On Performance / Robustness / Complexity Trade-Offs in Face Verification, Conrad Sanderson, Fabien Cardinaux and Samy Bengio, Idiap-RR-74-2004

On the Use of Information Retrieval Measures for Speech Recognition Evaluation, Iain A. McCowan, Darren Moore, John Dines, Daniel Gatica-Perez, Mike Flynn, Pierre Wellner and Hervé Bourlard, Idiap-RR-73-2004

Estimates of Parameter Distributions for Optimal Action Selection, Christos Dimitrakakis and Samy Bengio, Idiap-RR-72-2004

Tracking People in Meetings with Particles, Daniel Gatica-Perez, Jean-Marc Odobez, Silèye O. Ba, Kevin C. Smith and Guillaume Lathoud, Idiap-RR-71-2004

Nonlinear Feature Transformations for Noise Robust Speech Recognition, Shajith Ikbal, Idiap-RR-70-2004

A Study of the Effects of Score Normalisation Prior to Fusion in Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-69-2004

A Novel Approach to Combining Client-Dependent and Confidence Information in Multimodal Biometric, Norman Poh and Samy Bengio, Idiap-RR-68-2004

Sector-Based Detection for Hands-Free Speech Enhancement in Cars, Guillaume Lathoud, Julien Bourgeois and Jürgen Freudenberger, Idiap-RR-67-2004

Multimodal Multispeaker Probabilistic Tracking in Meetings, Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez and Iain A. McCowan, Idiap-RR-66-2004

Improving Fusion with Margin-Derived Confidence In Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-63-2004

A Unified Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification, Johnny Mariéthoz and Samy Bengio, Idiap-RR-62-2004

Motion likelihood and proposal modeling in Model-Based Stochastic Tracking, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-61-2004

PLP$^2$: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns, Marios Athineos, Hynek Hermansky and Daniel P. W. Ellis, Idiap-RR-60-2004

LP-TRAP: Linear predictive temporal patterns, Marios Athineos, Hynek Hermansky and Daniel P. W. Ellis, Idiap-RR-59-2004

Towards using hierarchical posteriors for flexible automatic speech recognition systems, Hervé Bourlard, Samy Bengio, Mathew Magimai-Doss, Qifeng Zhu, Bertrand Mesot and Nelson Morgan, Idiap-RR-58-2004

Are two Classifiers performing equally? A treatment using Bayesian Hypothesis Testing, David Barber, Idiap-RR-57-2004

Invariances in Kernel Methods: From Samples to Objects, Alexei Pozdnoukhov and Samy Bengio, Idiap-RR-56-2004

Pseudo-Syntactic Language Modeling for Disfluent Speech Recognition, Michael McGreevy, Idiap-RR-55-2004

A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, Guillaume Lathoud and Mathew Magimai-Doss, Idiap-RR-54-2004

A Meeting Browser Evaluation Test, Pierre Wellner, Mike Flynn, Simon Tucker and Steve Whittaker, Idiap-RR-53-2004

Improving Single Modal and Multimodal Biometric Authentication Using F-ratio Client-Dependent Normalisation, Norman Poh and Samy Bengio, Idiap-RR-52-2004

Detecting Group Interest-level in Meetings, Daniel Gatica-Perez, Iain A. McCowan, Dong Zhang and Samy Bengio, Idiap-RR-51-2004

HMM/ANN Based Spectral Peak Location Estimation for Noise Robust Speech Recognition, Shajith Ikbal, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-50-2004

Boosting word error rates, Christos Dimitrakakis and Samy Bengio, Idiap-RR-49-2004

Phoneme vs Grapheme Based Automatic Speech Recognition, Mathew Magimai-Doss, John Dines, Hervé Bourlard and Hynek Hermansky, Idiap-RR-48-2004

An Investigation of F-ratio Client-Dependent Normalisation on Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-46-2004

Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, Norman Poh and Samy Bengio, Idiap-RR-44-2004

Evidences of Equal Error Rate Reduction in Biometric Authentication Fusion, Norman Poh and Samy Bengio, Idiap-RR-43-2004

Large Scale Machine Learning, Ronan Collobert, Idiap-RR-42-2004

User-Customized Password Speaker Verification Using Multiple Reference and Background Models, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-41-2004

Phase AutoCorrelation (PAC) Features for Noise Robust ASR, Shajith Ikbal, Hemant Misra, Hervé Bourlard and Hynek Hermansky, Idiap-RR-40-2004

HMM and IOHMM for the Recognition of Mono- and Bi-Manual 3D Hand Gestures, Agnès Just, O. Bernier and Sébastien Marcel, Idiap-RR-39-2004

User Authentication via Adapted Statistical Models of Face Images, Fabien Cardinaux, Conrad Sanderson and Samy Bengio, Idiap-RR-38-2004

Multi-resolution Spectral Entropy Based Feature for Robust ASR, Hemant Misra, Shajith Ikbal, Sunil Sivadas and Hervé Bourlard, Idiap-RR-37-2004

On Local Features for Face Verification, Marc Saban and Conrad Sanderson, Idiap-RR-36-2004

Robust Audio Segmentation, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-35-2004

{S}ignificance {T}ests for {\em Bizarre} {M}easures in 2-{C}lass {C}lassification {T}asks, Mikaela Keller, Johnny Mariéthoz and Samy Bengio, Idiap-RR-34-2004

Modeling Individual and Group Actions in Meetings With Layered HMMs, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, Idiap-RR-33-2004

Browsing Recorded Meetings with Ferret, Pierre Wellner, Mike Flynn and Maël Guillemot, Idiap-RR-32-2004

Noisy Text Clustering, David Grangier and Alessandro Vinciarelli, Idiap-RR-31-2004

PLSA-based Image Auto-Annotation: Constraining the Latent Space, Florent Monay and Daniel Gatica-Perez, Idiap-RR-30-2004

New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, Petr Fousek, Petr Svojanovsky, Frantisek Grezl and Hynek Hermansky, Idiap-RR-29-2004

AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, Guillaume Lathoud, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-28-2004

On the Adequacy of Baseform Pronunciations and Pronunciation Variants, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-27-2004

Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis, Jithendra Vepa and Simon King, Idiap-RR-26-2004

Order Matters: A Distributed Sampling Method for Multi-Object Tracking, Kevin C. Smith, Idiap-RR-25-2004

Multimodal Group Action Clustering in Meetings, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, Idiap-RR-24-2004

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-23-2004

Using RASTA in task independent TANDEM feature extraction, Guillermo Aradilla, John Dines and Sunil Sivadas, Idiap-RR-22-2004

Modelling Auxiliary Features in Tandem Systems, Mathew Magimai-Doss, Todd Andrew Stephenson, Shajith Ikbal and Hervé Bourlard, Idiap-RR-21-2004

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR, Shajith Ikbal, Mathew Magimai-Doss, Hemant Misra and Hervé Bourlard, Idiap-RR-20-2004

Entropy Based Combination of Tandem Representations for Noise Robust ASR, Shajith Ikbal, Hemant Misra, Sunil Sivadas, Hynek Hermansky and Hervé Bourlard, Idiap-RR-19-2004

How Do Correlation and Variance of Base-Experts Affect Fusion in Biometric Authentication Tasks?, Norman Poh and Samy Bengio, Idiap-RR-18-2004

Towards Predicting Optimal Subsets of Base-Experts in Biometric Authentication Task, Norman Poh and Samy Bengio, Idiap-RR-17-2004

A New Speech Recognition Baseline System for Numbers 95 Version 1.3 Based on Torch, Johnny Mariéthoz and Samy Bengio, Idiap-RR-16-2004

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, Guillaume Lathoud and Iain A. McCowan, Idiap-RR-15-2004

Short-Term Spatio-Temporal Clustering of Sporadic and Concurrent Events, Guillaume Lathoud, Iain A. McCowan and Jean-Marc Odobez, Idiap-RR-14-2004

Sequence Classification with Input-Output Hidden Markov Models, Silvia Chiappa and Samy Bengio, Idiap-RR-13-2004

Application of Information Retrieval Techniques to Single Writer Documents, Alessandro Vinciarelli, Idiap-RR-12-2004

Assessing Scene Structuring in Consumer Videos, Daniel Gatica-Perez, Napat Triroj, Jean-Marc Odobez, Alexander Loui and Ming-Ting Sun, Idiap-RR-11-2004

On the Use of Speech and Face Information for Identity Verification, Conrad Sanderson and Kuldip K. Paliwal, Idiap-RR-10-2004

Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, Idiap-RR-09-2004

Effect of Recognition Errors on Information Retrieval Performance, Alessandro Vinciarelli, Idiap-RR-08-2004

Estimating the Quality of Face Localization for Face Verification, Yann Rodriguez, Fabien Cardinaux, Samy Bengio and Johnny Mariéthoz, Idiap-RR-07-2004

Links between Perceptrons, MLPs and SVMs, Ronan Collobert and Samy Bengio, Idiap-RR-06-2004

Theme Topic Mixture Model: A Graphical Model for Document Representation, Mikaela Keller and Samy Bengio, Idiap-RR-05-2004

Statistical Transformation Techniques for Face Verification Using Faces Rotated in Depth, Conrad Sanderson and Samy Bengio, Idiap-RR-04-2004

Noisy Text Categorization, Alessandro Vinciarelli, Idiap-RR-03-2004

Making Retrieval Faster Through Document Clustering, David Grangier and Alessandro Vinciarelli, Idiap-RR-02-2004

Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication, Norman Poh and Samy Bengio, Idiap-RR-01-2004

The Expected Performance Curve, Samy Bengio, Mikaela Keller and Johnny Mariéthoz, Idiap-RR-85-2003

The Expected Performance Curve: a New Assessment Measure for Person Authentication, Samy Bengio and Johnny Mariéthoz, Idiap-RR-84-2003

A Statistical Significance Test for Person Authentication, Samy Bengio and Johnny Mariéthoz, Idiap-RR-83-2003

Some Emerging Concepts in Speech Recognition., Hynek Hermansky and Hervé Bourlard, Idiap-RR-82-2003

Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research, Hynek Hermansky and Nelson Morgan, Idiap-RR-81-2003

On Performance Evaluation of Face Detection and Localization Algorithms, Vlad Popovici, Yann Rodriguez, Jean-Philippe Thiran and Sébastien Marcel, Idiap-RR-80-2003

Reconnaissance de gestes 3D bi-manuels, Agnès Just, Sébastien Marcel, O. Bernier and J. E. Viallet, Idiap-RR-79-2003

A Probabilistic Framework for Joint Head Tracking and Pose Estimation, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-78-2003

Adapted Generative Models For Face Verification, Fabien Cardinaux, Conrad Sanderson and Samy Bengio, Idiap-RR-76-2003

Tangent Vector Kernels for Invariant Image Classification with SVMs, Alexei Pozdnoukhov and Samy Bengio, Idiap-RR-75-2003

Textual Data Representation, Mikaela Keller and Samy Bengio, Idiap-RR-74-2003

Embedding Motion in Model-Based Stochastic Tracking, Jean-Marc Odobez, Daniel Gatica-Perez and Silèye O. Ba, Idiap-RR-72-2003

A Color and Gradient Local Descriptor Fusion Scheme For Object Recognition, Pedro Quelhas and Jean-Marc Odobez, Idiap-RR-71-2003

Real-Time Face Detection Using Boosting Learning in Hierarchical Feature Spaces, Dong Zhang, S. Z. Li and Daniel Gatica-Perez, Idiap-RR-70-2003

Online Policy Adaptation for Ensemble Classifiers, Christos Dimitrakakis and Samy Bengio, Idiap-RR-69-2003

Improving Face Verification using Symmetric Transformation, Sébastien Marcel, Idiap-RR-68-2003

A Symmetric Transformation for LDA-based Face Verification, Sébastien Marcel, Idiap-RR-67-2003

Face Verification using LDA and MLP on the BANCA database, Sébastien Marcel, Idiap-RR-66-2003

Boosting Pixel-based Classifiers for Face Verification, Yann Rodriguez and Sébastien Marcel, Idiap-RR-65-2003

EEG-based BCI Systems and IDIAP EEG Database, Silvia Chiappa and José del R. Millán, Idiap-RR-64-2003

Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, Agnès Just, O. Bernier and Sébastien Marcel, Idiap-RR-63-2003

An Investigation of Spectral Subband Centroids for Speaker Authentication, Norman Poh, Conrad Sanderson and Samy Bengio, Idiap-RR-62-2003

Noisy Text Categorization, Alessandro Vinciarelli, Idiap-RR-61-2003

Face Verification Using Synthesized Non-Frontal Models, Conrad Sanderson and Samy Bengio, Idiap-RR-60-2003

Why Do Multi-Stream, Multi-Band and Multi-Modal Approaches Work on Biometric User Authentication Tasks?, Norman Poh and Samy Bengio, Idiap-RR-59-2003

Vessel Segmentation and Branching Detection using an Adaptive Profile Kalman Filter in Retinal Blood Vessel Structure Analysis, Pedro Quelhas and James Boyce, Idiap-RR-58-2003

On Use of Task Independent Training Data in Tandem Feature Extraction, Sunil Sivadas and Hynek Hermansky, Idiap-RR-57-2003

Spectral Entropy Based Feature for Robust ASR, Hemant Misra, Shajith Ikbal, Hervé Bourlard and Hynek Hermansky, Idiap-RR-56-2003

Clustering And Segmenting Speakers And Their Locations In Meetings, Jitendra Ajmera, Guillaume Lathoud and Iain A. McCowan, Idiap-RR-55-2003

Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, Shajith Ikbal, Hemant Misra, Hervé Bourlard and Hynek Hermansky, Idiap-RR-54-2003

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-53-2003

Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition, Mathew Magimai-Doss, Samy Bengio and Hervé Bourlard, Idiap-RR-52-2003

An Alternative To Silence Removal For Text-Independent Speaker Verification, Johnny Mariéthoz and Samy Bengio, Idiap-RR-51-2003

TRAP-TANDEM: Data-driven extraction of temporal features from speech, Hynek Hermansky, Idiap-RR-50-2003

HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems, Silvia Chiappa and Samy Bengio, Idiap-RR-49-2003

Comparison and Combination of Features in a Hybrid HMM/MLP and a HMM/GMM Speech Recognition System, Pere Pujol, Susagna Pol, Climent Nadeu, Astrid Hagen and Hervé Bourlard, Idiap-RR-48-2003

Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR, Vivek Tyagi, Iain A. McCowan, Hervé Bourlard and Hemant Misra, Idiap-RR-47-2003

Audio-Video Person Clustering in Video Databases, F. Kottelat and Jean-Marc Odobez, Idiap-RR-46-2003

Towards Computer Understanding of Human Interactions, Iain A. McCowan, Daniel Gatica-Perez, Samy Bengio and Hervé Bourlard, Idiap-RR-45-2003

Text detection and recognition in images and video sequences, Datong Chen, Idiap-RR-44-2003

Video Text Segmentation Using Particle Filters, Datong Chen and Jean-Marc Odobez, Idiap-RR-43-2003

A Localization/Verification Scheme for Finding Text in Images and Video Frames Based on Contrast Independent Features and Machine Learning Methods, Datong Chen and Jean-Marc Odobez, Idiap-RR-42-2003

Boosting HMMs with an application to speech recognition, Christos Dimitrakakis and Samy Bengio, Idiap-RR-41-2003

Noise Robust Discriminative Models, Quan Le and Samy Bengio, Idiap-RR-40-2003

An Online Audio Indexing System, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-39-2003

A Robust Speaker Clustering Algorithm, Jitendra Ajmera and Charles Wooters, Idiap-RR-38-2003

Phoneme-Grapheme Based Speech Recognition System, Mathew Magimai-Doss, Todd Andrew Stephenson, Hervé Bourlard and Samy Bengio, Idiap-RR-37-2003

Nonlinear Spectral Transformations for Robust Speech Recognition, Shajith Ikbal, Hynek Hermansky and Hervé Bourlard, Idiap-RR-36-2003

Adaptive Importance Sampling to Accelerate Training of a Neural Probabilistic Language Model, Yoshua Bengio and Jean-Sébastien Senécal, Idiap-RR-35-2003

HMM Mixtures (HMM2) for Robust Speech Recognition, Katrin Weber, Idiap-RR-34-2003

On Multi-scale Fourier Transform Analysis of Speech Signals, Vivek Tyagi and Hervé Bourlard, Idiap-RR-33-2003

On Factorizing Spectral Dynamics for Robust Speech Recognition, Vivek Tyagi, Iain A. McCowan, Hervé Bourlard and Hemant Misra, Idiap-RR-32-2003

On Automatic Annotation of Images with Latent Space Models, Florent Monay and Daniel Gatica-Perez, Idiap-RR-31-2003

On the Need for On-Line Learning in Brain-Computer Interfaces, José del R. Millán, Idiap-RR-30-2003

From Samples to Objects in Kernel Methods, Alexei Pozdnoukhov and Samy Bengio, Idiap-RR-29-2003

Speech Recognition with Auxiliary Information, Todd Andrew Stephenson, Idiap-RR-28-2003

Automatic Analysis of Multimodal Group Actions in Meetings, Iain A. McCowan, Daniel Gatica-Perez, Samy Bengio, Guillaume Lathoud, Mark Barnard and Dong Zhang, Idiap-RR-27-2003

Non-Linear Variance Reduction Techniques in Biometric Authentication, Norman Poh and Samy Bengio, Idiap-RR-26-2003

A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, Daniel Gatica-Perez, Guillaume Lathoud, Iain A. McCowan and Jean-Marc Odobez, Idiap-RR-25-2003

Offline Cursive Handwriting: From Word To Text Recognition, Alessandro Vinciarelli, Idiap-RR-24-2003

Using pitch frequency information in speech recognition, Mathew Magimai-Doss, Todd Andrew Stephenson and Hervé Bourlard, Idiap-RR-23-2003

Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models, Alessandro Vinciarelli, Samy Bengio and Horst Bunke, Idiap-RR-22-2003

Segmenting Multiple Concurrent Speakers Using Microphone Arrays, Guillaume Lathoud, Iain A. McCowan and Darren Moore, Idiap-RR-21-2003

Face Processing & Frontal Face Verification, Conrad Sanderson, Idiap-RR-20-2003

On the Combination of Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-19-2003

Automatic Speech Recognition using Dynamic Bayesian Networks with the Energy as an Auxiliary Variable, Jaume Escofet and Todd Andrew Stephenson, Idiap-RR-18-2003

Variance Reduction Techniques in Biometric Authentication, Norman Poh and Samy Bengio, Idiap-RR-17-2003

A New Margin-Based Criterion for Efficient Gradient Descent, Ronan Collobert and Samy Bengio, Idiap-RR-16-2003

An Implicit Motion Likelihood for Tracking with Particle Filters, Jean-Marc Odobez, Silèye O. Ba and Daniel Gatica-Perez, Idiap-RR-15-2003

Nonlinear Analysis of Cognitive and Motor-related EEG Signals, Silvia Chiappa and Samy Bengio, Idiap-RR-14-2003

Speech & Face Based Biometric Authentication at IDIAP, Conrad Sanderson, Samy Bengio, Hervé Bourlard, Johnny Mariéthoz, Ronan Collobert, Mohamed Faouzi BenZeghiba, Fabien Cardinaux and Sébastien Marcel, Idiap-RR-13-2003

Multi-Modal Audio-Visual Event Recognition for Football Analysis, Mark Barnard, Jean-Marc Odobez and Samy Bengio, Idiap-RR-12-2003

Conditional Gaussian Mixtures, Todd Andrew Stephenson, Idiap-RR-11-2003

Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS, Fabien Cardinaux, Conrad Sanderson and Sébastien Marcel, Idiap-RR-10-2003

Object Localization in Metric Spaces for Video Linking, Daniel Gatica-Perez and Ming-Ting Sun, Idiap-RR-09-2003

Evaluation of formant-like features for automatic speech recognition, F. de Wet, Katrin Weber, Louis Boves, B. Cranen, Samy Bengio and Hervé Bourlard, Idiap-RR-08-2003

Monte Carlo Video Text Segmentation, Datong Chen and Jean-Marc Odobez, Idiap-RR-07-2003

On automatic annotation of meeting databases, Daniel Gatica-Perez, Iain A. McCowan, Mark Barnard, Samy Bengio and Hervé Bourlard, Idiap-RR-06-2003

Robust Features for Frontal Face Authentication in Difficult Image Conditions, Conrad Sanderson and Samy Bengio, Idiap-RR-05-2003

Scalability Analysis of Audio-Visual Person Identity Verification, J. Czyz, Samy Bengio, Christine Marcel and L. Vandendorpe, Idiap-RR-04-2003

Client Dependent GMM-SVM Models for Speaker Verification, Quan Le and Samy Bengio, Idiap-RR-03-2003

Multimodal Authentication using Asynchronous HMMs, Samy Bengio, Idiap-RR-02-2003

Offline Recognition of Large Vocabulary Cursive Handwritten Text, Alessandro Vinciarelli, Samy Bengio and Horst Bunke, Idiap-RR-01-2003

Modelling auxiliary information (pitch frequency) in hybrid HMM/ANN based ASR systems, Mathew Magimai-Doss, Todd Andrew Stephenson and Hervé Bourlard, Idiap-RR-62-2002

Text Detection and Recognition in Images and Videos, Datong Chen, Jean-Marc Odobez and Hervé Bourlard, Idiap-RR-61-2002

Self-Organizing-Maps With BIC For Speaker Clustering, I. Lapidot, Idiap-RR-60-2002

Modeling Human Interaction in Meetings, Iain A. McCowan, Samy Bengio, Daniel Gatica-Perez, Guillaume Lathoud, Florent Monay, Darren Moore, Pierre Wellner and Hervé Bourlard, Idiap-RR-59-2002

Speech recognition with auxiliary information, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-58-2002

Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR, Astrid Hagen and Andrew Morris, Idiap-RR-57-2002

What is Better: GMM of Two Gaussians or Two Clusters With One Gaussian?, I. Lapidot, Idiap-RR-56-2002

On Spectral Methods and the Structuring of Home Videos, Jean-Marc Odobez, Daniel Gatica-Perez and Maël Guillemot, Idiap-RR-55-2002

The analysis of kernel ridge regression learning algorithm., Alexei Pozdnoukhov, Idiap-RR-54-2002

Confusion matrix based posterior probabilities correction, Andrew Morris and Hemant Misra, Idiap-RR-53-2002

Mutliscale Facial Expression Recognition using Convolutional Neural Networks, B. Fasel, Idiap-RR-52-2002

Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks, B. Fasel, Idiap-RR-51-2002

Evaluation Protocols and Comparative Results for the Triesch Hand Posture Database, Sébastien Marcel, Idiap-RR-50-2002

Robust Face Verification using Skin Color and Neural Networks, Sébastien Marcel, Idiap-RR-49-2002

Dichotomy Between Clustering Performance and Minimum Distortion in Piecewise-Dependent-Data (PDD) Clustering, I. Lapidot and H. Guterman, Idiap-RR-48-2002

Towards Robust and Adaptive Speech Recognition Models, Hervé Bourlard, Samy Bengio and Katrin Weber, Idiap-RR-47-2002

Torch: a modular machine learning software library, Ronan Collobert, Samy Bengio and Johnny Mariéthoz, Idiap-RR-46-2002

Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-45-2002

Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-44-2002

Location Based Speaker Segmentation, Guillaume Lathoud and Iain A. McCowan, Idiap-RR-43-2002

Extended BIC Criterion for Model Selection, I. Lapidot and Andrew Morris, Idiap-RR-42-2002

Microphone Array Speech Recognition : Experiments on Overlapping Speech in Meetings, Darren Moore and Iain A. McCowan, Idiap-RR-41-2002

Improving Face Authetication Using Virtual Samples, Norman Poh, Sébastien Marcel and Samy Bengio, Idiap-RR-40-2002

Robust Speaker Change Detection, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-39-2002

Phase AutoCorrelation (PAC) derived Robust Speech Features, Shajith Ikbal, Hemant Misra and Hervé Bourlard, Idiap-RR-38-2002

Audio-Visual Speaker Tracking with Importance Particle Filters, Daniel Gatica-Perez, Guillaume Lathoud, Iain A. McCowan, Jean-Marc Odobez and Darren Moore, Idiap-RR-37-2002

A State-of-the-art Neural Network for Robust Face Verification, Sébastien Marcel, Christine Marcel and Samy Bengio, Idiap-RR-36-2002

User-Customized Password HMM Based Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-35-2002

Gestures for Multi-Modal Interfaces: A Review, Sébastien Marcel, Idiap-RR-34-2002

Information Fusion and Person Verification Using Speech & Face Information, Conrad Sanderson and Kuldip K. Paliwal, Idiap-RR-33-2002

Transforming the feature vectors to improve HMM based cursive word recognition systems, Alessandro Vinciarelli and Samy Bengio, Idiap-RR-32-2002

Entropy-based Multi-stream Combination, Hemant Misra, Hervé Bourlard and Vivek Tyagi, Idiap-RR-31-2002

SOM-Based Clustering for On-Line Fraud Behavior Classification: a Case Study, V. Lemaire and F. Clérot, Idiap-RR-30-2002

Noise PDF transformation in secondary feature processing, Andrew Morris, Idiap-RR-29-2002

Online Policy Adaptation for Ensemble Algorithms, Christos Dimitrakakis and Samy Bengio, Idiap-RR-28-2002

Bagging Using the VMSE Cost Function, V. Lemaire, Idiap-RR-27-2002

An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, Samy Bengio, Idiap-RR-26-2002

Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-25-2002

Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables, Todd Andrew Stephenson, Jaume Escofet, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-24-2002

Improved Unknown-Multiple Speaker clustering using HMM, Jitendra Ajmera, Hervé Bourlard and I. Lapidot, Idiap-RR-23-2002

Finding Structure in Consumer Videos by Probabilistic Hierarchical Clustering, Daniel Gatica-Perez, Alexander Loui and Ming-Ting Sun, Idiap-RR-22-2002

Face Verification using MLP and SVM, Fabien Cardinaux and Sébastien Marcel, Idiap-RR-21-2002

Linking Objects in Videos by Importance Sampling, Daniel Gatica-Perez and Ming-Ting Sun, Idiap-RR-20-2002

Comparison of Support Vector Machine and Neural Network for Text Texture Verification, Datong Chen and Jean-Marc Odobez, Idiap-RR-19-2002

Video Text Recognition Based on Markov Random Field and Grayscale Consistency Constraint, Jean-Marc Odobez and Datong Chen, Idiap-RR-18-2002

Text Segmentation and Recognition in Complex Background Based on Markov Random Field, Datong Chen, Jean-Marc Odobez and Hervé Bourlard, Idiap-RR-17-2002

A New Method of Contrast Normalization for Verification of Extracted Video Text Having Complex Backgrounds, Datong Chen and Jean-Marc Odobez, Idiap-RR-16-2002

Speaker Normalization using HMM2, Shajith Ikbal, Katrin Weber and Hervé Bourlard, Idiap-RR-15-2002

A Multi-sample Multi-source Model for Biometric Authentication, Norman Poh, Samy Bengio and Jerzy Korczak, Idiap-RR-14-2002

The BANCA Database and Experimental Protocol for Speaker Verification, F. Porée, Johnny Mariéthoz, Samy Bengio and Frédéric Bimbot, Idiap-RR-13-2002

Conditional Gaussian Mixture Models for Environmental Risk Mapping, Nicolas Gilardi, Samy Bengio and Mikhail Kanevski, Idiap-RR-12-2002

Probabilistic Home Video Structuring: Feature Selection and Performance Evaluation, Daniel Gatica-Perez, Ming-Ting Sun and Alexander Loui, Idiap-RR-11-2002

User-Customized Password Speaker Verification based on HMM/ANN and GMM Models, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-10-2002

Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach, Iain A. McCowan, Andrew Morris and Hervé Bourlard, Idiap-RR-09-2002

Low cost duration modelling for noise robust speech recognition, Andrew Morris, Simon Payne and Hervé Bourlard, Idiap-RR-08-2002

Unknown-Multiple Speaker clustering using HMM, Jitendra Ajmera, Hervé Bourlard, I. Lapidot and Iain A. McCowan, Idiap-RR-07-2002

Hybrid generative-discriminative models for speech and speaker recognition, Quan Le and Samy Bengio, Idiap-RR-06-2002

Experimental Protocol on the BANCA Database, Samy Bengio, Frédéric Bimbot, Johnny Mariéthoz, Vlad Popovici, F. Porée, E. Bailly-Baillière, G. Matas and B. Ruiz, Idiap-RR-05-2002

Evaluation of Formant-Like Features for ASR, Katrin Weber, F. de Wet, B. Cranen, Louis Boves, Samy Bengio and Hervé Bourlard, Idiap-RR-04-2002

Estimation of Conditional Distributions using Gaussian Mixture Models, Nicolas Gilardi, Samy Bengio and Mikhail Kanevski, Idiap-RR-03-2002

Estimating the Intrinsic Dimension of Data with a Fractal-Based Method, Francesco Camastra and Alessandro Vinciarelli, Idiap-RR-02-2002

Towards Robust and Adaptive Speech Recognition Models, Hervé Bourlard, Samy Bengio and Katrin Weber, Idiap-RR-01-2002

Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, B. Fasel, Idiap-RR-49-2001

Robust Face Analysis using Convolutional Neural Networks, B. Fasel, Idiap-RR-48-2001

Offline Cursive Word Recognition using Continuous Density Hidden Markov Models trained with PCA or ICA Features, Alessandro Vinciarelli and Samy Bengio, Idiap-RR-46-2001

Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-45-2001

Improving Face Verification using Skin Color Information, Sébastien Marcel and Samy Bengio, Idiap-RR-44-2001

Robust Speech Recognition and Feature Extraction Using HMM2, Katrin Weber, Shajith Ikbal, Samy Bengio and Hervé Bourlard, Idiap-RR-42-2001

Robust speech recognition based on multi-stream processing, Astrid Hagen, Idiap-RR-41-2001

Microphone Array Post-filter based on Noise Field Coherence, Iain A. McCowan and Hervé Bourlard, Idiap-RR-40-2001

Microphone Array Post-filter for Diffuse Noise Field, Iain A. McCowan and Hervé Bourlard, Idiap-RR-39-2001

Confidence Measures for Multimodal Identity Verification, Samy Bengio, Christine Marcel, Sébastien Marcel and Johnny Mariéthoz, Idiap-RR-38-2001

Hidden Markov Models and other Finite State Automata for Sequence Processing, Hervé Bourlard and Samy Bengio, Idiap-RR-37-2001

Increasing Speech Recognition Noise Robustness with HMM2, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-36-2001

PhD Thesis: Speech Analysis with Production Constraints, Sacha Krstulović, Idiap-RR-35-2001

A Comparative Study of Adaptation Methods for Speaker Verification, Johnny Mariéthoz and Samy Bengio, Idiap-RR-34-2001

Robust HMM-Based Speech/Music Segmentation, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-33-2001

User Customized HMM/ANN Based Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-32-2001

EEG pattern recognition through multi-stream evidence combination, Andrew Morris, Bernhard Obermaier and Gert Pfurtscheller, Idiap-RR-31-2001

Data utility modelling for mismatch reduction, Andrew Morris, Idiap-RR-30-2001

Pronunciation models and their evaluation using confidence measures, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-29-2001

Video OCR for Sport Video Annotation and Retrieval, Datong Chen and Hervé Bourlard, Idiap-RR-28-2001

IDIAP HMM/HMM2 System: Theoretical Basis and Software Specifications, Shajith Ikbal, Hervé Bourlard, Samy Bengio and Katrin Weber, Idiap-RR-27-2001

Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framewor, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-26-2001

Comparison of Client Model Adaptation Schemes, Samy Bengio and Johnny Mariéthoz, Idiap-RR-25-2001

Speech Recognition Using Advanced HMM2 Features, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-24-2001

A Pragmatic View of the Application of HMM2 for ASR, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-23-2001

Confidence Evaluation for Risk Prediction, Nicolas Gilardi, Tom Melluish and Michel Maignan, Idiap-RR-22-2001

Evaluation of Biometric Technology on XM2VTS, Samy Bengio, Johnny Mariéthoz and Sébastien Marcel, Idiap-RR-21-2001

Text Identification in Complex Background using SVM, Datong Chen, Hervé Bourlard and Jean-Philippe Thiran, Idiap-RR-20-2001

Text Enhancement with Asymmetric Filter for Video OCR, Datong Chen, Kim Shearer and Hervé Bourlard, Idiap-RR-19-2001

Combining Neural Gas and Learning Vector Quantization for Cursive Character Recognition, Francesco Camastra and Alessandro Vinciarelli, Idiap-RR-18-2001

Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model, S. Moeller and Hervé Bourlard, Idiap-RR-17-2001

Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, Alessandro Vinciarelli and Samy Bengio, Idiap-RR-15-2001

MAP Combination of Multi-Stream HMM or HMM/ANN Experts, Andrew Morris, Astrid Hagen and Hervé Bourlard, Idiap-RR-14-2001

Speaker Verification Based On User-Customized Password, Mohamed Faouzi BenZeghiba, Hervé Bourlard and Johnny Mariéthoz, Idiap-RR-13-2001

A Parallel Mixture of SVMs for Very Large Scale Problems, Ronan Collobert, Samy Bengio and Yoshua Bengio, Idiap-RR-12-2001

Modeling Auxiliary Information in Bayesian Network Based ASR, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-11-2001

Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, Astrid Hagen and Hervé Bourlard, Idiap-RR-10-2001

Neural Networks in Automatic Speech Recognition, F. Beaufays, Hervé Bourlard, H. Franco and Nelson Morgan, Idiap-RR-09-2001

Using posterior probabilities for speech/music discrimination, Maja Popović, Idiap-RR-08-2001

Evaluation of SVM Binary Classification with Nonparametric Stochastic Simulations, Mikhail Kanevski, Idiap-RR-07-2001

From missing data to maybe useful data: soft data modelling for noise robust ASR, Andrew Morris, Jon Barker and Hervé Bourlard, Idiap-RR-06-2001

Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, Astrid Hagen, Hervé Bourlard and Andrew Morris, Idiap-RR-05-2001

Support Vector Machines for Classification and Mapping of Reservoir Data, Mikhail Kanevski, Alexei Pozdnoukhov, Stéphane Canu, Michel Maignan, Patrick Wong and S. Shibli, Idiap-RR-04-2001

Detection of Narrative Structure for Annotation of News Broadcasts, Kim Shearer, Chitra Dorai and Svetha Venkatesh, Idiap-RR-03-2001

Artifacts of the colour coherence vector and an alternative similarity measure, Kim Shearer and Svetha Venkatesh, Idiap-RR-02-2001

New Approaches Towards Robust and Adaptive Speech Recognition, Hervé Bourlard, Samy Bengio and Katrin Weber, Idiap-RR-01-2001

A MAP approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification, Johnny Mariéthoz, Johan Lindberg and Frédéric Bimbot, Idiap-RR-48-2000

Cursive Character Recognition by Learning Vector Quantization, Francesco Camastra and Alessandro Vinciarelli, Idiap-RR-47-2000

The use of Boolean concepts in general classification contexts, Miguel Moreira, Idiap-RR-46-2000

Approches génératives pour le traitement de séquences d'images: application à la reconnaissance dynamique des gestes de la main, Sébastien Marcel, Idiap-RR-45-2000

Weighting schemes for audio-visual fusion in speech recognition, Hervé Glotin, D. Vergyri, C. Neti, G. Potamianos and Juergen Luettin, Idiap-RR-44-2000

A survey on Off-Line Cursive Word Recognition, Alessandro Vinciarelli, Idiap-RR-43-2000

HMM2- Extraction of Formant Features and their Use for Robust ASR, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-42-2000

Automatic Speech Recognition using Pitch Information in Dynamic Bayesian Networks, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-41-2000

Learning the Decision Function for Speaker Verification, Samy Bengio and Johnny Mariéthoz, Idiap-RR-40-2000

A Survey of Text Detection and Recognition in Images and Videos, Datong Chen and Juergen Luettin, Idiap-RR-38-2000

ASYMMETRIC FILTER FOR TEXT RECOGNITION IN VIDEO, Datong Chen and Kim Shearer, Idiap-RR-37-2000

Robust multi-stream speech recognition based on the combined reliabilities of the speech signal and phonemes estimates, Hervé Glotin, Idiap-RR-36-2000

Audio visual speech recognition, C. Neti, G. Potamianos, Juergen Luettin, I. Matthews, Hervé Glotin, D. Vergyri, J. Sison and A. Mashari, Idiap-RR-35-2000

Local Machine Learning Models for Spatial Data Analysis, Nicolas Gilardi and Samy Bengio, Idiap-RR-34-2000

Intrinsic dimension estimation of data: an approach based on Grassberger-Procaccia's algorithm, Francesco Camastra and Alessandro Vinciarelli, Idiap-RR-33-2000

A new normalization technique for cursive handwritten words, Alessandro Vinciarelli and Juergen Luettin, Idiap-RR-32-2000

Advanced Spatial Data Analysis and Modelling with Support Vector Machines, Mikhail Kanevski, Alexei Pozdnoukhov, Stéphane Canu and Michel Maignan, Idiap-RR-31-2000

HMM2- A Novel Approach to HMM Emission Probability Estimation, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-30-2000

Multiple Timescale Feature Combination towards Robust Speech Recognition, Katrin Weber, Idiap-RR-29-2000

Multiple Hypotheses Video OCR, Datong Chen and Juergen Luettin, Idiap-RR-28-2000

Test of several external posterior weighting functions for multiband Full Combination ASR, Hervé Glotin and Frédéric Berthommier, Idiap-RR-27-2000

Recent Developments in Speaker Verification at IDIAP, B. Nedic and Hervé Bourlard, Idiap-RR-26-2000

Mixtures of latent variable models for density estimation and classification, Perry Moerland, Idiap-RR-25-2000

On the Convergence of SVMTorch, an Algorithm for Large-Scale Regression Problems, Ronan Collobert and Samy Bengio, Idiap-RR-24-2000

A neural network for classification with incomplete data, Andrew Morris, Idiap-RR-23-2000

Using Multiple Time Scales in the Framework of Multi-Stream Speech Recognition, Astrid Hagen and Hervé Bourlard, Idiap-RR-22-2000

Comparison of HMM experts with MLP experts in the Full Combination Multi-Band Approach to Robust ASR, Astrid Hagen and Andrew Morris, Idiap-RR-21-2000

From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, Astrid Hagen, Andrew Morris and Hervé Bourlard, Idiap-RR-20-2000

Automatic Speech Recognition using Dynamic Bayesian Networks with both Acoustic and Articulatory Variables, Todd Andrew Stephenson, Hervé Bourlard, Samy Bengio and Andrew Morris, Idiap-RR-19-2000

Mixture Models for Unsupervised and Supervised Learning, Perry Moerland, Idiap-RR-18-2000

Support Vector Machines for Large-Scale Regression Problems, Ronan Collobert and Samy Bengio, Idiap-RR-17-2000

Auto-Association by Multilayer Perceptrons and Singular Value Decomposition, Hervé Bourlard, Idiap-RR-16-2000

Video Indexing and Similarity Retrieval by Largest Common Subgraph Detection using Decision Trees, Kim Shearer, Horst Bunke and Svetha Venkatesh, Idiap-RR-15-2000

Incorporating Domain Knowledge with Video and Voice Data Analysis in News Broadcasts, Kim Shearer, Chitra Dorai and Svetha Venkatesh, Idiap-RR-14-2000

Combining multiple tracking algorithms for improved general performance, Kim Shearer, Kirrily D Wong and Svetha Venkatesh, Idiap-RR-13-2000

Video sequence matching via decision tree path following, Kim Shearer, Svetha Venkatesh and Horst Bunke, Idiap-RR-12-2000

An EM Algorithm for HMMs with Emission Distributions Represented by HMMs, Samy Bengio, Hervé Bourlard and Katrin Weber, Idiap-RR-11-2000

Environmental Data Mapping with Support Vector Regression and Geostatistics, Mikhail Kanevski, Patrick Wong and Stéphane Canu, Idiap-RR-10-2000

Spatial Data Mapping with Support Vector Regression, Mikhail Kanevski and Stéphane Canu, Idiap-RR-09-2000

Adaptation robuste de modèles HMM pour la vérification du locuteur dépendante du texte, Johnny Mariéthoz and Frédéric Bimbot, Idiap-RR-08-2000

Handwritten Digits Recognition, Eric Grand, Idiap-RR-07-2000

Indexing spoken audio by LSA and SOMs, Mikko Kurimo, Idiap-RR-06-2000

Thematic Indexing of Spoken Documents by Using Self-Organizing Maps, Mikko Kurimo, Idiap-RR-05-2000

An Introduction to Bayesian Network Theory and Usage, Todd Andrew Stephenson, Idiap-RR-03-2000

Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification, Corinne Fredouille, Johnny Mariéthoz, Cédric Jaboulet, Jean Hennebert, Chafic Mokbel and Frédéric Bimbot, Idiap-RR-02-2000

Taking on the Curse of Dimensionality in Joint Distributions Using Neural Networks, Samy Bengio and Yoshua Bengio, Idiap-RR-01-2000

Iterative Posterior-Based Keyword Spotting Without Filler Models: Iterative Viterbi Decoding and One-Pass Approach, Marius-Calin Silaghi and Hervé Bourlard, Idiap-RR-27-1999

Multi-stream adaptive evidence combination for noise robust ASR, Andrew Morris, Astrid Hagen, Hervé Glotin and Hervé Bourlard, Idiap-RR-26-1999

Off-Line Cursive Script Recognition Based on Continuous Density HMM, Alessandro Vinciarelli and Juergen Luettin, Idiap-RR-25-1999

An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, Frédéric Bimbot, Mats Blomberg, Louis Boves, Gérard Chollet, Cédric Jaboulet, Bruno Jacob, Jamal Kharroubi, Johan Koolwaaij, Johan Lindberg, Johnny Mariéthoz, Chafic Mokbel and Houda Mokbel, Idiap-RR-24-1999

CLIENT / WORLD MODEL SYNCHRONOUS ALIGNEMENT FOR SPEAKER VERIFICATION, Johnny Mariéthoz, Dominique Genoud, Frédéric Bimbot and Chafic Mokbel, Idiap-RR-23-1999

Recognition of Asymmetric Facial Action Unit Activities and Intensities, B. Fasel and Juergen Luettin, Idiap-RR-22-1999

INtegrating SPEech acoustic and linguistic Constraints: Baseline System Development, Giulia Bernardis, Hervé Bourlard, Martin Rajman and Jean-Cédric Chappelier, Idiap-RR-21-1999

Fast latent semantic indexing of spoken documents by using self-organizing maps, Mikko Kurimo, Idiap-RR-20-1999

Automatic Facial Expression Analysis: A Survey, B. Fasel and Juergen Luettin, Idiap-RR-19-1999

Towards introducing long-term statistics in MUSE for robust speech recognition, Christopher Kermorvant and Chafic Mokbel, Idiap-RR-18-1999

A comparison of two strategies for ASR in additive noise : Missing Data and Spectral Subtraction, Christopher Kermorvant and Andrew Morris, Idiap-RR-17-1999

Iterative Posterior-Based Keyword Spotting Without Filler Models, Marius-Calin Silaghi and Hervé Bourlard, Idiap-RR-16-1999

Numerical Experiments with Support Vector Machines, Mikhail Kanevski and Nicolas Gilardi, Idiap-RR-15-1999

Combining Wavelet-domain Hidden Markov Trees with Hidden Markov Models, Katrin Keller, Souheil Ben-Yacoub and Chafic Mokbel, Idiap-RR-14-1999

Indexing Audio Documents by using Latent Semantic Analysis and SOM, Mikko Kurimo, Idiap-RR-13-1999

Latent Semantic Indexing by Self-Organizing Map, Mikko Kurimo and Chafic Mokbel, Idiap-RR-12-1999

A comparison of noise reduction techniques for robust speech recognition, Christopher Kermorvant, Idiap-RR-10-1999

DynaBoost: Combining Boosted Hypotheses in a Dynamic Way, Perry Moerland and Eddy Mayoraz, Idiap-RR-09-1999

Combinatorial Approach for Data Binarization, Eddy Mayoraz and Miguel Moreira, Idiap-RR-08-1999

Environmental spatial data classification with Support Vector Machines, Mikhail Kanevski, Nicolas Gilardi, Eddy Mayoraz and Michel Maignan, Idiap-RR-07-1999

Synchronous Alignment, Johnny Mariéthoz and Chafic Mokbel, Idiap-RR-06-1999

Data binarization by discriminant elimination, Miguel Moreira, Alain Hertz and Eddy Mayoraz, Idiap-RR-04-1999

Fusion of Face and Speech Data for Person Identity Verification, Souheil Ben-Yacoub, Yousri Abdeljaoued and Eddy Mayoraz, Idiap-RR-03-1999

Speaker verification experiments on the XM2VTS database, Juergen Luettin, Idiap-RR-02-1999

Segmentation of X-ray Image Sequences Showing the Vocal Tract, Georg Thimm, Idiap-RR-01-1999

Segmentation of X-ray Image Sequences Showing the Vocal Tract (with tool documentation), Georg Thimm, Idiap-RR-01-1999

Audio-Visual Person Verification, Souheil Ben-Yacoub, Juergen Luettin, K. Jonsson, J. Matas and J. Kittler, Idiap-RR-18-1998

Automatic Speech Recognition: an Auditory Perspective, Nelson Morgan, Hervé Bourlard and Hynek Hermansky, Idiap-RR-17-1998

Acoustico-articulatory inversion of unequal-length tube models through lattice inverse filtering, Sacha Krstulović, Idiap-RR-16-1998

Subband-Based Speech Recognition in Noisy Conditions: The Full Combination Approach, Astrid Hagen, Andrew Morris and Hervé Bourlard, Idiap-RR-15-1998

Localized mixtures of experts, Perry Moerland, Idiap-RR-14-1998

Introduction à la reconnaissance de la parole et du locuteur, Hervé Bourlard, Idiap-RR-13-1998

Speaker Verification: A Quick Overview, Hervé Bourlard and Nelson Morgan, Idiap-RR-12-1998

Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, Giulia Bernardis and Hervé Bourlard, Idiap-RR-11-1998

Evaluating the Complexity of Databases for Person Identification and Verification, Georg Thimm, Souheil Ben-Yacoub and Juergen Luettin, Idiap-RR-10-1998

Illumination-robust Pattern Matching Using Distorted Color Histograms, Georg Thimm and Juergen Luettin, Idiap-RR-09-1998

Multi-Modal Data Fusion for Person Authentication using SVM, Souheil Ben-Yacoub, Idiap-RR-07-1998

Support Vector Machine for Multiclass Classification, Eddy Mayoraz and Ethem Alpaydin, Idiap-RR-06-1998

Combining Linear Dichomotizers to Construct Nonlinear Polychotomizers, Ethem Alpaydin and Eddy Mayoraz, Idiap-RR-05-1998

Combined 5x2cv $F$-Test for Comparing Supervised Classification Learning Algorithms, Ethem Alpaydin, Idiap-RR-04-1998

On the Complexity of Recognizing Regions Computable by Two-Layered Perceptrons, Eddy Mayoraz, Idiap-RR-03-1998

Continuous Audio-Visual Speech Recognition, Juergen Luettin and Stéphane Dupont, Idiap-RR-02-1998

Optimal Parameterization of Point Distribution Models, Georg Thimm and Juergen Luettin, Idiap-RR-01-1998

Investigation of a possible process identity between DRM and Linear Filtering, Sacha Krstulović, Idiap-RR-19-1997

Reconnaissance de caractères manuscrits à l'aide de réseaux neuromimétiques, Jean-Luc Beuchat, Idiap-RR-18-1997

Neural Network Adaptations to Hardware Implementations, Perry Moerland and Emile Fiesler, Idiap-RR-17-1997

An Optical Thresholding Perceptron, Indu Saxena, Perry Moerland, Emile Fiesler, A. R. Pourzand and N. Collings, Idiap-RR-16-1997

Handwritten Digit Recognition with Binary Optical Perceptron, Indu Saxena, Perry Moerland, Emile Fiesler and A. R. Pourzand, Idiap-RR-15-1997

Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition, Stéphane Dupont and Juergen Luettin, Idiap-RR-14-1997

Acoustic-Labial Speaker Verification, Pierre Jourlin, Juergen Luettin, Dominique Genoud and H. Wassner, Idiap-RR-13-1997

Speechreading using Probabilistic Models, Juergen Luettin and Neil A. Thacker, Idiap-RR-12-1997

Fast Object Detection using MLP and FFT, Souheil Ben-Yacoub, Idiap-RR-11-1997

On the Complexity of Recognizing Iterated Differences of Polyhedra, Eddy Mayoraz, Idiap-RR-10-1997

Improved Pairwise Coupling Classification With Correcting Classifiers, Miguel Moreira and Eddy Mayoraz, Idiap-RR-09-1997

Text dependent speaker verification using binary classifiers, Dominique Genoud, Miguel Moreira and Eddy Mayoraz, Idiap-RR-08-1997

Mixtures of Experts Estimate A Posteriori Probabilities, Perry Moerland, Idiap-RR-07-1997

Decision fusion in a multi-modal identity verification system using a multi-linear classifier, Patrick Verlinde, Gilbert Maître and Eddy Mayoraz, Idiap-RR-06-1997

Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, Frédéric Bimbot and Dominique Genoud, Idiap-RR-05-1997

Optimal Setting of Weights, Learning Rate, and Gain, Georg Thimm and Emile Fiesler, Idiap-RR-04-1997

Pruning of Neural Networks, Georg Thimm and Emile Fiesler, Idiap-RR-03-1997

Discrete All-Positive Multilayer Perceptrons for Optical Implementation, Perry Moerland, Emile Fiesler and Indu Saxena, Idiap-RR-02-1997

Robust Speech Recognition based on Multi-Stream Features, Stéphane Dupont, Hervé Bourlard and Christophe Ris, Idiap-RR-01-1997

Image Classification by Neural Networks for the Quality Control of Watches, Miguel Moreira, Emile Fiesler and Gianni Pante, Idiap-RR-10-1996

Speaker-Dependent Speech Recognition Based on Phone-Like Units Models --- Application to Voice Dialing, Vincent Fontaine and Hervé Bourlard, Idiap-RR-09-1996

On the Decomposition of Polychotomies into Dichotomies, Eddy Mayoraz and Miguel Moreira, Idiap-RR-08-1996

Multi-Stream Speech Recognition, Hervé Bourlard, Stéphane Dupont and Christophe Ris, Idiap-RR-07-1996

On Variations of the Convex Hull Operator, Eddy Mayoraz, Idiap-RR-06-1996

An Implementation of Logical Analysis of Data, Endre Boros, Peter L. Hammer, Toshihide Ibaraki, Alexander Kogan, Eddy Mayoraz and Ilya Muchnik, Idiap-RR-05-1996

Secured vocal access to telephone servers, Olivier Bornet, Gérard Chollet, Jean-Luc Cochard, Andrei Constantinescu and Dominique Genoud, Idiap-RR-04-1996

On the Complexity of the Class of Regions Computable by a Two-Layered Perceptron, Eddy Mayoraz, Idiap-RR-03-1996

Combining methods to improve speaker verification decision, Dominique Genoud, Guillaume Gravier, Frédéric Bimbot and Gérard Chollet, Idiap-RR-02-1996

Swiss French PolyPhone and PolyVar: telephone speech databases to model inter- and intra-speaker variability, Gérard Chollet, Jean-Luc Cochard, Andrei Constantinescu, Cédric Jaboulet and Philippe Langlais, Idiap-RR-01-1996

Neural Networks with Adaptive Learning Rate and Momentum Terms, Miguel Moreira and Emile Fiesler, Idiap-RR-04-1995

Experiments with robust similarity measures for OCR, Gilbert Maître, Idiap-RR-03-1995

Définition et évaluation d'un protocole de négociation dans un système multi-agents de reconnaissance de la parole, Murielle Vial, Idiap-RR-02-1995

Apprentissage de prototypes de caractères à partir de l'image d'un texte manuscrit et avec l'aide d'un opérateur, Stéphane Brunet, Idiap-RR-01-1995

High Order and Multilayer Perceptron Initialization, Georg Thimm and Emile Fiesler, Idiap-RR-07-1994

Adaptive Multilayer Optical Neural Network Design, Indu Saxena and Emile Fiesler, Idiap-RR-04-1994

A System for the Off-Line Recognition of Handwritten Text, Thomas M. Breuel, Idiap-RR-02-1994

Finding Lines under Bounded Error, Thomas M. Breuel, Idiap-RR-11-1993

An RBF Network that Learns Some Aspects of Perceptual Organization, Thomas M. Breuel, Idiap-RR-10-1993

View-Based Recognition, Thomas M. Breuel, Idiap-RR-09-1993

The 3D Indexing Problem, Thomas M. Breuel, Idiap-RR-08-1993

Geometric Matching in Computer Vision--Algorithms and Open Problems, Thomas M. Breuel, Idiap-RR-07-1993

Recognition of Handprinted Digits, Thomas M. Breuel, Idiap-RR-06-1993

Un interface de recherche documentaire: I de r, version 2.0, Jean-Luc Cochard, Idiap-RR-04-1993

Un interface d'indexation documentaire: I d'i, version 2.0, Jean-Luc Cochard, Idiap-RR-03-1993

Higher-Order Statistics in Visual Object Recognition, Thomas M. Breuel, Idiap-RR-02-1993

Un interface d'indexation documentaire: I d'i, version 1.4, Jean-Luc Cochard, Idiap-RR-01-1993

Une technique efficace de traitement en Prolog de la morphologie flexionnelle du français, Jean-Luc Cochard, Idiap-RR-04-1992

Un environnement d'analyse linguistique robuste: CPD, version 1.7, Jean-Luc Cochard, Idiap-RR-03-1992

Neural Network Formalization, Emile Fiesler, Idiap-RR-01-1992