Research reports list - Idiap Publications

"The Sum of Its Parts": Joint Learning of Word and Phrase Representations with Autoencoders, Rémi Lebret and Ronan Collobert, Idiap-RR-21-2015

2D Face Recognition: An Experimental and Reproducible Research Survey, Manuel Günther, Laurent El Shafey and Sébastien Marcel, Idiap-RR-13-2017

2D Multi-Person Tracking: A Comparative Study in AMI Meetings, Kevin C. Smith, Sascha Schreiber, Vítezslav Beran, Igor Potúcek, Gerhard Rigoll and Daniel Gatica-Perez, Idiap-RR-37-2006

A Bayesian Alternative to Gain Adaptation in Autoregressive Hidden Markov Models, Bertrand Mesot and David Barber, Idiap-RR-55-2006

A BAYESIAN APPROACH TO INTER-TASK FUSION FOR SPEAKER RECOGNITION, Srikanth Madikeri, Subhadeep Dey and Petr Motlicek, Idiap-RR-07-2020

A Bayesian Switching Linear Dynamical System for Scale-Invariant robust speech extraction, Bertrand Mesot and David Barber, Idiap-RR-52-2007

A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, Jean-Marc Odobez and Silèye O. Ba, Idiap-RR-20-2007

A Color and Gradient Local Descriptor Fusion Scheme For Object Recognition, Pedro Quelhas and Jean-Marc Odobez, Idiap-RR-71-2003

A Comparative Psychophysical and EEG Study of Different Feedback Modalities for HRI, Xavier Perrin, Ricardo Chavarriaga, Céline Ray, Roland Siegwart and José del R. Millán, Idiap-RR-78-2007

A Comparative Study of Adaptation Methods for Speaker Verification, Johnny Mariéthoz and Samy Bengio, Idiap-RR-34-2001

A comparison of noise reduction techniques for robust speech recognition, Christopher Kermorvant, Idiap-RR-10-1999

A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, Hui Liang, John Dines and Lakshmi Saheer, Idiap-RR-05-2010

A comparison of two strategies for ASR in additive noise: Missing Data and Spectral Subtraction, Christopher Kermorvant and Andrew Morris, Idiap-RR-17-1999

A Comprehensive Evaluation on Multi-channel Biometric Face Presentation Attack Detection, Anjith George, David Geissbuhler and Sébastien Marcel, Idiap-RR-02-2022

A Comprehensive Experimental and Reproducible Study on Selfie Biometrics in Multistream and Heterogeneous Settings, Guillaume Heusch, Tiago de Freitas Pereira and Sébastien Marcel, Idiap-RR-09-2019

A Data-driven Approach to Speech/Non-speech Detection, Sree Hari Krishnan Parthasarathi and Hynek Hermansky, Idiap-RR-23-2008

A Discriminative Approach for the Retrieval of Images from Text Queries, David Grangier, Florent Monay and Samy Bengio, Idiap-RR-15-2006

A Discriminative Decoder for the Recognition of Phoneme Sequences, David Grangier and Samy Bengio, Idiap-RR-67-2005

A Discriminative Kernel-based Model to Rank Images from Text Queries, David Grangier and Samy Bengio, Idiap-RR-38-2007

A Distance Model for Rhythms, Jean-François Paiement, Yves Grandvalet, Samy Bengio and Douglas Eck, Idiap-RR-33-2008

A Frequency-Domain Silence Noise Model, Guillaume Lathoud, Mathew Magimai-Doss and Bertrand Mesot, Idiap-RR-13-2005

A Generalized Dynamic Composition Algorithm of Weighted Finite State Transducers for Large Vocabulary Speech Recognition, Octavian Cheng, John Dines and Mathew Magimai-Doss, Idiap-RR-62-2006

A Generative Model for Music Transcription, A. T. Cemgil, B. Kappen and David Barber, Idiap-RR-89-2005

A Generative Model for Rhythms, Jean-François Paiement, Yves Grandvalet, Samy Bengio and Douglas Eck, Idiap-RR-70-2007

A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, Jean-François Paiement, Douglas Eck, Samy Bengio and David Barber, Idiap-RR-33-2005

A Kernel Classifier for Distributions, Alexei Pozdnoukhov and Samy Bengio, Idiap-RR-32-2005

A Kernel Trick For Sequences Applied to Text-Independent Speaker Verification Systems, Johnny Mariéthoz and Samy Bengio, Idiap-RR-77-2005

A Large-Scale Database of Images and Captions for Automatic Face Naming, Mert Ozcan, Jie Luo, Vittorio Ferrari and Barbara Caputo, Idiap-RR-26-2011

A Localization/Verification Scheme for Finding Text in Images and Video Frames Based on Contrast Independent Features and Machine Learning Methods, Datong Chen and Jean-Marc Odobez, Idiap-RR-42-2003

A MAP Approach to Noise Compensation of Speech, Philip N. Garner, Idiap-RR-08-2009

A MAP approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification, Johnny Mariéthoz, Johan Lindberg and Frédéric Bimbot, Idiap-RR-48-2000

A Meeting Browser Evaluation Test, Pierre Wellner, Mike Flynn, Simon Tucker and Steve Whittaker, Idiap-RR-53-2004

A Meeting Browser Evaluation Test, Pierre Wellner, Mike Flynn, Simon Tucker and Steve Whittaker, Idiap-RR-02-2005

A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, Daniel Gatica-Perez, Guillaume Lathoud, Iain A. McCowan and Jean-Marc Odobez, Idiap-RR-25-2003

A Multi-sample Multi-source Model for Biometric Authentication, Norman Poh, Samy Bengio and Jerzy Korczak, Idiap-RR-14-2002

A Multiple Hypothesis Gaussian Mixture Filter for Acoustic Source Localization and Tracking, Youssef Oualil, Friedrich Faubel and Dietrich Klakow, Idiap-RR-09-2012

A Multitask Learning Approach to Document Representation using Unlabeled Data, Mikaela Keller and Samy Bengio, Idiap-RR-44-2006

A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, Bastian Schnell and Philip N. Garner, Idiap-RR-10-2018

A Neural Network based Regression Approach for Recognizing Simultaneous Speech, Weifeng Li, Kenichi Kumatani, John Dines, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-10-2008

A neural network for classification with incomplete data, Andrew Morris, Idiap-RR-23-2000

A Neural Network for Text Representation, Mikaela Keller and Samy Bengio, Idiap-RR-12-2005

A Neural Network to Retrieve Images from Text Queries, David Grangier and Samy Bengio, Idiap-RR-33-2006

A New Identity for the Least-square Solution of Overdetermined Set of Linear Equations, Saeid Haghighatshoar, Mohammad J. Taghizadeh and Afsaneh Asaei, Idiap-RR-35-2015

A New Margin-Based Criterion for Efficient Gradient Descent, Ronan Collobert and Samy Bengio, Idiap-RR-16-2003

A New Method of Contrast Normalization for Verification of Extracted Video Text Having Complex Backgrounds, Datong Chen and Jean-Marc Odobez, Idiap-RR-16-2002

A new normalization technique for cursive handwritten words, Alessandro Vinciarelli and Juergen Luettin, Idiap-RR-32-2000

A New Speech Recognition Baseline System for Numbers 95 Version 1.3 Based on Torch, Johnny Mariéthoz and Samy Bengio, Idiap-RR-16-2004

A Novel Approach to Combining Client-Dependent and Confidence Information in Multimodal Biometric, Norman Poh and Samy Bengio, Idiap-RR-68-2004

A Novel Statistical Generative Model Dedicated To Face Recognition, Guillaume Heusch and Sébastien Marcel, Idiap-RR-39-2007

A Parallel Mixture of SVMs for Very Large Scale Problems, Ronan Collobert, Samy Bengio and Yoshua Bengio, Idiap-RR-12-2001

A Pragmatic View of the Application of HMM2 for ASR, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-23-2001

A Probabilistic Framework for Joint Head Tracking and Pose Estimation, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-78-2003

A Probabilistic Framework for Multiple Speaker Localization, Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel and Dietrich Klakow, Idiap-RR-37-2012

A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification, Yves Grandvalet, Johnny Mariéthoz and Samy Bengio, Idiap-RR-26-2005

A Probabilistic Model for Chord Progressions, Jean-François Paiement, Douglas Eck and Samy Bengio, Idiap-RR-57-2005

A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-35-2005

A Robust Speaker Clustering Algorithm, Jitendra Ajmera and Charles Wooters, Idiap-RR-38-2003

A Scalable Formulation of Probabilistic Linear Discriminant Analysis: Applied to Face Recognition, Laurent El Shafey, Chris McCool, Roy Wallace and Sébastien Marcel, Idiap-RR-07-2013

A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, Guillaume Lathoud and Iain A. McCowan, Idiap-RR-15-2004

A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, Guillaume Lathoud and Mathew Magimai-Doss, Idiap-RR-54-2004

A simple continuous excitation model for parametric vocoding, Philip N. Garner, Milos Cernak and Blaise Potard, Idiap-RR-03-2015

A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, Idiap-RR-36-2010

A Speech-based Just-in-Time Retrieval System using Semantic Search, Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen and Philip N. Garner, Idiap-RR-31-2011

A Stable Switching Kalman Smoother, David Barber, Idiap-RR-89-2004

A State-of-the-art Neural Network for Robust Face Verification, Sébastien Marcel, Christine Marcel and Samy Bengio, Idiap-RR-36-2002

A Statistical Significance Test for Person Authentication, Samy Bengio and Johnny Mariéthoz, Idiap-RR-83-2003

A study of phoneme and grapheme based context-dependent ASR systems, John Dines and Mathew Magimai-Doss, Idiap-RR-12-2007

A Study of the Effects of Score Normalisation Prior to Fusion in Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-69-2004

A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-10-2006

A Sub-Quadratic Exact Medoid Algorithm, James Newling and Francois Fleuret, Idiap-RR-19-2017

A supervised learning approach based on STDP and polychronization in spiking neuron networks, Hélène Paugam-Moisy, R. Martinez and Samy Bengio, Idiap-RR-54-2006

A Survey of Text Detection and Recognition in Images and Videos, Datong Chen and Juergen Luettin, Idiap-RR-38-2000

A Survey on Language Modeling using Neural Networks, Nikolaos Pappas and Thomas Meyer, Idiap-RR-32-2012

A survey on Off-Line Cursive Word Recognition, Alessandro Vinciarelli, Idiap-RR-43-2000

A Symmetric Transformation for LDA-based Face Verification, Sébastien Marcel, Idiap-RR-67-2003

A System for the Off-Line Recognition of Handwritten Text, Thomas M. Breuel, Idiap-RR-02-1994

A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, Youssef Oualil, Friedrich Faubel, Mathew Magimai-Doss and Dietrich Klakow, Idiap-RR-10-2012

A Thousand Words in a Scene, Pedro Quelhas, Jean-Marc Odobez, Daniel Gatica-Perez and Tinne Tuytelaars, Idiap-RR-40-2005

A Unified Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification, Johnny Mariéthoz and Samy Bengio, Idiap-RR-62-2004

ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, Petr Motlicek, Philip N. Garner, Namhoon Kim and Jeongmi Cho, Idiap-RR-38-2013

Acoustic and Lexical Resource Constrained ASR using Language-Independent Acoustic Model and Language-Dependent Probabilistic Lexical Model, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-02-2014

Acoustic Data-Driven Grapheme-to-Phoneme Conversion in the Probabilistic Lexical Modeling Framework, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-10-2015

Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-38-2011

Acoustic Models for Posterior Features in Speech Recognition, Guillermo Aradilla, Idiap-RR-67-2008

Acoustic-Labial Speaker Verification, Pierre Jourlin, Juergen Luettin, Dominique Genoud and H. Wassner, Idiap-RR-13-1997

Acoustico-articulatory inversion of unequal-length tube models through lattice inverse filtering, Sacha Krstulović, Idiap-RR-16-1998

Active Shape Models Using Local Binary Patterns, Jean Keomany and Sébastien Marcel, Idiap-RR-07-2006

Adaptation Experiments on French MediaParl ASR, Gyorgy Szaszak, Idiap-RR-10-2013

Adaptation of Speech and Bioacoustics Models, Eklavya Sarkar, Amir Mohammadi and Mathew Magimai-Doss, Idiap-RR-05-2025

Adaptation robuste de modèles HMM pour la vérification du locuteur dépendante du texte, Johnny Mariéthoz and Frédéric Bimbot, Idiap-RR-08-2000

Adapted Generative Models For Face Verification, Fabien Cardinaux, Conrad Sanderson and Samy Bengio, Idiap-RR-76-2003

Adaptive Beamforming with a Maximum Negentropy Criterion, Kenichi Kumatani, John McDonough, Dietrich Klakow, Philip N. Garner and Weifeng Li, Idiap-RR-06-2008

Adaptive Beamforming with a Maximum Negentropy Criterion, Kenichi Kumatani, John McDonough, Barbara Rauch, Philip N. Garner, Weifeng Li and John Dines, Idiap-RR-29-2008

Adaptive Beamforming with a Minimum Mutual Information Criterion, Kenichi Kumatani, Uwe Mayer, Tobias Gehrig, Emilian Stoimenov, John McDonough and Matthias Wölfel, Idiap-RR-74-2007

Adaptive Importance Sampling to Accelerate Training of a Neural Probabilistic Language Model, Yoshua Bengio and Jean-Sébastien Senécal, Idiap-RR-35-2003

Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, Astrid Hagen, Hervé Bourlard and Andrew Morris, Idiap-RR-05-2001

Adaptive Multilayer Optical Neural Network Design, Indu Saxena and Emile Fiesler, Idiap-RR-04-1994

Adjustable Deterministic Pseudonymization of Speech, S. Pavankumar Dubagunta, Rob Van Son and Mathew Magimai-Doss, Idiap-RR-12-2021

Advanced Spatial Data Analysis and Modelling with Support Vector Machines, Mikhail Kanevski, Alexei Pozdnoukhov, Stéphane Canu and Michel Maignan, Idiap-RR-31-2000

Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-23-2010

AGGLOMERATIVE INFORMATION BOTTLENECK FOR SPEAKER DIARIZATION OF MEETINGS DATA, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-31-2007

Alleviating Forgetfulness of Linear Attention by Hybrid Sparse Attention and Contextualized Learnable Token Eviction, Mutian He and Philip N. Garner, Idiap-RR-01-2026

AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, Petr Motlicek, Hynek Hermansky, Srikanth Madikeri, Amrutha Prasad and Sriram Ganapathy, Idiap-RR-01-2020

AMIDA/Klewel Mini-Project, Petr Motlicek, Philip N. Garner, Maël Guillemot and Vincent Bozzo, Idiap-RR-03-2010

An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, Idiap-RR-60-2006

An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, David Imseng and Gerald Friedland, Idiap-RR-02-2010

An Alternative To Silence Removal For Text-Independent Speaker Verification, Johnny Mariéthoz and Samy Bengio, Idiap-RR-51-2003

An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, Hui Liang and John Dines, Idiap-RR-16-2010

An Analysis of Rhythmic Staccato-Vocalization Based on Frequency Demodulation for Laughter Detection in Conversational Meetings, Sucheta Ghosh, Milos Cernak, Sarbani Palit and B. B. Chaudhuri, Idiap-RR-02-2016

An anomaly detection approach for backdoored neural networks: face recognition as a case study, Alexander Unnervik and Sébastien Marcel, Idiap-RR-08-2022

An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, Samy Bengio, Idiap-RR-26-2002

An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, Marco Ewerton, Sylvain Calinon and Jean-Marc Odobez, Idiap-RR-03-2021

An Auxiliary Variational Method, Felix Agakov and David Barber, Idiap-RR-86-2004

An EM Algorithm for HMMs with Emission Distributions Represented by HMMs, Samy Bengio, Hervé Bourlard and Katrin Weber, Idiap-RR-11-2000

An Empirical Model of Emphatic Word Detection, Milos Cernak and Pierre-Edouard Honnet, Idiap-RR-11-2015

AN END-TO-END NETWORK TO SYNTHESIZE INTONATION USING A GENERALIZED COMMAND RESPONSE MODEL, François Marelli, Bastian Schnell, Hervé Bourlard, T. Dutoit and Philip N. Garner, Idiap-RR-05-2019

An Implementation of Logical Analysis of Data, Endre Boros, Peter L. Hammer, Toshihide Ibaraki, Alexander Kogan, Eddy Mayoraz and Ilya Muchnik, Idiap-RR-05-1996

An Implicit Motion Likelihood for Tracking with Particle Filters, Jean-Marc Odobez, Silèye O. Ba and Daniel Gatica-Perez, Idiap-RR-15-2003

An Information Theoretic Approach to Speaker Diarization of Meeting Data, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-58-2008

An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-22-2010

AN INTEGRATED FRAMEWORK FOR MULTI-CHANNEL MULTI-SOURCE LOCALIZATION AND VOICE ACTIVITY DETECTION, Mohammad J. Taghizadeh, Philip N. Garner, Hervé Bourlard, Hamid Reza Abutalebi and Afsaneh Asaei, Idiap-RR-16-2011

An Introduction to Bayesian Network Theory and Usage, Todd Andrew Stephenson, Idiap-RR-03-2000

An Investigation of F-ratio Client-Dependent Normalisation on Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-46-2004

An Investigation of Spectral Subband Centroids for Speaker Authentication, Norman Poh, Conrad Sanderson and Samy Bengio, Idiap-RR-62-2003

An Online Audio Indexing System, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-39-2003

An Open Source Framework for Standardized Comparisons of Face Recognition Algorithms, Manuel Günther, Roy Wallace and Sébastien Marcel, Idiap-RR-29-2012

An Open-source State-of-the-art Toolbox for Broadcast News Diarization, Mickael Rouvier, Gregor Dupuy, Paul Gay, Elie Khoury, Teva Merlin and Sylvain Meignier, Idiap-RR-33-2013

An Optical Thresholding Perceptron, Indu Saxena, Perry Moerland, Emile Fiesler, A. R. Pourzand and N. Collings, Idiap-RR-16-1997

An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, Frédéric Bimbot, Mats Blomberg, Louis Boves, Gérard Chollet, Cédric Jaboulet, Bruno Jacob, Jamal Kharroubi, Johan Koolwaaij, Johan Lindberg, Johnny Mariéthoz, Chafic Mokbel and Houda Mokbel, Idiap-RR-24-1999

An RBF Network that Learns Some Aspects of Perceptual Organization, Thomas M. Breuel, Idiap-RR-10-1993

Analyse non supervisée d'activités en vidéo surveillance pour l'analyse de scène et la détection d'événements anormaux, Remi Emonet and Jean-Marc Odobez, Idiap-RR-20-2013

Analysis and Classification of EEG Signals using Probabilistic Models for Brain Computer Interfaces, Silvia Chiappa, Idiap-RR-48-2006

Analysis of CNN-based Speech Recognition System using Raw Speech as Input, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-23-2015

Analysis of Confusion Matrix to Combine Evidence for Phoneme Recognition, S. R. Mahadeva Prasanna, B. Yegnanarayana, Joel Praveen Pinto and Hynek Hermansky, Idiap-RR-27-2007

Analysis of F0 and Cepstral Features for Robust Automatic Gender Recognition, Marianna Pronobis and Mathew Magimai-Doss, Idiap-RR-30-2009

Analysis of Posterior Estimation Approaches to I-vector Extraction for Speaker Recognition, Srikanth Madikeri, Petr Motlicek, Marc Ferras and Subhadeep Dey, Idiap-RR-15-2018

Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model, S. Moeller and Hervé Bourlard, Idiap-RR-17-2001

Analyzing Flickr Groups, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-03-2008

Analyzing Group Interactions in Conversations: a Review, Daniel Gatica-Perez, Idiap-RR-63-2006

Analyzing Interactions Between Navigation Strategies Using a Computational Model of Action Selection, Laurent Dollé, Mehdi Khamassi, Benoît Girard, Agnès Guillot and Ricardo Chavarriaga, Idiap-RR-48-2008

Anti-spoofing in action: joint operation with a verification system, Ivana Chingovska, André Anjos and Sébastien Marcel, Idiap-RR-19-2013

Application of Information Retrieval Techniques to Single Writer Documents, Alessandro Vinciarelli, Idiap-RR-12-2004

Application of Information Retrieval Technologies to Presentation Slides, Alessandro Vinciarelli and Jean-Marc Odobez, Idiap-RR-36-2005

Application of Out-Of-Language Detection To Spoken-Term Detection, Petr Motlicek and Fabio Valente, Idiap-RR-04-2010

Application of Subspace Gaussian Mixture Models in Contrastive Acoustic Scenarios, Petr Motlicek, Philip N. Garner, David Imseng and Fabio Valente, Idiap-RR-20-2012

APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, Sriram Ganapathy, Samuel Thomas, Petr Motlicek and Hynek Hermansky, Idiap-RR-35-2009

Applying Attention Based Models for Detecting Cognitive Processes and Mental Health Conditions, Esaú Villatoro-Tello, Shantipriya Parida, Sajit Kumar and Petr Motlicek, Idiap-RR-01-2022

Apprentissage de prototypes de caractères à partir de l'image d'un texte manuscrit et avec l'aide d'un opérateur, Stéphane Brunet, Idiap-RR-01-1995

Approches génératives pour le traitement de séquences d'images: application à la reconnaissance dynamique des gestes de la main, Sébastien Marcel, Idiap-RR-45-2000

Approximating Optimal Morphing Attacks using Template Inversion, Laurent Colbois, Hatef Otroshi Shahreza and Sébastien Marcel, Idiap-RR-07-2023

Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, Esaú Villatoro-Tello, Gabriela Ramírez-de-la-Rosa, Héctor Jiménez-Salazar, Daniel Gatica-Perez and Mathew Magimai-Doss, Idiap-RR-19-2021

Are two Classifiers performing equally? A treatment using Bayesian Hypothesis Testing, David Barber, Idiap-RR-57-2004

Articulatory Feature based Continuous Speech Recognition using Probabilistic Lexical Modeling, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-19-2014

Artifacts of the colour coherence vector and an alternative similarity measure, Kim Shearer and Svetha Venkatesh, Idiap-RR-02-2001

Assessing Scene Structuring in Consumer Videos, Daniel Gatica-Perez, Napat Triroj, Jean-Marc Odobez, Alexander Loui and Ming-Ting Sun, Idiap-RR-11-2004

Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations, Artem Peregoudov, Alessandro Vinciarelli and Hervé Bourlard, Idiap-RR-56-2006

Assessing the Reliability of Biometric Authentication on Virtual Reality Devices, Ketan Kotwal, Gökhan Özbulak and Sébastien Marcel, Idiap-RR-04-2024

Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, Hayley Hung, Yan Huang, Chuohao Yeo and Daniel Gatica-Perez, Idiap-RR-66-2008

ASYMMETRIC FILTER FOR TEXT RECOGNITION IN VIDEO, Datong Chen and Kim Shearer, Idiap-RR-37-2000

Asynchronous detection and classification of oscillatory brain activity, Ricardo Chavarriaga, Ferran Galán and José del R. Millán, Idiap-RR-36-2008

Attacking Face Recognition with T-shirts: Database, Vulnerability Assessment and Detection, Anjith George and Sébastien Marcel, Idiap-RR-08-2023

Audio Coding Based on Long Temporal Contexts, Petr Motlicek, Hynek Hermansky, Harinath Garudadri and Naveen Srinivasamurthy, Idiap-RR-30-2006

Audio Coding Based on Long Temporal Segments: Experiments With Quantization of Excitation Signal, Vijay Ullal and Petr Motlicek, Idiap-RR-46-2006

Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, Danil Korchagin, Idiap-RR-08-2011

Audio visual speech recognition, C. Neti, G. Potamianos, Juergen Luettin, I. Matthews, Hervé Glotin, D. Vergyri, J. Sison and A. Mashari, Idiap-RR-35-2000

Audio-Video Person Clustering in Video Databases, F. Kottelat and Jean-Marc Odobez, Idiap-RR-46-2003

Audio-Visual Person Verification, Souheil Ben-Yacoub, Juergen Luettin, K. Jonsson, J. Matas and J. Kittler, Idiap-RR-18-1998

Audio-visual probabilistic tracking of multiple speakers in meetings, Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez and Iain A. McCowan, Idiap-RR-27-2005

Audio-Visual Speaker Tracking with Importance Particle Filters, Daniel Gatica-Perez, Guillaume Lathoud, Iain A. McCowan, Jean-Marc Odobez and Darren Moore, Idiap-RR-37-2002

Auto-Association by Multilayer Perceptrons and Singular Value Decomposition, Hervé Bourlard, Idiap-RR-16-2000

Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, Ramya Rasipuram, Milos Cernak, Alexandre Nanchen and Mathew Magimai-Doss, Idiap-RR-12-2015

Automatic Analysis of Multimodal Group Actions in Meetings, Iain A. McCowan, Daniel Gatica-Perez, Samy Bengio, Guillaume Lathoud, Mark Barnard and Dong Zhang, Idiap-RR-27-2003

AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, Idiap-RR-32-2020

Automatic Facial Expression Analysis: A Survey, B. Fasel and Juergen Luettin, Idiap-RR-19-1999

Automatic Out-of-Language Detection based on Confidence Measures derived from LVCSR Word and Phone Lattices, Petr Motlicek, Idiap-RR-06-2009

Automatic Social Role Recognition In Professional Meetings, A. Sapru and Hervé Bourlard, Idiap-RR-35-2012

Automatic Speech Indexing System of Bilingual Video Parliament Interventions, Gyorgy Szaszak, Milos Cernak, Philip N. Garner, Petr Motlicek, Alexandre Nanchen and Flavio Tarsetti, Idiap-RR-25-2013

Automatic Speech Recognition using Dynamic Bayesian Networks with both Acoustic and Articulatory Variables, Todd Andrew Stephenson, Hervé Bourlard, Samy Bengio and Andrew Morris, Idiap-RR-19-2000

Automatic Speech Recognition using Dynamic Bayesian Networks with the Energy as an Auxiliary Variable, Jaume Escofet and Todd Andrew Stephenson, Idiap-RR-18-2003

Automatic Speech Recognition using Pitch Information in Dynamic Bayesian Networks, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-41-2000

Automatic Speech Recognition: an Auditory Perspective, Nelson Morgan, Hervé Bourlard and Hynek Hermansky, Idiap-RR-17-1998

Automatic Temporal Alignment of AV Data, Danil Korchagin, Philip N. Garner and John Dines, Idiap-RR-39-2009

Automatic Temporal Alignment of AV Data with Confidence Estimation, Danil Korchagin, Philip N. Garner and John Dines, Idiap-RR-40-2009

Automatic Time Skew Detection and Correction, Danil Korchagin, Idiap-RR-42-2010

Automatic vs. human question answering over multimedia meeting recordings, Quoc Anh Le and Andrei Popescu-Belis, Idiap-RR-13-2009

Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, Idiap-RR-40-2008

Autoregressive Models of Amplitude Modulations in Audio Compression, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-33-2009

Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-25-2002

AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, Guillaume Lathoud, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-28-2004

Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation, Florian Mai and James Henderson, Idiap-RR-21-2021

Bagging Using the VMSE Cost Function, V. Lemaire, Idiap-RR-27-2002

Baseline System for Automatic Speech Recognition with French GlobalPhone Database, Sandrine Revaz and Milos Cernak, Idiap-RR-26-2012

Bayesian Controller for a Novel Semi-Autonomous Navigation Concept, Xavier Perrin, Ricardo Chavarriaga, Roland Siegwart and José del R. Millán, Idiap-RR-26-2007

Bayesian Factorial Linear Gaussian State-Space Models for Biosignal Decomposition, Silvia Chiappa and David Barber, Idiap-RR-84-2005

Bayesian Networks to Combine Intensity and Color Information in Face Recognition, Guillaume Heusch and Sébastien Marcel, Idiap-RR-27-2009

BEAT: An Open-Source Web-Based Open-Science Platform, André Anjos, Laurent El Shafey and Sébastien Marcel, Idiap-RR-14-2017

Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification, Corinne Fredouille, Johnny Mariéthoz, Cédric Jaboulet, Jean Hennebert, Chafic Mokbel and Frédéric Bimbot, Idiap-RR-02-2000

Benchmarking Non-Parametric Statistical Tests, Mikaela Keller, Samy Bengio and Siew Yeung Wong, Idiap-RR-38-2005

BertOdia: BERT pre-training for low resource Odia language, Shantipriya Parida, Satya Prakash Biswal, Biranchi Narayan Nayak, Mael Fabien, Esaú Villatoro-Tello and Petr Motlicek, Idiap-RR-16-2021

BERTraffic: A Robust BERT-Based Approach for Speaker Change Detection and Role Identification of Air-Traffic Communications, Juan Zuluaga-Gomez, Seyyed Saeed Sarfjoo, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek, Oliver Ohneiser and Hartmut Helmke, Idiap-RR-15-2021

Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, Petr Motlicek, Laurent El Shafey, Roy Wallace, Chris McCool and Sébastien Marcel, Idiap-RR-18-2012

Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, Elie Khoury, Laurent El Shafey, Chris McCool, Manuel Günther and Sébastien Marcel, Idiap-RR-30-2013

Bi-Modal Face and Speech Authentication: a BioLogin Demonstration System, Sébastien Marcel, Johnny Mariéthoz, Yann Rodriguez and Fabien Cardinaux, Idiap-RR-18-2006

Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, Chris McCool, Sébastien Marcel, Abdenour Hadid, Matti Pietikainen, Pavel Matejka, Jan Cernocky, Norman Poh, J. Kittler, Anthony Larcher, Christophe Levy, Driss Matrouf, Jean-François Bonastre, Phil Tresadern and Timothy Cootes, Idiap-RR-13-2012

Bias Adaptation for Vocal Tract Length Normalization, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, Idiap-RR-12-2013

Biometric Person Authentication IS A Multiple Classifier Problem, Samy Bengio and Johnny Mariéthoz, Idiap-RR-03-2007

Biometrics Evaluation under Spoofing Attacks, Ivana Chingovska, André Anjos and Sébastien Marcel, Idiap-RR-12-2014

Bob: a free signal processing and machine learning toolbox for researchers, André Anjos, Laurent El Shafey, Roy Wallace, Manuel Günther, Chris McCool and Sébastien Marcel, Idiap-RR-25-2012

Boosting HMMs with an application to speech recognition, Christos Dimitrakakis and Samy Bengio, Idiap-RR-41-2003

Boosting Pixel-based Classifiers for Face Verification, Yann Rodriguez and Sébastien Marcel, Idiap-RR-65-2003

Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, Idiap-RR-15-2012

Boosting word error rates, Christos Dimitrakakis and Samy Bengio, Idiap-RR-49-2004

Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, Anjith George and Sébastien Marcel, Idiap-RR-09-2023

BROADBAND BEAMPATTERN FOR MULTI-CHANNEL SPEECH ACQUISITION AND DISTANT SPEECH RECOGNITION, Mohammad J. Taghizadeh, Philip N. Garner and Hervé Bourlard, Idiap-RR-39-2011

Broadcast Media Content Categorization Using Low-Resolution Concepts, Esaú Villatoro-Tello, Shantipriya Parida, Petr Motlicek, Subhadeep Dey and Qingran Zhan, Idiap-RR-06-2021

Broadcast News Story Segmentation Using Social Network Analysis and Hidden Markov Models, Alessandro Vinciarelli and Sarah Favre, Idiap-RR-30-2007

Browsing Recorded Meetings with Ferret, Pierre Wellner, Mike Flynn and Maël Guillemot, Idiap-RR-32-2004

Calibration from statistical properties of the visual world, Etienne Grossmann, José António Gaspar and Francesco Orabona, Idiap-RR-63-2008

Can a Professional Imitator Fool a GMM-Based Speaker Verification System?, Johnny Mariéthoz and Samy Bengio, Idiap-RR-61-2005

Can Chimeric Persons Be Used in Multimodal Biometric Authentication Experiments?, Norman Poh and Samy Bengio, Idiap-RR-20-2005

Can Your Face Detector Do Anti-spoofing? Face Presentation Attack Detection with a Multi-Channel Face Detector, Anjith George and Sébastien Marcel, Idiap-RR-12-2020

CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, Florian Mai, Lukas Galke and Ansgar Scherp, Idiap-RR-06-2019

Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition, Philip N. Garner, Idiap-RR-15-2011

CHALLENGES IN BROADCAST MEDIA CONTENT CATEGORIZATION, Shantipriya Parida, Esaú Villatoro-Tello and Petr Motlicek, Idiap-RR-02-2021

Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition; Comparison with the Envelope-Variance Measure, Ivan Himawan, Petr Motlicek, Sridha Sridharan, David Dean and Dian Tjondronegoro, Idiap-RR-30-2015

Characterisation of voice quality of Parkinson's disease using differential phonological posterior features, Milos Cernak, Juan Rafael Orozco-Arroyave, Frank Rudzicz, Heidi Christensen, Juan Camilo Vasquez-Correa and Elmar Nöth, Idiap-RR-16-2017

Characterizing the EEG Correlates of Exploratory Behavior, Nicolas Bourdaud, Ricardo Chavarriaga, Ferran Galán and José del R. Millán, Idiap-RR-28-2008

Chord Representations for Probabilistic Models, Jean-François Paiement, Douglas Eck and Samy Bengio, Idiap-RR-58-2005

Classifying Materials in the Real World, Barbara Caputo, Eric Hayman, Mario Fritz and Jan-Olof Eklundh, Idiap-RR-69-2007

CLEF2008 Image Annotation Task: an SVM Confidence-Based Approach, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, Idiap-RR-77-2008

CLIENT / WORLD MODEL SYNCHRONOUS ALIGNEMENT FOR SPEAKER VERIFICATION, Johnny Mariéthoz, Dominique Genoud, Frédéric Bimbot and Chafic Mokbel, Idiap-RR-23-1999

Client Dependent GMM-SVM Models for Speaker Verification, Quan Le and Samy Bengio, Idiap-RR-03-2003

Clustering And Segmenting Speakers And Their Locations In Meetings, Jitendra Ajmera, Guillaume Lathoud and Iain A. McCowan, Idiap-RR-55-2003

ClusterRank: A Graph Based Method for Meeting Summarization, Nikhil Garg, Benoit Favre, Korbinian Riedhammer and Dilek Hakkani Tür, Idiap-RR-09-2009

CNN Patch Pooling for Detecting 3D Mask Presentation Attacks in NIR, Ketan Kotwal and Sébastien Marcel, Idiap-RR-10-2020

Co-occurrence Models for Image Annotation and Retrieval, Nikhil Garg, Idiap-RR-22-2009

Cognitive speech coding, Milos Cernak and Afsaneh Asaei, Idiap-RR-27-2016

Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, Fabio Valente and Hynek Hermansky, Idiap-RR-61-2006

COMBINATION OF AGGLOMERATIVE AND SEQUENTIAL CLUSTERING FOR SPEAKER DIARIZATION, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-51-2007

Combinatorial Approach for Data Binarization, Eddy Mayoraz and Miguel Moreira, Idiap-RR-08-1999

Combined 5x2cv F-Test for Comparing Supervised Classification Learning Algorithms, Ethem Alpaydin, Idiap-RR-04-1998

Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, Joel Praveen Pinto and Hynek Hermansky, Idiap-RR-20-2008

Combining Linear Dichotomizers to Construct Nonlinear Polychotomizers, Ethem Alpaydin and Eddy Mayoraz, Idiap-RR-05-1998

Combining methods to improve speaker verification decision, Dominique Genoud, Guillaume Gravier, Frédéric Bimbot and Gérard Chollet, Idiap-RR-02-1996

Combining multiple tracking algorithms for improved general performance, Kim Shearer, Kirrily D Wong and Svetha Venkatesh, Idiap-RR-13-2000

Combining Neural Gas and Learning Vector Quantization for Cursive Character Recognition, Francesco Camastra and Alessandro Vinciarelli, Idiap-RR-18-2001

COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, Idiap-RR-17-2015

Combining the SNR Spectrum with a Cochlear Model, Philip N. Garner, Idiap-RR-14-2018

Combining Vocal Tract Length Normalization with Linear Transformations in a Bayesian Framework, Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner and John Dines, Idiap-RR-11-2012

Combining Wavelet-domain Hidden Markov Trees with Hidden Markov Models, Katrin Keller, Souheil Ben-Yacoub and Chafic Mokbel, Idiap-RR-14-1999

Comparative Study on Sentence Boundary Prediction for German and English Broadcast News, Yang Wang, Alexandre Nanchen, Alexandros Lazaridis, David Imseng and Philip N. Garner, Idiap-RR-18-2017

Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, Idiap-RR-04-2021

Comparing different acoustic modeling techniques for multilingual boosting, David Imseng, John Dines, Petr Motlicek, Philip N. Garner and Hervé Bourlard, Idiap-RR-01-2013

Comparing Different Word Lattice Rescoring Approaches Towards Keyword Spotting, Joel Praveen Pinto, Hervé Bourlard, Zacharie De Greve and Hynek Hermansky, Idiap-RR-32-2007

Comparing meeting browsers using a task-based evaluation method, Andrei Popescu-Belis, Idiap-RR-11-2009

Comparison and Combination of Features in a Hybrid HMM/MLP and a HMM/GMM Speech Recognition System, Pere Pujol, Susagna Pol, Climent Nadeu, Astrid Hagen and Hervé Bourlard, Idiap-RR-48-2003

Comparison of Client Model Adaptation Schemes, Samy Bengio and Johnny Mariéthoz, Idiap-RR-25-2001

Comparison of HMM experts with MLP experts in the Full Combination Multi-Band Approach to Robust ASR, Astrid Hagen and Andrew Morris, Idiap-RR-21-2000

Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS, Fabien Cardinaux, Conrad Sanderson and Sébastien Marcel, Idiap-RR-10-2003

Comparison of Subword Segmentation Methods for Open-vocabulary ASR using a Difficulty Metric, Abbas Khosravani, Claudiu Musat, Philip N. Garner and Alexandros Lazaridis

COMPARISON OF SUBWORD SEGMENTATION METHODS FOR OPEN-VOCABULARY END-TO-END SPEECH RECOGNITION, Abbas Khosravani, Claudiu Musat, Philip N. Garner and Alexandros Lazaridis, Idiap-RR-34-2020

Comparison of Support Vector Machine and Neural Network for Text Texture Verification, Datong Chen and Jean-Marc Odobez, Idiap-RR-19-2002

Compensating User-Specific Information with User-Independent Information in Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-44-2005

Composite Kernel Learning, Marie Szafranski, Yves Grandvalet and Alain Rakotomamonjy, Idiap-RR-59-2008

Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, Milos Cernak, Alexandros Lazaridis, Afsaneh Asaei and Philip N. Garner, Idiap-RR-11-2016

Conditional Gaussian Mixture Models for Environmental Risk Mapping, Nicolas Gilardi, Samy Bengio and Mikhail Kanevski, Idiap-RR-12-2002

Conditional Gaussian Mixtures, Todd Andrew Stephenson, Idiap-RR-11-2003

Confidence Evaluation for Risk Prediction, Nicolas Gilardi, Tom Melluish and Michel Maignan, Idiap-RR-22-2001

Confidence Measures for Multimodal Identity Verification, Samy Bengio, Christine Marcel, Sébastien Marcel and Johnny Mariéthoz, Idiap-RR-38-2001

Confidence Measures in Multiple pronunciations Modeling For Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-53-2003

Confidence-based Cue Integration for Visual Place Recognition, Andrzej Pronobis and Barbara Caputo, Idiap-RR-17-2007

Confusion matrix based posterior probabilities correction, Andrew Morris and Hemant Misra, Idiap-RR-53-2002

Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, Xiao Pu, Laura Mascarell and Andrei Popescu-Belis, Idiap-RR-08-2017

Constructing visual models with a latent space approach, Florent Monay, Pedro Quelhas, Daniel Gatica-Perez and Jean-Marc Odobez, Idiap-RR-14-2005

Construction and comparison of approximations for switching linear gaussian state space models, David Barber, Idiap-RR-71-2005

Construction and comparison of approximations for switching linear gaussian state space models, David Barber and Bertrand Mesot, Idiap-RR-06-2005

CONTENT NORMALIZATION FOR TEXT-INDEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Marc Ferras, Petr Motlicek and Srikanth Madikeri, Idiap-RR-31-2017

Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Amrutha Prasad, Petr Motlicek, Karel Vesely, Martin Kocour and Igor Szoke, Idiap-RR-14-2021

Continuous Audio-Visual Speech Recognition, Juergen Luettin and Stéphane Dupont, Idiap-RR-02-1998

Continuous Brain-Actuated Control of an Intelligent Wheelchair by Human EEG, Ferran Galán, Marnix Nuttin, Dirk Vanhooydonck, Eileen Lew, Pierre W. Ferrez, Johan Philips and José del R. Millán, Idiap-RR-53-2008

Continuous Microphone Array Speech Recognition on Wall Street Journal Corpus, Hari Krishna Maganti, Jithendra Vepa and Hervé Bourlard, Idiap-RR-47-2005

Continuous Speech Recognition using Boosted Binary Features, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-35-2011

Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, Gwénolé Lecorvé and Petr Motlicek, Idiap-RR-21-2012

Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-18-2014

Convolutional Pitch Target Approximation Model for Speech Synthesis, Xingyu Na and Philip N. Garner, Idiap-RR-05-2013

Cross-database evaluation of audio-based spoofing detection systems, Pavel Korshunov and Sébastien Marcel, Idiap-RR-23-2016

Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features, Qingran Zhan, Shixuan Du, Petr Motlicek, Yahui Shan and Xiang Xie, Idiap-RR-05-2021

Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech, Mirjam Wester and Hui Liang, Idiap-RR-18-2011

Cross-lingual Transfer for News Article Labeling: Benchmarking Statistical and Neural Models, Khalil Mrini, Nikolaos Pappas and Andrei Popescu-Belis, Idiap-RR-26-2017

Cross-pollination of normalisation techniques from speaker to face authentication using Gaussian mixture models, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, Idiap-RR-03-2012

Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, Petr Motlicek, David Imseng and Philip N. Garner, Idiap-RR-39-2013

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, Anindya Roy and Sébastien Marcel, Idiap-RR-13-2010

Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition, Francesco Camastra, Marco Spinetti and Alessandro Vinciarelli, Idiap-RR-79-2005

Cursive Character Recognition by Learning Vector Quantization, Francesco Camastra and Alessandro Vinciarelli, Idiap-RR-47-2000

Daily Routine Classification from Mobile Phone Data, Katayoun Farrahi and Daniel Gatica-Perez, Idiap-RR-62-2007

Data binarization by discriminant elimination, Miguel Moreira, Alain Hertz and Eddy Mayoraz, Idiap-RR-04-1999

Data utility modelling for mismatch reduction, Andrew Morris, Idiap-RR-30-2001

Data-Driven Enhancement of State Mapping-Based Cross-Lingual Speaker Adaptation, Hui Liang, Idiap-RR-38-2012

Data-Driven Movement Subunit Extraction from Skeleton Information for Modeling Signs and Gestures, Sandrine Tornay, Marzieh Razavi and Mathew Magimai-Doss, Idiap-RR-02-2019

Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, Norman Poh and Samy Bengio, Idiap-RR-44-2004

Decision fusion in a multi-modal identity verification system using a multi-linear classifier, Patrick Verlinde, Gilbert Maître and Eddy Mayoraz, Idiap-RR-06-1997

DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Srikanth Madikeri, Marc Ferras and Petr Motlicek, Idiap-RR-08-2016

Deep Neural Networks for Multiple Speaker Detection and Localization, Weipeng He, Petr Motlicek and Jean-Marc Odobez, Idiap-RR-02-2018

Deep Privacy Funnel Model: From a Discriminative to a Generative Approach with an Application to Face Recognition, Behrooz Razeghi, Parsa Rahimi and Sébastien Marcel, Idiap-RR-02-2024

Deepfake detection: humans vs. machines, Pavel Korshunov and Sébastien Marcel, Idiap-RR-36-2020

DeepFakes: a New Threat to Face Recognition? Assessment and Detection, Pavel Korshunov and Sébastien Marcel, Idiap-RR-18-2018

Définition et évaluation d'un protocole de négociation dans un système multi-agents de reconnaissance de la parole, Murielle Vial, Idiap-RR-02-1995

Designing second order recurrent neural networks for prosody modelling, François Marelli, Idiap-RR-16-2018

Detecting Abandoned Luggage Items in a Public Space, Kevin C. Smith, Pedro Quelhas and Daniel Gatica-Perez, Idiap-RR-39-2006

Detecting Group Interest-level in Meetings, Daniel Gatica-Perez, Iain A. McCowan, Dong Zhang and Samy Bengio, Idiap-RR-51-2004

Detecting Intentional Mental Transitions in an Asynchronous BCI, Ferran Galán, Francesc Oliva, Joan Guàrdia, Pierre W. Ferrez and José del R. Millán, Idiap-RR-43-2006

Detecting queues at vending machines: a statistical layered approach, Xavier Naturel and Jean-Marc Odobez, Idiap-RR-04-2008

Detection and Application of Influence Rankings in Small Group Meetings, Rutger Rienks, Dong Zhang, Daniel Gatica-Perez and Wilfried Post, Idiap-RR-49-2006

Detection and Recognition of Number Sequences in Spoken Utterances, Guillermo Aradilla and Jitendra Ajmera, Idiap-RR-42-2007

Detection of Narrative Structure for Annotation of News Broadcasts, Kim Shearer, Chitra Dorai and Svetha Venkatesh, Idiap-RR-03-2001

Developing and Enhancing Posterior Based Speech Recognition Systems, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, Idiap-RR-23-2005

Development of Bilingual ASR System for MediaParl Corpus, Petr Motlicek, David Imseng, Milos Cernak and Namhoon Kim, Idiap-RR-21-2014

Dichotomy Between Clustering Performance and Minimum Distortion in Piecewise-Dependent-Data (PDD) Clustering, I. Lapidot and H. Guterman, Idiap-RR-48-2002

Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics, John Dines and Jithendra Vepa, Idiap-RR-13-2007

Discovering Human Routines from Cell Phone Data with Topic Models, Katayoun Farrahi and Daniel Gatica-Perez, Idiap-RR-32-2008

Discrete All-Positive Multilayer Perceptrons for Optical Implementation, Perry Moerland, Emile Fiesler and Indu Saxena, Idiap-RR-02-1997

Discriminant linear processing of time-frequency plane, Fabio Valente and Hynek Hermansky, Idiap-RR-20-2006

Discriminative Cue Integration for Medical Image Annotation, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, Idiap-RR-64-2007

Discriminative Kernel-Based Phoneme Sequence Recognition, Joseph Keshet, Samy Bengio, Dan Chazan, Shai Shalev-Shwartz and Yoram Singer, Idiap-RR-14-2006

Discriminative Keyword Spotting, Joseph Keshet, David Grangier and Samy Bengio, Idiap-RR-31-2008

Discriminant Models for Text-independent Speaker Verification, Johnny Mariéthoz, Idiap-RR-70-2006

DNN based speaker embedding using content information for text-dependent speaker verification, Subhadeep Dey, Takafumi Koshinaka, Petr Motlicek and Srikanth Madikeri, Idiap-RR-06-2018

Domain Adaptation and Investigation of Robustness of DNN-based Embeddings for Text-Independent Speaker Verification Using Dilated Residual Networks, Seyyed Saeed Sarfjoo, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-10-2019

DynaBoost: Combining Boosted Hypotheses in a Dynamic Way, Perry Moerland and Eddy Mayoraz, Idiap-RR-09-1999

Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables, Todd Andrew Stephenson, Jaume Escofet, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-24-2002

Dynamical Dirichlet Mixture Model, Le Chen, David Barber and Jean-Marc Odobez, Idiap-RR-02-2007

EdgeDoc: Hybrid CNN-Transformer Model for Accurate Forgery Detection and Localization in ID Documents, Anjith George and Sébastien Marcel, Idiap-RR-08-2025

EdgeFace: Efficient Face Recognition Model for Edge Devices, Anjith George, Christophe Ecabert, Hatef Otroshi Shahreza, Ketan Kotwal and Sébastien Marcel, Idiap-RR-01-2024

EEG Classification using Generative Independent Component Analysis, Silvia Chiappa and David Barber, Idiap-RR-77-2004

EEG pattern recognition through multi-stream evidence combination, Andrew Morris, Bernhard Obermaier and Gert Pfurtscheller, Idiap-RR-31-2001

EEG-based BCI Systems and IDIAP EEG Database, Silvia Chiappa and José del R. Millán, Idiap-RR-64-2003

EER of Fixed and Trainable Fusion Classifiers: A Theoretical Study with Application to Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-01-2005

Effect of Recognition Errors on Information Retrieval Performance, Alessandro Vinciarelli, Idiap-RR-08-2004

Effect of Recognition Errors on Text Clustering, David Grangier and Alessandro Vinciarelli, Idiap-RR-82-2004

Effect of Segmentation Method on Video Retrieval Performance, David Grangier and Alessandro Vinciarelli, Idiap-RR-83-2004

Effective post-processing for single-channel frequency-domain speech enhancement, Weifeng Li, Idiap-RR-71-2007

Efficient Diffusion-based Illumination Normalization for Face Verification, Guillaume Heusch, Fabien Cardinaux and Sébastien Marcel, Idiap-RR-46-2005

Efficient Kalman Smoothing for Harmonic State-Space Models, David Barber, Idiap-RR-87-2005

Efficient Wind Speed Nowcasting with GPU-Accelerated Nearest Neighbors Algorithm, Arnaud Pannatier, Ricardo Picatoste and Francois Fleuret, Idiap-RR-05-2022

Eight Years of Face Recognition Research: Reproducibility, Achievements and Open Issues, Tiago de Freitas Pereira, Dominic Schmidli, Yu Linghu, Xinyi Zhang, Sébastien Marcel and Manuel Günther, Idiap-RR-09-2022

Embedding Motion in Model-Based Stochastic Tracking, Jean-Marc Odobez, Daniel Gatica-Perez and Silèye O. Ba, Idiap-RR-72-2003

EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, Alexandre Nanchen and Philip N. Garner, Idiap-RR-01-2019

EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, Petr Motlicek, Subhadeep Dey, Srikanth Madikeri and Lukas Burget, Idiap-RR-16-2015

Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, G. S. V. S. Sivaram and Hynek Hermansky, Idiap-RR-24-2008

End-to-end Accented Speech Recognition, Thibault Viglino, Petr Motlicek and Milos Cernak, Idiap-RR-04-2022

End-to-End Acoustic Modeling using Convolutional Neural Networks for Automatic Speech Recognition, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-18-2016

End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks, Dimitri Palaz, Ronan Collobert and Mathew Magimai-Doss, Idiap-RR-40-2013

English Spoken Term Detection in Multilingual Recordings, Petr Motlicek, Fabio Valente and Philip N. Garner, Idiap-RR-21-2010

Enhanced Phone Posteriors for Improving Speech Recognition Systems, Hamed Ketabdar and Hervé Bourlard, Idiap-RR-39-2008

Enhancing Speaker Diarization using Correlation-Based Clustering Initialization, Pradeep Rangappa, Amrutha Prasad, Srikanth Madikeri and Petr Motlicek, Idiap-RR-09-2025

Enhancing State Mapping-Based Cross-Lingual Speaker Adaptation using Phonological Knowledge in a Data-Driven Manner, Hui Liang and John Dines, Idiap-RR-08-2013

Entropy Based Combination of Tandem Representations for Noise Robust ASR, Shajith Ikbal, Hemant Misra, Sunil Sivadas, Hynek Hermansky and Hervé Bourlard, Idiap-RR-19-2004

Entropy coding of Quantized Spectral Components in FDLP audio codec, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-71-2008

Entropy-based Multi-stream Combination, Hemant Misra, Hervé Bourlard and Vivek Tyagi, Idiap-RR-31-2002

Environmental Data Mapping with Support Vector Regression and Geostatistics, Mikhail Kanevski, Patrick Wong and Stéphane Canu, Idiap-RR-10-2000

Environmental spatial data classification with Support Vector Machines, Mikhail Kanevski, Nicolas Gilardi, Eddy Mayoraz and Michel Maignan, Idiap-RR-07-1999

Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, Astrid Hagen and Hervé Bourlard, Idiap-RR-10-2001

Estimates of Parameter Distributions for Optimal Action Selection, Christos Dimitrakakis and Samy Bengio, Idiap-RR-72-2004

Estimating Breathing Pattern from Raw Speech Waveform and Short-term Speech Spectrum using Neural Networks, Zohreh Mostaani, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik and Mathew Magimai-Doss, Idiap-RR-12-2024

Estimating Cohesion in Small Groups using Audio-Visual Nonverbal Behavior, Hayley Hung and Daniel Gatica-Perez, Idiap-RR-12-2010

Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, Dimitri Palaz, Ronan Collobert and Mathew Magimai-Doss, Idiap-RR-13-2013

Estimating the Confidence Interval of Expected Performance Curve in Biometric Authentication Using Joint Bootstrap, Norman Poh and Samy Bengio, Idiap-RR-25-2006

Estimating The Degree of Sleepiness by Integrating Articulatory Feature Knowledge In Raw Waveform Based CNNs, Julian Fritsch, S. Pavankumar Dubagunta and Mathew Magimai-Doss, Idiap-RR-06-2020

ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIONS USING SPEAKER DIARIZATION STRATEGIES, Hayley Hung, Daniel Gatica-Perez, Yan Huang and Gerald Friedland, Idiap-RR-60-2007

Estimating the Intrinsic Dimension of Data with a Fractal-Based Method, Francesco Camastra and Alessandro Vinciarelli, Idiap-RR-02-2002

Estimating the Quality of Face Localization for Face Verification, Yann Rodriguez, Fabien Cardinaux, Samy Bengio and Johnny Mariéthoz, Idiap-RR-07-2004

Estimation of Conditional Distributions using Gaussian Mixture Models, Nicolas Gilardi, Samy Bengio and Mikhail Kanevski, Idiap-RR-03-2002

Evaluating Attention Networks for Anaphora Resolution, Jonathan Pilault, Nikolaos Pappas, Lesly Miculicich and Andrei Popescu-Belis, Idiap-RR-27-2017

Evaluating the Complexity of Databases for Person Identification and Verification, Georg Thimm, Souheil Ben-Yacoub and Juergen Luettin, Idiap-RR-10-1998

Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-01-2010

Evaluation of Biometric Technology on XM2VTS, Samy Bengio, Johnny Mariéthoz and Sébastien Marcel, Idiap-RR-21-2001

Evaluation of Formant-Like Features for ASR, Katrin Weber, F. de Wet, B. Cranen, Louis Boves, Samy Bengio and Hervé Bourlard, Idiap-RR-04-2002

Evaluation of formant-like features for automatic speech recognition, F. de Wet, Katrin Weber, Louis Boves, B. Cranen, Samy Bengio and Hervé Bourlard, Idiap-RR-08-2003

Evaluation of Multiple Cues Head Pose Tracking Algorithm in Indoor Environments, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-05-2005

Evaluation of SVM Binary Classification with Nonparametric Stochastic Simulations, Mikhail Kanevski, Idiap-RR-07-2001

Evaluation Protocols and Comparative Results for the Triesch Hand Posture Database, Sébastien Marcel, Idiap-RR-50-2002

Evidences of Equal Error Rate Reduction in Biometric Authentication Fusion, Norman Poh and Samy Bengio, Idiap-RR-43-2004

Exemplar-based Sparse Representation for Posterior Features, Sara Bahaadini, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-11-2014

Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, Esaú Villatoro-Tello, Srikanth Madikeri, Petr Motlicek, Aravind Ganapathiraju and Alexei V. Ivanov, Idiap-RR-06-2022

Experimental Protocol on the BANCA Database, Samy Bengio, Frédéric Bimbot, Johnny Mariéthoz, Vlad Popovici, F. Porée, E. Bailly-Baillière, G. Matas and B. Ruiz, Idiap-RR-05-2002

Experiments with robust similarity measures for OCR, Gilbert Maître, Idiap-RR-03-1995

Explicit Suggestion of Query Terms for News Search using Topic Models and Word Embeddings, Parvaz Mahdabi and Andrei Popescu-Belis, Idiap-RR-21-2016

Exploiting Contextual Information for Improved Phoneme Recognition, Joel Praveen Pinto, B. Yegnanarayana, Hynek Hermansky and Mathew Magimai-Doss, Idiap-RR-65-2007

Exploiting contextual information for speech/non-speech detection, Sree Hari Krishnan Parthasarathi and Hynek Hermansky, Idiap-RR-22-2008

Exploiting foreign resources for DNN-based ASR, Petr Motlicek, David Imseng, Blaise Potard, Philip N. Garner and Ivan Himawan, Idiap-RR-27-2015

Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, Alexandre Heili, Adolfo Lopez Mendez and Jean-Marc Odobez, Idiap-RR-06-2014

Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, Alexandre Heili, Adolfo Lopez-Mendez and Jean-Marc Odobez, Idiap-RR-05-2014

Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, Stefan Duffner and Jean-Marc Odobez, Idiap-RR-01-2011

Exploiting Phoneme Similarities in Hybrid HMM-ANN Keyword Spotting, Joel Praveen Pinto, Andrew Lovitt and Hynek Hermansky, Idiap-RR-11-2007

EXPLOITING SEQUENCE INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION, Subhadeep Dey, Petr Motlicek, Srikanth Madikeri and Marc Ferras, Idiap-RR-04-2017

Exploiting temporal context for speech/non-speech detection, Sree Hari Krishnan Parthasarathi, Petr Motlicek and Hynek Hermansky, Idiap-RR-21-2008

Exploring Contextual Information in a Layered Framework for Group Action Recognition, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, Idiap-RR-41-2006

Extended BIC Criterion for Model Selection, I. Lapidot and Andrew Morris, Idiap-RR-42-2002

Extracting Information from Multimedia Meeting Collections, Daniel Gatica-Perez, Dong Zhang and Samy Bengio, Idiap-RR-50-2005

Extractive Odia Text Summarization System: An OCR based Approach, Shantipriya Parida, Idiap-RR-02-2020

EYEDIAP Database: Data Description and Gaze Tracking Evaluation Benchmarks, Kenneth Alberto Funes Mora, Florent Monay and Jean-Marc Odobez, Idiap-RR-08-2014

Face Authentication Based on Local Features and Generative Models, Fabien Cardinaux, Idiap-RR-85-2005

Face Authentication Using Adapted Local Binary Pattern Histograms, Yann Rodriguez and Sébastien Marcel, Idiap-RR-06-2006

Face Authentication using Client-specific Matching Pursuit, Sébastien Marcel, P. Jost, P. Vandergheynst and Jean-Philippe Thiran, Idiap-RR-78-2004

Face Authentication with Salient Local Features and Static Bayesian Network, Guillaume Heusch and Sébastien Marcel, Idiap-RR-04-2007

Face Detection and Verification using Local Binary Patterns, Yann Rodriguez, Idiap-RR-79-2006

Face detection using boosted Jaccard distance-based regression, Cosmin Atanasoaei, Chris McCool and Sébastien Marcel, Idiap-RR-02-2012

Face Processing & Frontal Face Verification, Conrad Sanderson, Idiap-RR-20-2003

Face Recognition Systems Under Spoofing Attacks, Ivana Chingovska, Nesli Erdogmus, André Anjos and Sébastien Marcel, Idiap-RR-18-2020

Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, Laurent El Shafey, Roy Wallace and Sébastien Marcel, Idiap-RR-37-2011

Face Verification using LDA and MLP on the BANCA database, Sébastien Marcel, Idiap-RR-66-2003

Face Verification using MLP and SVM, Fabien Cardinaux and Sébastien Marcel, Idiap-RR-21-2002

Face Verification Using Synthesized Non-Frontal Models, Conrad Sanderson and Samy Bengio, Idiap-RR-60-2003

Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, B. Fasel, Idiap-RR-49-2001

Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, David Imseng, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-01-2012

Fast Approximate Spoken Term Detection from Sequence of Phonemes, Joel Praveen Pinto, Igor Szoke, S. R. Mahadeva Prasanna and Hynek Hermansky, Idiap-RR-45-2008

Fast Bounding Box Estimation based Face Detection, Venkatesh Bala Subburaman and Sébastien Marcel, Idiap-RR-38-2010

Fast Human Detection from Videos Using Covariance Features, Jian Yao and Jean-Marc Odobez, Idiap-RR-68-2007

Fast Human Detection in Videos using Joint Appearance and Foreground Learning from Covariances of Image Feature Subsets, Jian Yao and Jean-Marc Odobez, Idiap-RR-19-2009

Fast K-Means with Accurate Bounds, James Newling and Francois Fleuret, Idiap-RR-17-2016

Fast latent semantic indexing of spoken documents by using self-organizing maps, Mikko Kurimo, Idiap-RR-20-1999

Fast Object Detection using MLP and FFT, Souheil Ben-Yacoub, Idiap-RR-11-1997

Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, Thorbecke Iuliia, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Shashi Kumar, Pradeep Rangappa, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, Idiap-RR-10-2024

FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, Petr Motlicek, Daniel Povey and Martin Karafiat, Idiap-RR-37-2013

Feature Extraction for Multi-class BCI using Canonical Variates Analysis, Ferran Galán, Pierre W. Ferrez, Francesc Oliva, Joan Guàrdia and José del R. Millán, Idiap-RR-23-2007

Feature Mapping of Multiple Beamformed Sources for Robust Overlapping Speech Recognition Using a Microphone Array, Weifeng Li, Longbiao Wang, Yicong Zhou, John Dines, Mathew Magimai-Doss, Hervé Bourlard and Qingmin Liao, Idiap-RR-17-2014

Feature mapping using far-field microphones for distant speech recognition, Ivan Himawan, Petr Motlicek, David Imseng and Sridha Sridharan, Idiap-RR-20-2016

Feature Representations for Automatic Meerkat Vocalization Classification, Imen Ben Mahmoud, Eklavya Sarkar, Marta Manser and Mathew Magimai-Doss, Idiap-RR-06-2024

Filter Bank Design based on Minimization of Individual Aliasing Terms for Minimum Mutual Information Subband Adaptive Beamforming, Kenichi Kumatani, John McDonough, Stefan Schacht, Dietrich Klakow, Philip N. Garner and Weifeng Li, Idiap-RR-77-2007

Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition, Kenichi Kumatani, John McDonough, Stefan Schacht, Dietrich Klakow, Philip N. Garner and Weifeng Li, Idiap-RR-02-2008

Finding groups of people in Google news, Dhiraj Joshi and Daniel Gatica-Perez, Idiap-RR-68-2005

Finding Information in Multimedia Records of Meetings, Andrei Popescu-Belis, Denis Lalanne and Hervé Bourlard, Idiap-RR-32-2011

Finding Lines under Bounded Error, Thomas M. Breuel, Idiap-RR-11-1993

Finding Structure in Consumer Videos by Probabilistic Hierarchical Clustering, Daniel Gatica-Perez, Alexander Loui and Ming-Ting Sun, Idiap-RR-22-2002

Flickr Groups: Multimedia Communities for Multimedia Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-18-2010

From missing data to maybe useful data: soft data modelling for noise robust ASR, Andrew Morris, Jon Barker and Hervé Bourlard, Idiap-RR-06-2001

From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, Astrid Hagen, Andrew Morris and Hervé Bourlard, Idiap-RR-20-2000

From Research to Reality: Evaluation of a Single-Computer Real-Time LVCSR System for Speech-Based Retrieval, Andrei Popescu-Belis, Maryam Habibi, Philip N. Garner and Nan Li, Idiap-RR-12-2017

From Samples to Objects in Kernel Methods, Alexei Pozdnoukhov and Samy Bengio, Idiap-RR-29-2003

Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, Sriram Ganapathy, Samuel Thomas and Hynek Hermansky, Idiap-RR-17-2008

Further Applications of Sector-Based Detection and Short-Term Clustering, Guillaume Lathoud, Idiap-RR-26-2006

Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, Oya Aran and Daniel Gatica-Perez, Idiap-RR-17-2010

Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, Elie Khoury, Paul Gay and Jean-Marc Odobez, Idiap-RR-31-2013

Fusion of Face and Speech Data for Person Identity Verification, Souheil Ben-Yacoub, Yousri Abdeljaoued and Eddy Mayoraz, Idiap-RR-03-1999

Generative Temporal ICA for Classification in Asynchronous BCI Systems, Silvia Chiappa and David Barber, Idiap-RR-08-2005

Geometric Matching in Computer Vision--Algorithms and Open Problems, Thomas M. Breuel, Idiap-RR-07-1993

German News Article Classification : A Multichannel CNN Approach, Shantipriya Parida, Petr Motlicek and Satya Ranjan Dash, Idiap-RR-09-2020

Gestures for Multi-Modal Interfaces: A Review, Sébastien Marcel, Idiap-RR-34-2002

Gradient Alignment in Deep Neural Networks, Suraj Srinivas and Francois Fleuret, Idiap-RR-14-2020

Gradient estimates of return, Christos Dimitrakakis and Samy Bengio, Idiap-RR-29-2005

Gradient-based spectral visualization of CNNs using raw waveforms, Hannah Muckenhirn, Vinayak Abrol, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-11-2018

Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR, Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Oliver Ohneiser, Hartmut Helmke, Seyyed Saeed Sarfjoo and Nigmatulina Iuliia, Idiap-RR-22-2021

Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, Mael Fabien, Seyyed Saeed Sarfjoo, Srikanth Madikeri and Petr Motlicek, Idiap-RR-01-2023

Grapheme and Multilingual Posterior Features For Under-Resource Speech Recognition: A Study on Scottish Gaelic, Ramya Rasipuram, Peter Bell and Mathew Magimai-Doss, Idiap-RR-34-2012

Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, Anindya Roy and Sébastien Marcel, Idiap-RR-28-2009

Hand Posture Classification and Recognition using the Modified Census Transform, Agnès Just, Yann Rodriguez and Sébastien Marcel, Idiap-RR-02-2006

Hands Free Audio Analysis from Home Entertainment, Danil Korchagin, Philip N. Garner and Petr Motlicek, Idiap-RR-27-2010

Handwritten Digit Recognition with Binary Optical Perceptron, Indu Saxena, Perry Moerland, Emile Fiesler and A. R. Pourzand, Idiap-RR-15-1997

Handwritten Digits Recognition, Eric Grand, Idiap-RR-07-2000

Harmonic Plus Noise Model for Concatenative Speech Synthesis, D. Vandromme, Idiap-RR-37-2005

Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks, B. Fasel, Idiap-RR-51-2002

HEAT: Iterative Relevance Feedback with One Million Images, Nicolae Suditu and Francois Fleuret, Idiap-RR-33-2011

Hidden Markov Models and other Finite State Automata for Sequence Processing, Hervé Bourlard and Samy Bengio, Idiap-RR-37-2001

Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, Fabio Valente and Hynek Hermansky, Idiap-RR-45-2007

Hierarchical approach for spotting keywords, Mikko Lehtonen, Idiap-RR-41-2005

Hierarchical Multi-Stream Posterior Based Speech Recognition System, Hamed Ketabdar, Hervé Bourlard and Samy Bengio, Idiap-RR-25-2005

Hierarchical Multilayer Perceptron based Language Identification, David Imseng, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-14-2010

Hierarchical Neural Networks Feature Extraction for LVCSR system, Fabio Valente, Jithendra Vepa, Christian Plahl, Christian Gollan, Hynek Hermansky and Ralf Schlüter, Idiap-RR-08-2007

Hierarchical Penalization, Marie Szafranski, Yves Grandvalet and Pierre Morizet-Mahoudeaux, Idiap-RR-76-2007

Hierarchical Tandem Features for ASR in Mandarin, Joel Praveen Pinto, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-39-2010

High Order and Multilayer Perceptron Initialization, Georg Thimm and Emile Fiesler, Idiap-RR-07-1994

Higher-Order Statistics in Visual Object Recognition, Thomas M. Breuel, Idiap-RR-02-1993

Hilbert Envelope Based Features for Far-Field Speech Recognition, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-42-2008

Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-18-2008

HMM and IOHMM for the Recognition of Mono- and Bi-Manual 3D Hand Gestures, Agnès Just, O. Bernier and Sébastien Marcel, Idiap-RR-39-2004

HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems, Silvia Chiappa and Samy Bengio, Idiap-RR-49-2003

HMM Mixtures (HMM2) for Robust Speech Recognition, Katrin Weber, Idiap-RR-34-2003

HMM-based Non-native Accent Assessment using Posterior Features, Ramya Rasipuram, Milos Cernak and Mathew Magimai-Doss, Idiap-RR-32-2015

HMM/ANN Based Spectral Peak Location Estimation for Noise Robust Speech Recognition, Shajith Ikbal, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-50-2004

HMM2- A Novel Approach to HMM Emission Probability Estimation, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-30-2000

HMM2- Extraction of Formant Features and their Use for Robust ASR, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-42-2000

How Do Correlation and Variance of Base-Experts Affect Fusion in Biometric Authentication Tasks?, Norman Poh and Samy Bengio, Idiap-RR-18-2004

How does a dictation machine recognize speech?, T. Dutoit, L. Couvreur and Hervé Bourlard, Idiap-RR-72-2008

Human-Centered Computing: Toward a Human Revolution, Alejandro Jaimes, Daniel Gatica-Perez, Nicu Sebe and Thomas S. Huang, Idiap-RR-57-2007

Hybrid generative-discriminative models for speech and speaker recognition, Quan Le and Samy Bengio, Idiap-RR-06-2002

Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-45-2002

I4U Submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification, Rahim Saedi, Kong Aik Lee, Tomi Kinnunen, Tawfik Hasan, Benoit Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo Luis Sordo Martinez, Jia Min Karen Kua, Changhuai You, Hanwu Sun, Anthony Larcher, Padmanabhan Rajan, Ville Hautamäki, Cemal Hanilci, Billy Braithwaite, Gonzalez-Hautamäki Rosa, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, Navid Shokouhi, Driss Matrouf, Laurent El Shafey, Pejman Mowlaee, Julien Epps, Tharmarajah Thiruvaran, David Van Leeuwen, Bin Ma, Haizhou Li, John Hansen, Jean-François Bonastre, Sébastien Marcel, John Mason and Eliathamby Ambikairajah, Idiap-RR-34-2013

Identifying Dominant People in Meetings from Audio-Visual Sensors, Hayley Hung and Daniel Gatica-Perez, Idiap-RR-65-2008

Identifying unexpected words using in-context and out-of-context phoneme posteriors, Hamed Ketabdar and Hynek Hermansky, Idiap-RR-68-2006

Idiap Abstract Text Summarization System for German Text Summarization Task, Shantipriya Parida and Petr Motlicek, Idiap-RR-03-2020

IDIAP HMM/HMM2 System: Theoretical Basis and Software Specifications, Shajith Ikbal, Hervé Bourlard, Samy Bengio and Katrin Weber, Idiap-RR-27-2001

Idiap NMT System for WAT 2019 Multimodal Translation Task, Shantipriya Parida and Petr Motlicek, Idiap-RR-04-2020

Idiap Scientific Report 2022, Hervé Bourlard, Daniel Gatica-Perez, Jean-Marc Odobez, Philip N. Garner, Petr Motlicek, Mathew Magimai-Doss, Sylvain Calinon, Sébastien Marcel, Jérôme Kämpf, Raphaelle Luisier, Michael Liebling, Lonneke van der Plas, Damien Teney, Ina Kodrasi, Emmanuel Senft, James Henderson, Andre Freitas and André Anjos, Idiap-RR-05-2023

IDIAP SUBMISSION TO NIST LRE22 LANGUAGE RECOGNITION EVALUATION, Amrutha Prasad, Driss Khalil, Srikanth Madikeri and Petr Motlicek, Idiap-RR-11-2025

Idiap Submission to Swiss-German Language Detection Shared Task, Shantipriya Parida, Esaú Villatoro-Tello, Sajit Kumar, Petr Motlicek and Qingran Zhan, Idiap-RR-11-2020

IDIAP SUBMISSION TO THE NIST SRE 2016 SPEAKER RECOGNITION EVALUATION, Srikanth Madikeri, Subhadeep Dey, Marc Ferras, Petr Motlicek and Ivan Himawan, Idiap-RR-32-2016

Idiap submission to the NIST SRE 2018 Speaker Recognition Evaluation, Srikanth Madikeri, Seyyed Saeed Sarfjoo, Petr Motlicek and Sébastien Marcel, Idiap-RR-17-2019

Idiap submission to the NIST SRE 2019 Speaker Recognition Evaluation, Seyyed Saeed Sarfjoo, Srikanth Madikeri, Mahdi Hajibabaei, Petr Motlicek and Sébastien Marcel, Idiap-RR-15-2019

IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, Sergio Burdisso, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Martin Fajcik, Muskaan Singh, Pavel Smrz and Petr Motlicek, Idiap-RR-13-2022

IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, Martin Fajcik, Muskaan Singh, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek and Pavel Smrz, Idiap-RR-12-2022

Illumination-robust Pattern Matching Using Distorted Color Histograms, Georg Thimm and Juergen Luettin, Idiap-RR-09-1998

Image Classification by Neural Networks for the Quality Control of Watches, Miguel Moreira, Emile Fiesler and Gianni Pante, Idiap-RR-10-1996

Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, Idiap-RR-23-2012

Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, Danil Korchagin, Idiap-RR-20-2011

Implémentation d'un algorithme de réduction de taille des réseaux de neurones, François Marelli, Idiap-RR-03-2018

Implementation of the Standard I-vector System for the Kaldi Speech Recognition Toolkit, Srikanth Madikeri, Subhadeep Dey, Petr Motlicek and Marc Ferras, Idiap-RR-26-2016

Implementation of VTLN for Statistical Speech Synthesis, Lakshmi Saheer, John Dines, Philip N. Garner and Hui Liang, Idiap-RR-32-2010

Implementing contextual biasing in GPU decoder for online ASR, Nigmatulina Iuliia, Srikanth Madikeri, Esaú Villatoro-Tello, Petr Motlicek, Juan Zuluaga-Gomez, Karthik Pandia D S and Aravind Ganapathiraju, Idiap-RR-02-2023

Improved Pairwise Coupling Classification With Correcting Classifiers, Miguel Moreira and Eddy Mayoraz, Idiap-RR-09-1997

Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity, Benjamin Picart, Idiap-RR-18-2009

Improved Unknown-Multiple Speaker clustering using HMM, Jitendra Ajmera, Hervé Bourlard and I. Lapidot, Idiap-RR-23-2002

IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, Petr Motlicek, Fabio Valente and Igor Szoke, Idiap-RR-36-2012

Improving ASR and Callsign Detection in Air Traffic Control Speech using Whisper Prompting, Jehan Joachim Daniel Piaget, Amrutha Prasad and Petr Motlicek, Idiap-RR-04-2025

Improving callsign recognition with air-surveillance data in air-traffic communication, Nigmatulina Iuliia, Rudolf Braun, Juan Zuluaga-Gomez and Petr Motlicek, Idiap-RR-20-2021

Improving Continuous Speech Recognition System Performance with Grapheme Modelling, Mathew Magimai-Doss, John Dines, Hervé Bourlard and Hynek Hermansky, Idiap-RR-16-2005

Improving Control of Dexterous Hand Prostheses Using Adaptive Learning, Tatiana Tommasi, Francesco Orabona, Claudio Castellini and Barbara Caputo, Idiap-RR-07-2012

Improving Face Authentication Using Virtual Samples, Norman Poh, Sébastien Marcel and Samy Bengio, Idiap-RR-40-2002

Improving Face Verification using Skin Color Information, Sébastien Marcel and Samy Bengio, Idiap-RR-44-2001

Improving Face Verification using Symmetric Transformation, Sébastien Marcel, Idiap-RR-68-2003

Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis, Yu Yu, Gang Liu and Jean-Marc Odobez, Idiap-RR-03-2019

Improving Fusion with Margin-Derived Confidence In Biometric Authentication Tasks, Norman Poh and Samy Bengio, Idiap-RR-63-2004

Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-14-2013

IMPROVING MICROPHONE ARRAY SPEECH RECOGNITION WITH COCHLEAR IMPLANT-LIKE SPECTRALLY REDUCED SPEECH, Cong-Thanh Do, Mohammad J. Taghizadeh and Philip N. Garner, Idiap-RR-40-2011

Improving non-native ASR through stochastic multilingual phoneme space transformations, David Imseng, Hervé Bourlard, John Dines, Philip N. Garner and Mathew Magimai-Doss, Idiap-RR-19-2011

Improving Object Classification using Pose Information, Hugo Penedones, Ronan Collobert, Francois Fleuret and David Grangier, Idiap-RR-30-2012

Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, Giulia Bernardis and Hervé Bourlard, Idiap-RR-11-1998

Improving Real Time Factor of Information Bottleneck-based Speaker Diarization System, Srikanth Madikeri, David Imseng and Hervé Bourlard, Idiap-RR-18-2015

Improving Single Modal and Multimodal Biometric Authentication Using F-ratio Client-Dependent Normalisation, Norman Poh and Samy Bengio, Idiap-RR-52-2004

Improving Speech Recognition Using a Data-Driven Approach, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, Idiap-RR-66-2005

Incorporating Domain Knowledge with Video and Voice Data Analysis in News Broadcasts, Kim Shearer, Chitra Dorai and Svetha Venkatesh, Idiap-RR-14-2000

Increasing Speech Recognition Noise Robustness with HMM2, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-36-2001

Incremental Learning for Place Recognition in Dynamic Environments, Jie Luo, Andrzej Pronobis, Barbara Caputo and Patric Jensfelt, Idiap-RR-52-2006

Incremental Syllable-Context Phonetic Vocoding, Milos Cernak, Philip N. Garner, Alexandros Lazaridis, Petr Motlicek and Xingyu Na, Idiap-RR-05-2015

Indexation de Documents Manuscrits, Alessandro Vinciarelli, Idiap-RR-31-2006

Indexing Audio Documents by using Latent Semantic Analysis and SOM, Mikko Kurimo, Idiap-RR-13-1999

Indexing spoken audio by LSA and SOMs, Mikko Kurimo, Idiap-RR-06-2000

Inference in Switching Linear Dynamical Systems Applied to Noise Robust Speech Recognition of Isolated Digits, Bertrand Mesot, Idiap-RR-35-2008

Inferring Document Similarity from Hyper-links, David Grangier and Samy Bengio, Idiap-RR-21-2005

Infinite Models for Speaker Clustering, Fabio Valente, Idiap-RR-19-2006

Information Fusion and Person Verification Using Speech & Face Information, Conrad Sanderson and Kuldip K. Paliwal, Idiap-RR-33-2002

Information Theoretic Analysis of Production-Perception Efficiency: Case Study of Speech Pathology, Afsaneh Asaei, Milos Cernak and Hervé Bourlard, Idiap-RR-30-2016

INFORMATION THEORETIC CLUSTERING FOR UNSUPERVISED DOMAIN-ADAPTATION, Subhadeep Dey, Srikanth Madikeri and Petr Motlicek, Idiap-RR-09-2016

Integrating Articulatory Features using Kullback-Leibler Divergence based Acoustic Model for Phoneme Recognition, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-02-2011

Integrating audio and vision for robust automatic gender recognition, Marianna Pronobis and Mathew Magimai-Doss, Idiap-RR-73-2008

Integrating co-occurrence and spatial contexts on patch-based scene segmentation, Florent Monay, Pedro Quelhas, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-30-2005

Integrating Language Identification to improve Multilingual Speech Recognition, Holger Caesar, Idiap-RR-24-2012

Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, Srikanth Madikeri, Ivan Himawan, Petr Motlicek and Marc Ferras, Idiap-RR-20-2015

Integrating Posterior Features and Self-Organizing Maps for Isolated Word Recognition without Dynamic Programming, Serena Soldo and Mathew Magimai-Doss, Idiap-RR-17-2012

INtegrating SPEech acoustic and linguistic Constraints: Baseline System Development, Giulia Bernardis, Hervé Bourlard, Martin Rajman and Jean-Cédric Chappelier, Idiap-RR-21-1999

Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-26-2008

Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, Roy Wallace, Mitchell McLaren, Chris McCool and Sébastien Marcel, Idiap-RR-28-2011

Intonation atom based emphasis transfer, Pierre-Edouard Honnet and Philip N. Garner, Idiap-RR-14-2016

INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION, Srikanth Madikeri, Marc Ferras, Petr Motlicek and Subhadeep Dey, Idiap-RR-05-2017

Intrinsic dimension estimation of data: an approach based on Grassberger-Procaccia's algorithm, Francesco Camastra and Alessandro Vinciarelli, Idiap-RR-33-2000

Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, Idiap-RR-29-2010

Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, G. S. V. S. Sivaram and Hynek Hermansky, Idiap-RR-25-2008

Introduction à la reconnaissance de la parole et du locuteur, Hervé Bourlard, Idiap-RR-13-1998

Intuitive Recipes for Uncertainty Decoding with SNR Features for Noise Robust ASR, Georgios Skoumas and Philip N. Garner, Idiap-RR-23-2011

Invariances in Kernel Methods: From Samples to Objects, Alexei Pozdnoukhov and Samy Bengio, Idiap-RR-56-2004

Investigating Lexical Substitution Scoring for Subtitle Generation, Oren Glickman, Ido Dagan, Mikaela Keller, Samy Bengio and Walter Daelemans, Idiap-RR-36-2006

Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-12-2009

Investigating Semantic Segmentation Models to Assist Visually Impaired People, Michael Villamizar, Olivier Canévet and Jean-Marc Odobez, Idiap-RR-13-2024

Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, Alexandros Lazaridis, Milos Cernak, Pierre-Edouard Honnet and Philip N. Garner, Idiap-RR-22-2016

INVESTIGATING TIME DELAY NEURAL NETWORK (TDNN) FOR LANGUAGE MODELING IN LOW RESOURCE AUTOMATIC SPEECH RECOGNITION, Banriskhem Khonglah, Srikanth Madikeri, Petr Motlicek and Hervé Bourlard, Idiap-RR-13-2019

Investigating time-sensitive topic model approaches for action recognition, Romain Tavenard, Remi Emonet and Jean-Marc Odobez, Idiap-RR-26-2013

Investigation of a possible process identity between DRM and Linear Filtering, Sacha Krstulović, Idiap-RR-19-1997

Investigation of kNN Classifier on Posterior Features Towards Application in Automatic Speech Recognition, Afsaneh Asaei, Hervé Bourlard and Benjamin Picart, Idiap-RR-11-2010

Is Deep Learning Really Necessary for Word Embeddings?, Rémi Lebret, Joël Legrand and Ronan Collobert, Idiap-RR-44-2013

Iterative Posterior-Based Keyword Spotting Without Filler Models, Marius-Calin Silaghi and Hervé Bourlard, Idiap-RR-16-1999

Iterative Posterior-Based Keyword Spotting Without Filler Models: Iterative Viterbi Decoding and One-Pass Approach, Marius-Calin Silaghi and Hervé Bourlard, Idiap-RR-27-1999

Joint Bi-Modal Face and Speaker Authentication using Explicit Polynomial Expansion, Sébastien Marcel, Idiap-RR-14-2007

Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition, Mathew Magimai-Doss, Samy Bengio and Hervé Bourlard, Idiap-RR-52-2003

Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network, Weipeng He, Petr Motlicek and Jean-Marc Odobez, Idiap-RR-17-2018

Joint Operation of Voice Biometrics and Presentation Attack Detection, Pavel Korshunov and Sébastien Marcel, Idiap-RR-25-2016

Joint Similarity Learning for Predicting Links in Networks with Multiple-type Links, Majid Yazdani and Andrei Popescu-Belis, Idiap-RR-29-2015

Joint Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba, Idiap-RR-28-2005

Joint Training of Multi-Stream HMMs, Samy Bengio, Idiap-RR-22-2005

Juicer: A Weighted Finite-State Transducer speech decoder, Darren Moore, John Dines, Mathew Magimai-Doss, Jithendra Vepa, Octavian Cheng and Thomas Hain, Idiap-RR-21-2006

Just-in-Time Multimodal Association and Fusion from Home Entertainment, Danil Korchagin, Petr Motlicek, Stefan Duffner and Hervé Bourlard, Idiap-RR-10-2011

Kernel Based Text-Independent Speaker Verification, Johnny Mariéthoz, Samy Bengio and Yves Grandvalet, Idiap-RR-68-2008

Kernelized Infomax Clustering, Felix Agakov and David Barber, Idiap-RR-73-2005

Keyword Spotting on Word Lattices, De Greve Zacharie and Joel Praveen Pinto, Idiap-RR-22-2007

KL Realignment for Speaker Diarization with Multiple Feature Streams, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-24-2010

KL-HMM and Probabilistic Lexical Modeling, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-04-2013

KL-HMM BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, Srikanth Madikeri and Hervé Bourlard, Idiap-RR-19-2015

Knowledge Transfer with Jacobian Matching, Suraj Srinivas and Francois Fleuret, Idiap-RR-04-2018

Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, Radu-Andrei Negoescu, Alexander Loui and Daniel Gatica-Perez, Idiap-RR-20-2010

Language dependent universal phoneme posterior estimation for mixed language speech recognition, David Imseng, Hervé Bourlard, Mathew Magimai-Doss and John Dines, Idiap-RR-13-2011

Language model domain adaptation for automatic speech recognition, Amrutha Prasad, Petr Motlicek and Alexandre Nanchen, Idiap-RR-05-2020

Large Scale Machine Learning, Ronan Collobert, Idiap-RR-42-2004

Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, Esaú Villatoro-Tello, S. Pavankumar Dubagunta, Julian Fritsch, Gabriela Ramírez-de-la-Rosa, Petr Motlicek and Mathew Magimai-Doss, Idiap-RR-09-2021

Latent Semantic Indexing by Self-Organizing Map, Mikko Kurimo and Chafic Mokbel, Idiap-RR-12-1999

Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition System, Srikanth Madikeri, Banriskhem Khonglah, Sibo Tong, Petr Motlicek, Hervé Bourlard and Daniel Povey, Idiap-RR-28-2020

LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, Apoorv Vyas, Srikanth Madikeri and Hervé Bourlard, Idiap-RR-40-2020

Learning Categories from Few Examples with Multi Model Knowledge Transfer, Tatiana Tommasi, Francesco Orabona and Barbara Caputo, Idiap-RR-16-2013

Learning Entailment-Based Sentence Embeddings from Natural Language Inference, Rabeeh Karimi Mahabadi, Florian Mai and James Henderson, Idiap-RR-20-2019

Learning from Candidate Labeling Sets, Jie Luo and Francesco Orabona, Idiap-RR-27-2011

Learning from Images with Captions Using the Maximum Margin Set Algorithm, Jie Luo, Francesco Orabona, Barbara Caputo and Vittorio Ferrari, Idiap-RR-30-2011

Learning influence among interacting Markov chains, Dong Zhang, Daniel Gatica-Perez, Samy Bengio and Deb Roy, Idiap-RR-48-2005

Learning linearly separable features for speech recognition using convolutional neural networks, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-24-2015

Learning One Class Representations for Presentation Attack Detection using Multi-channel Convolutional Neural Networks, Anjith George and Sébastien Marcel, Idiap-RR-15-2020

Learning the Decision Function for Speaker Verification, Samy Bengio and Johnny Mariéthoz, Idiap-RR-40-2000

Learning the Inter-frame Distance for Discriminative Template-based Keyword Detection, David Grangier and Samy Bengio, Idiap-RR-15-2007

Learning the structure of image collections with latent aspect models, Florent Monay, Idiap-RR-06-2007

Learning to Retrieve Images from Text Queries with a Discriminative Model, David Grangier, Florent Monay and Samy Bengio, Idiap-RR-32-2006

LETHA: Learning from High Quality Inputs for 3D Pose Estimation in Low Quality Images, Adrian Penate-Sanchez, Francesc Moreno-Noguer, Juan Andrade-Cetto and Francois Fleuret, Idiap-RR-22-2014

Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German, Xiao Pu, Laura Mascarell, Andrei Popescu-Belis, Mark Fishel, Ngoc-Quang Luong and Martin Volk, Idiap-RR-09-2015

Leveraging Sequential Structure in Animal Vocalizations, Eklavya Sarkar and Mathew Magimai-Doss, Idiap-RR-06-2025

Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, Frédéric Bimbot and Dominique Genoud, Idiap-RR-05-1997

Linking Objects in Videos by Importance Sampling, Daniel Gatica-Perez and Ming-Ting Sun, Idiap-RR-20-2002

Links between Perceptrons, MLPs and SVMs, Ronan Collobert and Samy Bengio, Idiap-RR-06-2004

Local Binary Patterns as an Image Preprocessing for Face Authentication, Guillaume Heusch, Yann Rodriguez and Sébastien Marcel, Idiap-RR-76-2005

Local Features and 1D-HMMs for Fast and Robust Face Authentication, Fabien Cardinaux, Idiap-RR-17-2005

Local Machine Learning Models for Spatial Data Analysis, Nicolas Gilardi and Samy Bengio, Idiap-RR-34-2000

Localized mixtures of experts, Perry Moerland, Idiap-RR-14-1998

Location Based Speaker Segmentation, Guillaume Lathoud and Iain A. McCowan, Idiap-RR-43-2002

Long Term Spectral Statistics for Voice Presentation Attack Detection, Hannah Muckenhirn, Pavel Korshunov, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-11-2017

Low cost duration modelling for noise robust speech recognition, Andrew Morris, Simon Payne and Hervé Bourlard, Idiap-RR-08-2002

Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-75-2008

Low-Rank Representation For Enhanced Deep Neural Network Acoustic Models, Gil Luyet, Idiap-RR-05-2016

Low-Rank Representation of Nearest Neighbor Phone Posterior Probabilities to Enhance DNN Acoustic Modeling, Gil Luyet, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-04-2016

LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-14-2011

LP-TRAP: Linear predictive temporal patterns, Marios Athineos, Hynek Hermansky and Daniel P. W. Ellis, Idiap-RR-59-2004

LP-TRAPs in all senses, Petr Motlicek, Idiap-RR-66-2007

Machine Learning Approaches to Text Representation using Unlabeled Data, Mikaela Keller, Idiap-RR-76-2006

Machine Learning for Information Retrieval, David Grangier, Idiap-RR-34-2008

Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, Ngoc-Quang Luong and Andrei Popescu-Belis, Idiap-RR-06-2017

Making Retrieval Faster Through Document Clustering, David Grangier and Alessandro Vinciarelli, Idiap-RR-02-2004

MAP Combination of Multi-Stream HMM or HMM/ANN Experts, Andrew Morris, Astrid Hagen and Hervé Bourlard, Idiap-RR-14-2001

Mapping Nonverbal Communication into Social Status: Automatic Recognition of Journalists and Non-journalists in Radio News, Alessandro Vinciarelli, Idiap-RR-33-2007

Master Thesis: Integration of the Harmonic plus Noise Model (HNM) into the Hidden Markov Model-Based Speech Synthesis System (HTS), Coralie Hemptinne, Idiap-RR-69-2006

Maximum Negentropy Beamforming, Kenichi Kumatani, John McDonough, Dietrich Klakow, Philip N. Garner and Weifeng Li, Idiap-RR-07-2008

Maya Codical Glyph Segmentation: A Crowdsourcing Approach, Gulcan Can, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-01-2017

MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-34-2009

Measuring the gap between HMM-based ASR and TTS, John Dines, Junichi Yamagishi and Simon King, Idiap-RR-16-2009

Measuring the gap between HMM-based ASR and TTS, John Dines, Junichi Yamagishi and Simon King, Idiap-RR-34-2010

Measuring the Performance of Face Localization Systems, Yann Rodriguez, Fabien Cardinaux, Samy Bengio and Johnny Mariéthoz, Idiap-RR-53-2005

MediaParl: Bilingual mixed language accented speech database, David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorvé and Alexandre Nanchen, Idiap-RR-03-2013

Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR, Vivek Tyagi, Iain A. McCowan, Hervé Bourlard and Hemant Misra, Idiap-RR-47-2003

Melanoma Recognition using Kernel Classifiers, Elisabetta La Torre, Barbara Caputo and Tatiana Tommasi, Idiap-RR-53-2006

Memoirs of Togetherness from Audio Logs, Danil Korchagin, Idiap-RR-36-2009

Microphone Array Post-filter based on Noise Field Coherence, Iain A. McCowan and Hervé Bourlard, Idiap-RR-40-2001

Microphone Array Post-filter for Diffuse Noise Field, Iain A. McCowan and Hervé Bourlard, Idiap-RR-39-2001

Microphone Array Speech Recognition : Experiments on Overlapping Speech in Meetings, Darren Moore and Iain A. McCowan, Idiap-RR-41-2002

Minimum Mutual Information Beamforming for Simultaneous Active Speakers, Kenichi Kumatani, Uwe Mayer, Tobias Gehrig, Emilian Stoimenov, John McDonough and Matthias Wölfel, Idiap-RR-73-2007

Mining Human Location-Routines using a Multi-Level Topic Model, Katayoun Farrahi and Daniel Gatica-Perez, Idiap-RR-28-2010

Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-45-2001

Mixture Models for Unsupervised and Supervised Learning, Perry Moerland, Idiap-RR-18-2000

Mixtures of Experts Estimate A Posteriori Probabilities, Perry Moerland, Idiap-RR-07-1997

Mixtures of latent variable models for density estimation and classification, Perry Moerland, Idiap-RR-25-2000

MLP-based Log Spectral Energy Mapping for Robust Overlapping Speech Recognition, Weifeng Li, Mathew Magimai-Doss, John Dines and Hervé Bourlard, Idiap-RR-54-2007

Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation, Sébastien Marcel, Chris McCool, Pavel Matejka, Timo Ahonen and Jan Cernocky, Idiap-RR-09-2010

MOBIO: Mobile Biometric Face and Speaker Authentication, Sébastien Marcel, Chris McCool, Cosmin Atanasoaei, Flavio Tarsetti, Jan Pesan, Pavel Matejka, Jan Cernocky, Mika Helistekangas and Markus Turtinen, Idiap-RR-31-2010

Model Adaptation for Sentence Unit Segmentation from Speech, Sébastien Cuendet, Idiap-RR-64-2006

Model Adaptation with Least-Squares SVM for Adaptive Hand Prosthetics, Francesco Orabona, Claudio Castellini, Barbara Caputo, Angelo Emanuele Fiorilla and Giulio Sandini, Idiap-RR-05-2009

Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition, Afsaneh Asaei, Hervé Bourlard and Volkan Cevher, Idiap-RR-04-2011

Modeling and Understanding Flickr Communities through Topic-based Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-19-2010

Modeling Auxiliary Information in Bayesian Network Based ASR, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-11-2001

Modeling Human Interaction in Meetings, Iain A. McCowan, Samy Bengio, Daniel Gatica-Perez, Guillaume Lathoud, Florent Monay, Darren Moore, Pierre Wellner and Hervé Bourlard, Idiap-RR-59-2002

Modeling Individual and Group Actions in Meetings With Layered HMMs, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, Idiap-RR-33-2004

Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, Idiap-RR-09-2004

Modeling Interactions from Email Communication, Dong Zhang, Daniel Gatica-Perez, Deb Roy and Samy Bengio, Idiap-RR-51-2005

Modeling Scenes with Local Descriptors and Latent Aspects, Pedro Quelhas, Florent Monay, Jean-Marc Odobez, Daniel Gatica-Perez, Tinne Tuytelaars and Luc Van Gool, Idiap-RR-79-2004

Modeling semantic aspects for cross-media image indexing, Florent Monay and Daniel Gatica-Perez, Idiap-RR-56-2005

Modelling Auxiliary Features in Tandem Systems, Mathew Magimai-Doss, Todd Andrew Stephenson, Shajith Ikbal and Hervé Bourlard, Idiap-RR-21-2004

Modelling auxiliary information (pitch frequency) in hybrid HMM/ANN based ASR systems, Mathew Magimai-Doss, Todd Andrew Stephenson and Hervé Bourlard, Idiap-RR-62-2002

Modelling glottal source information for depression detection, D S Pavan Kumar, Bogdan Vlasenko and Mathew Magimai-Doss, Idiap-RR-13-2018

MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-74-2008

Modulation Frequency Features For Phoneme Recognition In Noisy Speech, Sriram Ganapathy, Samuel Thomas and Hynek Hermansky, Idiap-RR-70-2008

Monte Carlo Video Text Segmentation, Datong Chen and Jean-Marc Odobez, Idiap-RR-07-2003

More Efficiency in Multiple Kernel Learning, Alain Rakotomamonjy, Francis Bach, Stéphane Canu and Yves Grandvalet, Idiap-RR-18-2007

Motion likelihood and proposal modeling in Model-Based Stochastic Tracking, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-61-2004

Multi Channel Sequence Processing, Samy Bengio and Hervé Bourlard, Idiap-RR-04-2005

Multi-Layer Background Subtraction Based on Color and Texture, Jian Yao and Jean-Marc Odobez, Idiap-RR-67-2007

Multi-layer Boosting for Pattern Recognition, Francois Fleuret, Idiap-RR-76-2008

Multi-Modal Audio-Visual Event Recognition for Football Analysis, Mark Barnard, Jean-Marc Odobez and Samy Bengio, Idiap-RR-12-2003

Multi-Modal Data Fusion for Person Authentication using SVM, Souheil Ben-Yacoub, Idiap-RR-07-1998

Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-50-2007

Multi-party Speech Recovery Exploiting Structured Sparsity Models, Afsaneh Asaei, Mohammad J. Taghizadeh, Hervé Bourlard and Volkan Cevher, Idiap-RR-22-2011

Multi-Person Tracking in Meetings: A Comparative Study, Kevin C. Smith, Sascha Schreiber, Vítezslav Beran, Igor Potúcek, Gerhard Rigoll and Daniel Gatica-Perez, Idiap-RR-38-2006

Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-47-2008

Multi-resolution RASTA filtering for TANDEM-based ASR, Hynek Hermansky and Petr Fousek, Idiap-RR-18-2005

Multi-resolution Spectral Entropy Based Feature for Robust ASR, Hemant Misra, Shajith Ikbal, Sunil Sivadas and Hervé Bourlard, Idiap-RR-37-2004

Multi-stream adaptive evidence combination for noise robust ASR, Andrew Morris, Astrid Hagen, Hervé Glotin and Hervé Bourlard, Idiap-RR-26-1999

Multi-stream ASR: Oracle Test and Embedded Training, Hemant Misra, Jithendra Vepa and Hervé Bourlard, Idiap-RR-62-2005

Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, Fabio Valente, Jithendra Vepa and Hynek Hermansky, Idiap-RR-09-2007

Multi-stream Processing for Noise Robust Speech Recognition, Hemant Misra, Idiap-RR-28-2006

Multi-Stream Speech Recognition, Hervé Bourlard, Stéphane Dupont and Christophe Ris, Idiap-RR-07-1996

Multiclass Transfer Learning from Unconstrained Priors, Jie Luo, Tatiana Tommasi and Barbara Caputo, Idiap-RR-25-2011

Multilingual Hierarchical Attention Networks for Document Classification, Nikolaos Pappas and Andrei Popescu-Belis, Idiap-RR-17-2017

Multilingual Training and Cross-lingual Adaptation on CTC-based Acoustic Model, Sibo Tong, Philip N. Garner and Hervé Bourlard, Idiap-RR-01-2018

Multimodal Authentication using Asynchronous HMMs, Samy Bengio, Idiap-RR-02-2003

Multimodal Cue Detection Engine for Orchestrated Entertainment, Danil Korchagin, Stefan Duffner, Petr Motlicek and Carl Scheffler, Idiap-RR-34-2011

Multimodal Group Action Clustering in Meetings, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain A. McCowan and Guillaume Lathoud, Idiap-RR-24-2004

Multimodal Integration for Meeting Group Action Segmentation and Recognition, Marc Al-Hames, Alfred Dielmann, Daniel Gatica-Perez, Stephan Reiter, Steve Renals and Dong Zhang, Idiap-RR-31-2005

Multimodal Multispeaker Probabilistic Tracking in Meetings, Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez and Iain A. McCowan, Idiap-RR-66-2004

Multimodal Neural Machine Translation System for English to Bengali, Shantipriya Parida, Subhadarshi Panda, Satya Prakash Biswal, Ketan Kotwal, Arghyadeep Sen, Satya Ranjan Dash and Petr Motlicek, Idiap-RR-13-2021

Multiple Hypotheses Video OCR, Datong Chen and Juergen Luettin, Idiap-RR-28-2000

Multiple Object Tracking using Flow Linear Programming, Jerome Berclaz, Francois Fleuret and Pascal Fua, Idiap-RR-10-2009

Multiple Timescale Feature Combination towards Robust Speech Recognition, Katrin Weber, Idiap-RR-29-2000

Multitask Learning to Improve Articulatory Feature Estimation and Phoneme Recognition, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-21-2011

Multiview Face Detection, Tiffany Sauquet, Yann Rodriguez and Sébastien Marcel, Idiap-RR-49-2005

Multiscale Facial Expression Recognition using Convolutional Neural Networks, B. Fasel, Idiap-RR-52-2002

NASAL SPEECH SOUNDS DETECTION USING CONNECTIONIST TEMPORAL CLASSIFICATION, Milos Cernak and Sibo Tong, Idiap-RR-28-2017

Natural Scene Image Modeling using Color and Texture Visterms, Pedro Quelhas and Jean-Marc Odobez, Idiap-RR-17-2006

Nearly optimal exploration-exploitation decision thresholds, Christos Dimitrakakis, Idiap-RR-12-2006

Neural Network Adaptations to Hardware Implementations, Perry Moerland and Emile Fiesler, Idiap-RR-17-1997

Neural Network based Regression for Robust Overlapping Speech Recognition using Microphone Arrays, Weifeng Li, John Dines, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-09-2008

Neural Network Formalization, Emile Fiesler, Idiap-RR-01-1992

Neural Networks in Automatic Speech Recognition, F. Beaufays, Hervé Bourlard, H. Franco and Nelson Morgan, Idiap-RR-09-2001

Neural Networks with Adaptive Learning Rate and Momentum Terms, Miguel Moreira and Emile Fiesler, Idiap-RR-04-1995

Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, Alexandre Hyafil and Milos Cernak, Idiap-RR-14-2015

New Approaches Towards Robust and Adaptive Speech Recognition, Hervé Bourlard, Samy Bengio and Katrin Weber, Idiap-RR-01-2001

New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, Petr Fousek, Petr Svojanovsky, Frantisek Grezl and Hynek Hermansky, Idiap-RR-29-2004

NLPHut’s Participation at WAT2021, Shantipriya Parida, Subhadarshi Panda, Ketan Kotwal, Amulya Ratna Dash, Satya Ranjan Dash, Yashvardhan Sharma, Petr Motlicek and Ondrej Bojar, Idiap-RR-10-2021

Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, Sergio Burdisso, Esaú Villatoro-Tello, Srikanth Madikeri and Petr Motlicek, Idiap-RR-03-2023

Noise PDF transformation in secondary feature processing, Andrew Morris, Idiap-RR-29-2002

Noise Robust Discriminative Models, Quan Le and Samy Bengio, Idiap-RR-40-2003

Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication, Norman Poh and Samy Bengio, Idiap-RR-01-2004

Noisy Text Categorization, Alessandro Vinciarelli, Idiap-RR-03-2004

Noisy Text Categorization, Alessandro Vinciarelli, Idiap-RR-61-2003

Noisy Text Clustering, David Grangier and Alessandro Vinciarelli, Idiap-RR-31-2004

Non-linear Spectral Contrast Stretching for In-car Speech Recognition, Weifeng Li and Hervé Bourlard, Idiap-RR-53-2007

Non-Linear Variance Reduction Techniques in Biometric Authentication, Norman Poh and Samy Bengio, Idiap-RR-26-2003

Non-uniform QMF Decomposition for Wide-band Audio Coding based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky and Harinath Garudadri, Idiap-RR-43-2007

Nonlinear Analysis of Cognitive and Motor-related EEG Signals, Silvia Chiappa and Samy Bengio, Idiap-RR-14-2003

Nonlinear Feature Transformations for Noise Robust Speech Recognition, Shajith Ikbal, Idiap-RR-70-2004

Nonlinear Spectral Transformations for Robust Speech Recognition, Shajith Ikbal, Hynek Hermansky and Hervé Bourlard, Idiap-RR-36-2003

Not All Samples Are Created Equal: Deep Learning with Importance Sampling, Angelos Katharopoulos and Francois Fleuret, Idiap-RR-12-2018

Novel Architectures for Unsupervised Information Bottleneck based Speaker Diarization of Meetings, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, Idiap-RR-26-2020

Novel initialization methods for Speaker Diarization, David Imseng, Idiap-RR-07-2009

Numerical Experiments with Support Vector Machines, Mikhail Kanevski and Nicolas Gilardi, Idiap-RR-15-1999

Object Category Detection using Audio-visual Cues, Jie Luo, Barbara Caputo, Alon Zweig, Joerg-Henrik Back and Joern Anemueller, Idiap-RR-58-2007

Object Localization in Metric Spaces for Video Linking, Daniel Gatica-Perez and Ming-Ting Sun, Idiap-RR-09-2003

Objective Intelligibility Assessment of Text-to-Speech Systems Through Utterance Verification, Raphael Ullmann, Ramya Rasipuram, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-06-2015

Objective Speech Intelligibility Assessment through Comparison of Phoneme Class Conditional Probability Sequences, Raphael Ullmann, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-16-2014

Observations on Multi-Band Asynchrony in Distant Speech Recordings, Guillaume Lathoud, Idiap-RR-74-2006

OCR Based Slide Retrieval, Nabil Daddaoua, Jean-Marc Odobez and Alessandro Vinciarelli, Idiap-RR-11-2005

OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation, Shantipriya Parida, Satya Ranjan Dash, Ondrej Bojar, Petr Motlicek, Priyanka Pattnaik and Debasish Kumar Mallick, Idiap-RR-08-2020

Off-Line Cursive Script Recognition Based on Continuous Density HMM, Alessandro Vinciarelli and Juergen Luettin, Idiap-RR-25-1999

Offline Cursive Handwriting: From Word To Text Recognition, Alessandro Vinciarelli, Idiap-RR-24-2003

Offline Cursive Word Recognition using Continuous Density Hidden Markov Models trained with PCA or ICA Features, Alessandro Vinciarelli and Samy Bengio, Idiap-RR-46-2001

Offline Recognition of Large Vocabulary Cursive Handwritten Text, Alessandro Vinciarelli, Samy Bengio and Horst Bunke, Idiap-RR-01-2003

Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models, Alessandro Vinciarelli, Samy Bengio and Horst Bunke, Idiap-RR-22-2003

OM-2: An Online Multi-class Multi-kernel Learning Algorithm, Jie Luo, Francesco Orabona, Marco Fornoni, Barbara Caputo and Nicolo Cesa-Bianchi, Idiap-RR-06-2010

On Automatic Annotation of Images with Latent Space Models, Florent Monay and Daniel Gatica-Perez, Idiap-RR-31-2003

On automatic annotation of meeting databases, Daniel Gatica-Perez, Iain A. McCowan, Mark Barnard, Samy Bengio and Hervé Bourlard, Idiap-RR-06-2003

On Confusions in a Phoneme Recognizer, Andrew Lovitt, Joel Praveen Pinto and Hynek Hermansky, Idiap-RR-10-2007

On Factorizing Spectral Dynamics for Robust Speech Recognition, Vivek Tyagi, Iain A. McCowan, Hervé Bourlard and Hemant Misra, Idiap-RR-32-2003

On Improving Face Detection Performance by Modelling Contextual Information, Cosmin Atanasoaei, Chris McCool and Sébastien Marcel, Idiap-RR-43-2010

On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR, Mathew Magimai-Doss, Guillermo Aradilla and Hervé Bourlard, Idiap-RR-24-2009

On Local Features for Face Verification, Marc Saban and Conrad Sanderson, Idiap-RR-36-2004

On MLP-based Posterior Features for Template-based ASR, Serena Soldo, Mathew Magimai-Doss, Joel Praveen Pinto and Hervé Bourlard, Idiap-RR-37-2009

On Modeling Context-dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-43-2013

On Multi-scale Fourier Transform Analysis of Speech Signals, Vivek Tyagi and Hervé Bourlard, Idiap-RR-33-2003

On Performance / Robustness / Complexity Trade-Offs in Face Verification, Conrad Sanderson, Fabien Cardinaux and Samy Bengio, Idiap-RR-74-2004

On Performance Evaluation of Face Detection and Localization Algorithms, Vlad Popovici, Yann Rodriguez, Jean-Philippe Thiran and Sébastien Marcel, Idiap-RR-80-2003

On Spectral Methods and the Structuring of Home Videos, Jean-Marc Odobez, Daniel Gatica-Perez and Maël Guillemot, Idiap-RR-55-2002

On Structured Sparsity of Phonological Posteriors for Linguistic Parsing, Milos Cernak, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-07-2016

ON THE (UN)IMPORTANCE OF THE CONTEXTUAL FACTORS IN HMM-BASED SPEECH SYNTHESIS AND CODING, Milos Cernak, Petr Motlicek and Philip N. Garner, Idiap-RR-06-2013

On the Adequacy of Baseform Pronunciations and Pronunciation Variants, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-27-2004

On the Application of Automatic Subword Unit Derivation and Pronunciation Generation for Under-Resourced Language ASR: A Study on Scottish Gaelic, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-13-2015

On the Combination of Auditory and Modulation Frequency Channels for ASR applications, Fabio Valente and Hynek Hermansky, Idiap-RR-12-2008

On the Combination of Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-19-2003

On the Complexity of Recognizing Iterated Differences of Polyhedra, Eddy Mayoraz, Idiap-RR-10-1997

On the Complexity of Recognizing Regions Computable by Two-Layered Perceptrons, Eddy Mayoraz, Idiap-RR-03-1998

On the Complexity of the Class of Regions Computable by a Two-Layered Perceptron, Eddy Mayoraz, Idiap-RR-03-1996

On the Convergence of SVMTorch, an Algorithm for Large-Scale Regression Problems, Ronan Collobert and Samy Bengio, Idiap-RR-24-2000

On the Decomposition of Polychotomies into Dichotomies, Eddy Mayoraz and Miguel Moreira, Idiap-RR-08-1996

On the detection of morphing attacks generated by GANs, Laurent Colbois and Sébastien Marcel, Idiap-RR-07-2022

On the Effectiveness of Local Binary Patterns in Face Anti-spoofing, Ivana Chingovska, André Anjos and Sébastien Marcel, Idiap-RR-19-2012

On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing, Anjith George and Sébastien Marcel, Idiap-RR-30-2020

On the impact of non-modal phonation on phonological features, Milos Cernak, Elmar Nöth, Frank Rudzicz, Heidi Christensen, Juan Rafael Orozco-Arroyave, Raman Arora, Tobias Bocklet, Hamidreza Chinaei, Julius Hannink, Phani Sankar Nidadavolu, Juan Camilo Vasquez, Maria Yancheva, Alyssa Vann and Nikolai Vogler, Idiap-RR-28-2016

On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, Elie Khoury, Manuel Günther, Laurent El Shafey and Sébastien Marcel, Idiap-RR-35-2013

On the Need for On-Line Learning in Brain-Computer Interfaces, José del R. Millán, Idiap-RR-30-2003

On the Recent Use of Local Binary Patterns for Face Authentication, Sébastien Marcel, Yann Rodriguez and Guillaume Heusch, Idiap-RR-34-2006

On the Results of the First Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation, Sébastien Marcel, Chris McCool, Pavel Matejka, Timo Ahonen, Jan Cernocky et al., Idiap-RR-30-2010

On the Tunability of Optimizers in Deep Learning, Prabhu Teja Sivaprasad, Florian Mai, Thijs Vogels, Martin Jaggi and Francois Fleuret, Idiap-RR-19-2019

On the Use of Information Retrieval Measures for Speech Recognition Evaluation, Iain A. McCowan, Darren Moore, John Dines, Daniel Gatica-Perez, Mike Flynn, Pierre Wellner and Hervé Bourlard, Idiap-RR-73-2004

On the Use of Speech and Face Information for Identity Verification, Conrad Sanderson and Kuldip K. Paliwal, Idiap-RR-10-2004

On Use of Task Independent Training Data in Tandem Feature Extraction, Sunil Sivadas and Hynek Hermansky, Idiap-RR-57-2003

On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, Vivek Tyagi, Hervé Bourlard and Christian Wellekens, Idiap-RR-19-2005

On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, Vivek Tyagi, Hervé Bourlard and Christian Wellekens, Idiap-RR-09-2005

On Variations of the Convex Hull Operator, Eddy Mayoraz, Idiap-RR-06-1996

On-line Independent Support Vector Machines for Cognitive Systems, Francesco Orabona, Claudio Castellini, Barbara Caputo, Jie Luo and Giulio Sandini, Idiap-RR-63-2007

On-line unsupervised adaptation for face verification using Gaussian Mixture Models with multiple user models, Niklas Johansson, Chris McCool and Sébastien Marcel, Idiap-RR-07-2011

Online Classifier Adaptation in Brain-Computer Interfaces, Anna Buttfield and José del R. Millán, Idiap-RR-16-2006

Online Policy Adaptation for Ensemble Algorithms, Christos Dimitrakakis and Samy Bengio, Idiap-RR-28-2002

Online Policy Adaptation for Ensemble Classifiers, Christos Dimitrakakis and Samy Bengio, Idiap-RR-69-2003

Online statistical estimation for vehicle control, Christos Dimitrakakis, Idiap-RR-13-2006

Online-Batch Strongly Convex Multi Kernel Learning, Francesco Orabona, Jie Luo and Barbara Caputo, Idiap-RR-07-2010

Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution), Shantipriya Parida, Subhadarshi Panda, Amulya Ratna Dash, Esaú Villatoro-Tello, A. Seza Dogruöz, Rosa M. Ortega-Mendoza, Amadeo Hernández, Yashvardhan Sharma and Petr Motlicek, Idiap-RR-07-2021

Optimal Parameterization of Point Distribution Models, Georg Thimm and Juergen Luettin, Idiap-RR-01-1998

Optimal Setting of Weights, Learning Rate, and Gain, Georg Thimm and Emile Fiesler, Idiap-RR-04-1997

Optimal Spike-Timing Dependent Plasticity for Precise Action Potential Firing in Supervised Learning, J-P. Pfister, T. Toyoizumi, David Barber and W. Gerstner, Idiap-RR-88-2005

Order Matters: A Distributed Sampling Method for Multi-Object Tracking, Kevin C. Smith, Idiap-RR-25-2004

Out-of-Scene AV Data Detection, Danil Korchagin, Idiap-RR-31-2009

Overview of BTAS 2016 Speaker Anti-spoofing Competition, Pavel Korshunov, Sébastien Marcel, Hannah Muckenhirn, A. R. Gonçalves, A. G. Souza Mello, R. P. Velloso Violato, Flávio Simões, Mário Uliani Neto, Marcus de Assis Angeloni, J. A. Stuchi, H. Dinkel, N. Chen, Yanmin Qian, D. Paul, G. Saha and Md Sahidullah, Idiap-RR-24-2016

Parts-Based Face Verification using Local Frequency Bands, Chris McCool and Sébastien Marcel, Idiap-RR-03-2009

Parts-Based Face Verification using Local Frequency Bands, Chris McCool and Sébastien Marcel, Idiap-RR-06-2011

Perceptual Information Loss due to Impaired Speech Production, Afsaneh Asaei, Milos Cernak and Hervé Bourlard, Idiap-RR-20-2017

Performance Generalization in Biometric Authentication Using Joint User-Specific and Sample Bootstraps, Norman Poh, Alvin Martin and Samy Bengio, Idiap-RR-60-2005

Person Authentication using Brainwaves (EEG) and Maximum A Posteriori Model Adaptation, Sébastien Marcel and José del R. Millán, Idiap-RR-81-2005

Phase AutoCorrelation (PAC) derived Robust Speech Features, Shajith Ikbal, Hemant Misra and Hervé Bourlard, Idiap-RR-38-2002

Phase AutoCorrelation (PAC) Features for Noise Robust ASR, Shajith Ikbal, Hemant Misra, Hervé Bourlard and Hynek Hermansky, Idiap-RR-40-2004

Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, Shajith Ikbal, Hemant Misra, Hervé Bourlard and Hynek Hermansky, Idiap-RR-54-2003

PhD Thesis: Speech Analysis with Production Constraints, Sacha Krstulović, Idiap-RR-35-2001

Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-04-2009

Phoneme vs Grapheme Based Automatic Speech Recognition, Mathew Magimai-Doss, John Dines, Hervé Bourlard and Hynek Hermansky, Idiap-RR-48-2004

Phoneme-Grapheme Based Speech Recognition System, Mathew Magimai-Doss, Todd Andrew Stephenson, Hervé Bourlard and Samy Bengio, Idiap-RR-37-2003

Phonetic and Phonological Posterior Search Space Hashing Exploiting Class-Specific Sparsity Structures, Afsaneh Asaei, Gil Luyet, Milos Cernak and Hervé Bourlard, Idiap-RR-10-2016

Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation, Hui Liang and John Dines, Idiap-RR-17-2011

Phonological vocoding using artificial neural networks, Milos Cernak, Blaise Potard and Philip N. Garner, Idiap-RR-04-2015

Phrase-based Image Captioning, Rémi Lebret, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-08-2015

PLP$^2$: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns, Marios Athineos, Hynek Hermansky and Daniel P. W. Ellis, Idiap-RR-60-2004

PLSA-based Image Auto-Annotation: Constraining the Latent Space, Florent Monay and Daniel Gatica-Perez, Idiap-RR-30-2004

Plug and Play Autoencoders for Conditional Text Generation, Florian Mai, Nikolaos Pappas, Ivan Montero, Noah A. Smith and James Henderson, Idiap-RR-24-2020

Posterior Based Keyword Spotting with A Priori Thresholds, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, Idiap-RR-67-2006

Posterior Features Applied to Speech Recognition Tasks with Limited Training Data, Guillermo Aradilla, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-15-2008

Posterior-based analysis of spatio-temporal features for Sign Language Assessment, Neha Tarigopula, Sandrine Tornay, Ozge Mercanoglu Sincan, Richard Bowden and Mathew Magimai-Doss, Idiap-RR-11-2024

Posterior-Based Features and Distances in Template Matching for Speech Recognition, Guillermo Aradilla and Hervé Bourlard, Idiap-RR-41-2007

Posterior-Based Multi-Stream Formulation To Combine Multiple Grapheme-to-Phoneme Conversion Techniques, Marzieh Razavi and Mathew Magimai-Doss, Idiap-RR-33-2015

Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-23-2004

Predicting the dominant clique in meetings through fusion of nonverbal cues, Dinesh Babu Jayagopi, Hayley Hung, Chuohao Yeo and Daniel Gatica-Perez, Idiap-RR-08-2008

Predictive Models for Music, Jean-François Paiement, Yves Grandvalet and Samy Bengio, Idiap-RR-51-2008

Preliminary Work on Speaker Adaptation for DNN-Based Speech Synthesis, Blaise Potard, Petr Motlicek and David Imseng, Idiap-RR-02-2015

Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-12-2011

Probabilistic Amplitude Demodulation features in Speech Synthesis for Improving Prosody, Alexandros Lazaridis, Milos Cernak and Philip N. Garner, Idiap-RR-12-2016

Probabilistic Graphical Models for Human Interaction Analysis, Dong Zhang, Idiap-RR-78-2006

Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-21-2007

Probabilistic Home Video Structuring: Feature Selection and Performance Evaluation, Daniel Gatica-Perez, Ming-Ting Sun and Alexander Loui, Idiap-RR-11-2002

Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, Idiap-RR-33-2010

Probabilistic Lexical Modeling and Grapheme-based Automatic Speech Recognition, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-15-2013

Probabilistic Models for Melodic Prediction, Jean-François Paiement, Samy Bengio and Douglas Eck, Idiap-RR-50-2008

Probabilistic Symbol Sequence Matching and its Application to Pathological Speech Intelligibility Assessment, Julian Fritsch, Guillem Quer and Mathew Magimai-Doss, Idiap-RR-01-2021

Probabilistic Tagging of Unstructured Genealogical Records, Mike Perrow and David Barber, Idiap-RR-86-2005

Processing Megapixel Images with Deep Attention-Sampling Models, Angelos Katharopoulos and Francois Fleuret, Idiap-RR-07-2019

Progress report of a project in very low bit-rate speech coding, Milos Cernak, Philip N. Garner and Petr Motlicek, Idiap-RR-08-2012

Pronunciation models and their evaluation using confidence measures, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-29-2001

Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, Pierre-Edouard Honnet, Alexandros Lazaridis, Jean-Philippe Goldman and Philip N. Garner, Idiap-RR-04-2014

Pruning of Neural Networks, Georg Thimm and Emile Fiesler, Idiap-RR-03-1997

Pseudo-Syntactic Language Modeling for Disfluent Speech Recognition, Michael McGreevy, Idiap-RR-55-2004

Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information, Maryam Habibi, Parvaz Mahdabi and Andrei Popescu-Belis, Idiap-RR-16-2016

Raw Speech Signal-based Continuous Speech Recognition using Convolutional Neural Networks, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-15-2014

Real-Time ASR from Meetings, Philip N. Garner, John Dines, Thomas Hain, Asmaa El Hannani, Martin Karafiat, Danil Korchagin, Mike Lincoln, Vincent Wan and Le Zhang, Idiap-RR-15-2009

Real-Time Face Detection Using Boosting Learning in Hierarchical Feature Spaces, Dong Zhang, S. Z. Li and Daniel Gatica-Perez, Idiap-RR-70-2003

Real-time Multiple Head Tracking Using Texture and Colour Cues, Vasil Khalidov and Jean-Marc Odobez, Idiap-RR-02-2017

Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR, Astrid Hagen and Andrew Morris, Idiap-RR-57-2002

Recent Developments in Speaker Verification at IDIAP, B. Nedic and Hervé Bourlard, Idiap-RR-26-2000

Recognition and Understanding of Meetings Overview of the European AMI and AMIDA Projects, Hervé Bourlard and Steve Renals, Idiap-RR-27-2008

Recognition and Understanding of Meetings The AMI and AMIDA Projects, Steve Renals, Thomas Hain and Hervé Bourlard, Idiap-RR-46-2007

Recognition of Anticipatory Behavior from Human EEG, Gangadhar Garipelli, Ricardo Chavarriaga and José del R. Millán, Idiap-RR-52-2008

Recognition of Asymmetric Facial Action Unit Activities and Intensities, B. Fasel and Juergen Luettin, Idiap-RR-22-1999

Recognition of Handprinted Digits, Thomas M. Breuel, Idiap-RR-06-1993

Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, Agnès Just, O. Bernier and Sébastien Marcel, Idiap-RR-63-2003

Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-41-2008

Recognizing People's Focus of Attention from Head Poses: a Study, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-42-2006

Reconnaissance de caractères manuscrits à l'aide de réseaux neuromimétiques, Jean-Luc Beuchat, Idiap-RR-18-1997

Reconnaissance de gestes 3D bi-manuels, Agnès Just, Sébastien Marcel, O. Bernier and J. E. Viallet, Idiap-RR-79-2003

Reconstruction of image sequences from ungated and scanning-aberrated laser scanning microscopy images of the beating heart, Olivia Mariani, Alexander Ernst, Nadia Mercader and Michael Liebling, Idiap-RR-18-2019

Recurrent Convolutional Neural Networks for Scene Labeling, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-41-2013

Recurrent Convolutional Neural Networks for Scene Parsing, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-22-2013

Redundant Hash Addressing for Large-Scale Query by Example Spoken Query Detection, Afsaneh Asaei, Dhananjay Ram and Hervé Bourlard, Idiap-RR-31-2016

Reverse Correlation for analyzing MLP Posterior Features in ASR, Joel Praveen Pinto, G. S. V. S. Sivaram and Hynek Hermansky, Idiap-RR-13-2008

Review of Demographic Bias in Face Recognition, Ketan Kotwal and Sébastien Marcel, Idiap-RR-01-2025

Revisiting Doddington's Zoo: A Systematic Method to Assess User-dependent Variabilities, Norman Poh, Samy Bengio and Arun Ross, Idiap-RR-04-2006

Robust and Accurate 3D Head Pose Estimation through 3DMM and Online Head Model Reconstruction, Yu Yu, Kenneth Alberto Funes Mora and Jean-Marc Odobez, Idiap-RR-09-2017

Robust Audio Segmentation, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-35-2004

Robust Face Analysis using Convolutional Neural Networks, B. Fasel, Idiap-RR-48-2001

Robust Face Presentation Attack Detection with Multi-channel Neural Networks, Anjith George and Sébastien Marcel, Idiap-RR-03-2022

Robust Face Verification using Skin Color and Neural Networks, Sébastien Marcel, Idiap-RR-49-2002

Robust Features for Frontal Face Authentication in Difficult Image Conditions, Conrad Sanderson and Samy Bengio, Idiap-RR-05-2003

Robust HMM-Based Speech/Music Segmentation, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-33-2001

Robust multi-stream speech recognition based on the combined reliabilities of the speech signal and phonemes estimates, Hervé Glotin, Idiap-RR-36-2000

Robust overlapping speech recognition based on neural networks, Weifeng Li, John Dines and Mathew Magimai-Doss, Idiap-RR-55-2007

Robust Speaker Change Detection, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-39-2002

Robust Speaker Diarization for Short Speech Recordings, David Imseng and Gerald Friedland, Idiap-RR-26-2009

Robust Speech Recognition and Feature Extraction Using HMM2, Katrin Weber, Shajith Ikbal, Samy Bengio and Hervé Bourlard, Idiap-RR-42-2001

Robust Speech Recognition based on Multi-Stream Features, Stéphane Dupont, Hervé Bourlard and Christophe Ris, Idiap-RR-01-1997

Robust speech recognition based on multi-stream processing, Astrid Hagen, Idiap-RR-41-2001

Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach, Iain A. McCowan, Andrew Morris and Hervé Bourlard, Idiap-RR-09-2002

Robust triphone mapping for acoustic modeling, Milos Cernak, David Imseng and Hervé Bourlard, Idiap-RR-02-2013

Robust-to-Illumination Face Localisation using Active Shape Models and Local Binary Patterns, Sébastien Marcel, Jean Keomany and Yann Rodriguez, Idiap-RR-47-2006

Robustness of Group Delay Representations for Noisy Speech Signals, Sree Hari Krishnan Parthasarathi, Padmanabhan Rajan and Hema A Murthy, Idiap-RR-36-2011

Robustness of Phase based Features for Speaker Recognition, Padmanabhan Rajan, Sree Hari Krishnan Parthasarathi and Hema A Murthy, Idiap-RR-14-2009

Role Recognition for Meeting Participants: an Approach Based on Lexical Information and Social Network Analysis, N. P. Garg, Sarah Favre, Hugues Salamin, D. Hakkani Tür and Alessandro Vinciarelli, Idiap-RR-57-2008

Role Recognition in Broadcast News Using Social Network Analysis and Duration Distribution Modeling, Alessandro Vinciarelli, Idiap-RR-35-2006

Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, Sarah Favre, Hugues Salamin and Alessandro Vinciarelli, Idiap-RR-64-2008

Role Recognition in Radio Programs using Social Affiliation Networks and Mixtures of Discrete Distributions: an Approach Inspired by Social Cognition, Alessandro Vinciarelli and Sarah Favre, Idiap-RR-40-2007

Scalability Analysis of Audio-Visual Person Identity Verification, J. Czyz, Samy Bengio, Christine Marcel and L. Vandendorpe, Idiap-RR-04-2003

Scalable Wide-band Audio Codec based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy, Hynek Hermansky and Harinath Garudadri, Idiap-RR-16-2007

Score Calibration in Face Recognition, Miranti I. Mantasari, Manuel Günther, Roy Wallace, Rahim Saedi, Sébastien Marcel and David Van Leeuwen, Idiap-RR-01-2014

Sector-Based Detection for Hands-Free Speech Enhancement in Cars, Guillaume Lathoud, Julien Bourgeois and Jürgen Freudenberger, Idiap-RR-67-2004

Secured vocal access to telephone servers, Olivier Bornet, Gérard Chollet, Jean-Luc Cochard, Andrei Constantinescu and Dominique Genoud, Idiap-RR-04-1996

Segmentation of X-ray Image Sequences Showing the Vocal Tract, Georg Thimm, Idiap-RR-01-1999

Segmentation of X-ray Image Sequences Showing the Vocal Tract (with tool documentation), Georg Thimm, Idiap-RR-01-1999

Segmenting Multiple Concurrent Speakers Using Microphone Arrays, Guillaume Lathoud, Iain A. McCowan and Darren Moore, Idiap-RR-21-2003

Self-Organizing-Maps With BIC For Speaker Clustering, I. Lapidot, Idiap-RR-60-2002

Semantic Segmentation of Radio Programs Using Social Network Analysis and Duration Distribution Modeling, Alessandro Vinciarelli, F. Fernàndez and Sarah Favre, Idiap-RR-75-2006

Semi-blind spatially-variant deconvolution in optical microscopy with local Point Spread Function estimation by use of Convolutional Neural Networks, Adrian Shajkofci and Michael Liebling, Idiap-RR-07-2018

Semi-supervised Adapted HMMs for Unusual Event Detection, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, Idiap-RR-80-2004

Semi-supervised Learning with Semantic Knowledge Extraction for Improved Speech Recognition in Air Traffic Control, Ajay Srinivasamurthy, Petr Motlicek, Ivan Himawan, Gyorgy Szaszak, Youssef Oualil and Hartmut Helmke, Idiap-RR-21-2017

Semi-supervised Meeting Event Recognition with Adapted HMMs, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, Idiap-RR-15-2005

Sentiment Analysis using pretrained LLMs, Alexandre Huou, Petr Motlicek and Esaú Villatoro-Tello, Idiap-RR-05-2024

Sequence Classification with Input-Output Hidden Markov Models, Silvia Chiappa and Samy Bengio, Idiap-RR-13-2004

Session Variability Modelling for Face Authentication, Chris McCool, Roy Wallace, Mitchell McLaren, Laurent El Shafey and Sébastien Marcel, Idiap-RR-17-2013

Short-Term Spatio-Temporal Clustering of Sporadic and Concurrent Events, Guillaume Lathoud, Iain A. McCowan and Jean-Marc Odobez, Idiap-RR-14-2004

Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research, Hynek Hermansky and Nelson Morgan, Idiap-RR-81-2003

Significance of Contextual Information in Phoneme Recognition, Joel Praveen Pinto, S. R. Mahadeva Prasanna, B. Yegnanarayana and Hynek Hermansky, Idiap-RR-28-2007

Significance Tests for Bizarre Measures in 2-Class Classification Tasks, Mikaela Keller, Johnny Mariéthoz and Samy Bengio, Idiap-RR-34-2004

Silence Models in Weighted Finite-State Transducers, Philip N. Garner, Idiap-RR-19-2008

Simple Image Description Generator via a Linear Phrase-based Model, Rémi Lebret, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-22-2015

Smartphone Multi-modal Biometric Authentication: Database and Evaluation, Ramachandra Raghavendra, Martin Stokkenes, Amir Mohammadi, Sushma Venkatesh, Kiran B. Raja, Pankaj Wasnik, Eric Poiret, Sébastien Marcel and Christoph Busch, Idiap-RR-17-2020

SNR Features for Automatic Speech Recognition, Philip N. Garner, Idiap-RR-25-2009

Social Focus of Attention as a Time Function Derived from Multimodal Signals, Danil Korchagin and Hamid Reza Abutalebi, Idiap-RR-09-2011

Sociometry Based Multiparty Audio Recordings Segmentation, Alessandro Vinciarelli, Idiap-RR-78-2005

Sociometry Based Multiparty Audio Recordings Summarization, Alessandro Vinciarelli, Idiap-RR-27-2006

SOM-Based Clustering for On-Line Fraud Behavior Classification: a Case Study, V. Lemaire and F. Clérot, Idiap-RR-30-2002

Some Emerging Concepts in Speech Recognition, Hynek Hermansky and Hervé Bourlard, Idiap-RR-82-2003

Sound Pattern Matching for Automatic Prosodic Event Detection, Milos Cernak, Afsaneh Asaei, Pierre-Edouard Honnet, Philip N. Garner and Hervé Bourlard, Idiap-RR-03-2016

SPARSE AUTOENCODERS TO ENHANCE SPEECH RECOGNITION, Selen Hande Kabil and Hervé Bourlard, Idiap-RR-10-2022

Sparse Gammatone Signal Model Predicts Perceived Noise Intrusiveness, Raphael Ullmann and Hervé Bourlard, Idiap-RR-07-2014

Sparse Hidden Markov Models for Exemplar-based Speech Recognition Using Deep Neural Network Posterior Features, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-19-2016

Sparse Probabilistic Classifiers, Romain Hérault and Yves Grandvalet, Idiap-RR-19-2007

Sparse Subspace Modeling for Query by Example Spoken Term Detection, Dhananjay Ram, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-01-2016

Spatial Data Mapping with Support Vector Regression, Mikhail Kanevski and Stéphane Canu, Idiap-RR-09-2000

Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays, Guillaume Lathoud, Idiap-RR-77-2006

Speaker Change Detection with Privacy-Preserving Audio Cues, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Daniel Gatica-Perez and Hervé Bourlard, Idiap-RR-23-2009

Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, Hari Krishna Maganti and Daniel Gatica-Perez, Idiap-RR-29-2006

Speaker Normalization using HMM2, Shajith Ikbal, Katrin Weber and Hervé Bourlard, Idiap-RR-15-2002

Speaker Verification Based On User-Customized Password, Mohamed Faouzi BenZeghiba, Hervé Bourlard and Johnny Mariéthoz, Idiap-RR-13-2001

Speaker verification experiments on the XM2VTS database, Juergen Luettin, Idiap-RR-02-1999

Speaker Verification: A Quick Overview, Hervé Bourlard and Nelson Morgan, Idiap-RR-12-1998

Speaker-Dependent Speech Recognition Based on Phone-Like Units Models --- Application to Voice Dialing, Vincent Fontaine and Hervé Bourlard, Idiap-RR-09-1996

Spectral Entropy Based Feature for Robust ASR, Hemant Misra, Shajith Ikbal, Hervé Bourlard and Hynek Hermansky, Idiap-RR-56-2003

Spectral Entropy Feature in Full-Combination Multi-stream for Robust ASR, Hemant Misra and Hervé Bourlard, Idiap-RR-10-2005

Spectral Entropy Feature in Multi-stream for Robust ASR, Hemant Misra and Hervé Bourlard, Idiap-RR-45-2005

Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, Idiap-RR-16-2008

Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR, Shajith Ikbal, Mathew Magimai-Doss, Hemant Misra and Hervé Bourlard, Idiap-RR-20-2004

Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, Samuel Thomas, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-05-2008

Speech & Face Based Biometric Authentication at IDIAP, Conrad Sanderson, Samy Bengio, Hervé Bourlard, Johnny Mariéthoz, Ronan Collobert, Mohamed Faouzi BenZeghiba, Fabien Cardinaux and Sébastien Marcel, Idiap-RR-13-2003

Speech Acquisition in Meetings with an Audio-Visual Sensor Array, Iain A. McCowan, Maganti Hari Krishna, Daniel Gatica-Perez, Darren Moore and Silèye O. Ba, Idiap-RR-03-2005

Speech Coding based on Spectral Dynamics, Petr Motlicek, Hynek Hermansky, Harinath Garudadri and Naveen Srinivasamurthy, Idiap-RR-05-2006

Speech Enhancement and Recognition in Meetings with an Audio-Visual Sensor Array, Hari Krishna Maganti, Daniel Gatica-Perez and Iain A. McCowan, Idiap-RR-24-2006

Speech Enhancement using Beta-order MMSE Spectral Amplitude Estimator with Laplacian Prior, Hamid Reza Abutalebi, Mehdi Rashidinejad, Hervé Bourlard and Ali Akbar Tadaion, Idiap-RR-24-2011

SPEECH MODELING USING SPARSE AUTOENCODERS, Selen Hande Kabil and Hervé Bourlard, Idiap-RR-11-2022

Speech power spectra: a window into neural oscillations in Parkinson's disease, Sevada Hovsepyan and Mathew Magimai-Doss, Idiap-RR-02-2025

Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-44-2002

Speech Recognition Using Advanced HMM2 Features, Katrin Weber, Samy Bengio and Hervé Bourlard, Idiap-RR-24-2001

Speech Recognition with Auxiliary Information, Todd Andrew Stephenson, Idiap-RR-28-2003

Speech recognition with auxiliary information, Todd Andrew Stephenson, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-58-2002

Speech recognition with speech synthesis models by marginalising over decision tree leaves, John Dines, Lakshmi Saheer and Hui Liang, Idiap-RR-17-2009

Speech vocoding for laboratory phonology, Milos Cernak, Štefan Beňuš and Alexandros Lazaridis, Idiap-RR-07-2015

Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framework, Jitendra Ajmera, Iain A. McCowan and Hervé Bourlard, Idiap-RR-26-2001

Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features, Hayley Hung and Silèye O. Ba, Idiap-RR-20-2009

Speechreading using Probabilistic Models, Juergen Luettin and Neil A. Thacker, Idiap-RR-12-1997

Spiking Neuron Networks: A survey, Hélène Paugam-Moisy, Idiap-RR-11-2006

SPOKEN LANGUAGE IDENTIFICATION USING LANGUAGE BOTTLENECK FEATURES, Grisard Malo, Petr Motlicek, Wissem Allouchi, Michael Baeriswyl, Alexandros Lazaridis and Qingran Zhan, Idiap-RR-08-2019

Spoofing Attacks To 2D Face Recognition Systems With 3D Masks, Nesli Erdogmus and Sébastien Marcel, Idiap-RR-42-2013

Spoofing in 2D Face Recognition with 3D Masks and Anti-spoofing with Kinect, Nesli Erdogmus and Sébastien Marcel, Idiap-RR-27-2013

Sports Event Recognition using Layered HMMs, Mark Barnard and Jean-Marc Odobez, Idiap-RR-07-2005

Stable Directed Belief Propagation in Gaussian DAGs using the auxiliary variable trick, David Barber and Peter Sollich, Idiap-RR-72-2005

STACKED NEURAL NETWORKS WITH PARAMETER SHARING FOR MULTILINGUAL LANGUAGE MODELING, Banriskhem Khonglah, Srikanth Madikeri, Navid Rekabsaz, Nikolaos Pappas, Petr Motlicek and Hervé Bourlard, Idiap-RR-12-2019

Stationary Features and Cat Detection, Francois Fleuret and Donald Geman, Idiap-RR-56-2007

Statistical models for HMM/ANN hybrids, Philip N. Garner and David Imseng, Idiap-RR-11-2013

Statistical Transformation Techniques for Face Verification Using Faces Rotated in Depth, Conrad Sanderson and Samy Bengio, Idiap-RR-04-2004

Stochastic techniques in deriving perceptual knowledge, Hynek Hermansky, Idiap-RR-84-2004

Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, Milos Cernak, Alexandros Lazaridis, Philip N. Garner and Petr Motlicek, Idiap-RR-10-2014

Study of Jacobian Normalization for VTLN, Lakshmi Saheer, Philip N. Garner and John Dines, Idiap-RR-25-2010

Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, Weifeng Li and Hervé Bourlard, Idiap-RR-16-2012

Subband-Based Speech Recognition in Noisy Conditions: The Full Combination Approach, Astrid Hagen, Andrew Morris and Hervé Bourlard, Idiap-RR-15-1998

Subjective Evaluation of Join Cost and Smoothing Methods for Unit Selection Speech Synthesis, Jithendra Vepa and Simon King, Idiap-RR-34-2005

Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis, Jithendra Vepa and Simon King, Idiap-RR-26-2004

Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, Dhananjay Ram, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-06-2016

Supervised and unsupervised Web-based language model domain adaptation, Gwénolé Lecorvé, John Dines, Thomas Hain and Petr Motlicek, Idiap-RR-22-2012

Supervised Gaze Bias Correction for Gaze Coding in Interactions, Remy Siegfried and Jean-Marc Odobez, Idiap-RR-23-2017

Supervised Speech Representation Learning for Parkinson's Disease Classification, Parvaneh Janbakhshi and Ina Kodrasi, Idiap-RR-08-2021

Support Vector Machine for Multiclass Classification, Eddy Mayoraz and Ethem Alpaydin, Idiap-RR-06-1998

Support Vector Machines for Classification and Mapping of Reservoir Data, Mikhail Kanevski, Alexei Pozdnoukhov, Stéphane Canu, Michel Maignan, Patrick Wong and S. Shibli, Idiap-RR-04-2001

Support Vector Machines for Large-Scale Regression Problems, Ronan Collobert and Samy Bengio, Idiap-RR-17-2000

Support Vector Machines with a Reject Option, Yves Grandvalet, Joseph Keshet, Alain Rakotomamonjy and Stéphane Canu, Idiap-RR-01-2009

SVM-based Transfer of Visual Knowledge Across Robotic Platforms, Jie Luo, Andrzej Pronobis and Barbara Caputo, Idiap-RR-65-2006

SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, Alexandros Lazaridis, Pierre-Edouard Honnet and Philip N. Garner, Idiap-RR-03-2014

Swiss French PolyPhone and PolyVar: telephone speech databases to model inter- and intra-speaker variability, Gérard Chollet, Jean-Luc Cochard, Andrei Constantinescu, Cédric Jaboulet and Philippe Langlais, Idiap-RR-01-1996

Switching Linear Dynamical Systems for Noise Robust Speech Recognition, Bertrand Mesot and David Barber, Idiap-RR-08-2006

Syllabic Pitch Tuning for Neutral-to-Emotional Voice Conversion, Lakshmi Saheer, Xingyu Na and Milos Cernak, Idiap-RR-31-2015

Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, Milos Cernak, Xingyu Na and Philip N. Garner, Idiap-RR-24-2013

Syllable-Level Features for Speech Pathology Detection: A Case Study of Parkinson’s Disease, Sevada Hovsepyan and Mathew Magimai-Doss, Idiap-RR-02-2026

Synchronous Alignment, Johnny Mariéthoz and Chafic Mokbel, Idiap-RR-06-1999

Syntactic Parsing of Morphologically Rich Languages Using Deep Neural Networks, Joël Legrand and Ronan Collobert, Idiap-RR-25-2015

Tagging and Retrieving Images with Co-Occurrence Models: from Corel to Flickr, Nikhil Garg and Daniel Gatica-Perez, Idiap-RR-21-2009

Taking on the Curse of Dimensionality in Joint Distributions Using Neural Networks, Samy Bengio and Yoshua Bengio, Idiap-RR-01-2000

Taming GANs with Lookahead, Tatjana Chavdarova, Matteo Pagliardini, Martin Jaggi and Francois Fleuret, Idiap-RR-20-2020

Tangent Vector Kernels for Invariant Image Classification with SVMs, Alexei Pozdnoukhov and Samy Bengio, Idiap-RR-75-2003

TEAM SWITZERLAND SUBMISSION TO NIST SRE24 SPEAKER RECOGNITION EVALUATION, Amrutha Prasad, Hatef Otroshi Shahreza, Andrés Carofilis, Aref Farhadipour, Shiran Liu, Srikanth Madikeri, Anjith George, Petr Motlicek, Sébastien Marcel, Masoumeh Chapariniya, Valeriia Perepelytsia, Teodora Vukovic and Volker Dellwo, Idiap-RR-10-2025

Template-matching for Text-dependent Speaker Verification, Subhadeep Dey, Petr Motlicek, Srikanth Madikeri and Marc Ferras, Idiap-RR-32-2017

Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, Sriram Ganapathy, Petr Motlicek, Hynek Hermansky and Harinath Garudadri, Idiap-RR-48-2007

Test of several external posterior weighting functions for multiband Full Combination ASR, Hervé Glotin and Frédéric Berthommier, Idiap-RR-27-2000

Test time Adaptation through Perturbation Robustness, Prabhu Teja Sivaprasad and Francois Fleuret, Idiap-RR-17-2021

Text dependent speaker verification using binary classifiers, Dominique Genoud, Miguel Moreira and Eddy Mayoraz, Idiap-RR-08-1997

Text detection and recognition in images and video sequences, Datong Chen, Idiap-RR-44-2003

Text Detection and Recognition in Images and Videos, Datong Chen, Jean-Marc Odobez and Hervé Bourlard, Idiap-RR-61-2002

Text Enhancement with Asymmetric Filter for Video OCR, Datong Chen, Kim Shearer and Hervé Bourlard, Idiap-RR-19-2001

Text Identification in Complex Background using SVM, Datong Chen, Hervé Bourlard and Jean-Philippe Thiran, Idiap-RR-20-2001

Text Segmentation and Recognition in Complex Background Based on Markov Random Field, Datong Chen, Jean-Marc Odobez and Hervé Bourlard, Idiap-RR-17-2002

Textual Data Representation, Mikaela Keller and Samy Bengio, Idiap-RR-74-2003

The 2013 Face Recognition Evaluation in Mobile Environment, Manuel Günther, Artur Costa-Pazo, Changxing Ding, Elhocine Boutellaa, Giovani Chiachia, Honglei Zhang, Marcus de Assis Angeloni, Vitomir Struc, Elie Khoury, Esteban Vazquez-Fernandez, Dacheng Tao, Messaoud Bengherabi, David Cox, Serkan Kiranyaz, Tiago de Freitas Pereira, Jerneja Zganec-Gros, Enrique Argones-Rúa, Nicolas Pinto, Moncef Gabbouj, Flávio Simões, Simon Dobrisek, Daniel González-Jiménez, Anderson Rocha, Mário Uliani Neto, Nikola Pavesic, Alexandre Falcão, Ricardo Violato and Sébastien Marcel, Idiap-RR-36-2013

The 2013 Speaker Recognition Evaluation in Mobile Environment, Elie Khoury, Bostjan Vesnicer, Javier Franco-Pedroso, Ricardo Violato, Zenelabidine Boulkenafet, Luis-Miguel Mazaira Fernandez, Mireia Diez, Justina Kosmala, Houssemeddine Khemiri, Tomas Cipr, Rahim Saedi, Manuel Günther, Jerneja Zganec-Gros, Ruben Zazo Candil, Flávio Simões, Messaoud Bengherabi, Augustin Alvarez Marquina, Mikel Penagarikano, Alberto Abad, Mehdi Boulayemen, Petr Schwarz, David Van Leeuwen, Javier Gonzalez-Domínguez, Mário Uliani Neto, Elhocine Boutellaa, Pedro Gomez Vilda, Amparo Varona, Dijana Petrovska-Delacretaz, Pavel Matejka, Joaquin Gonzalez-Rodríguez, Tiago de Freitas Pereira, Farid Harizi, Luis Javier Rodriguez-Fuentes, Laurent El Shafey, Marcus de Assis Angeloni, German Bordel, Gérard Chollet and Sébastien Marcel, Idiap-RR-32-2013

The 2nd Competition on Counter Measures to 2D Face Spoofing Attacks, Ivana Chingovska, André Anjos and Sébastien Marcel, Idiap-RR-18-2013

The 3D Indexing Problem, Thomas M. Breuel, Idiap-RR-08-1993

The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites, Andrei Popescu-Belis, Jonathan Kilgour, Alexandre Nanchen and Peter Poller, Idiap-RR-26-2010

The AMI meeting corpus: a pre-announcement, Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Maël Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, Iain A. McCowan, Wilfried Post, Dennis Reidsma and Pierre Wellner, Idiap-RR-82-2005

The analysis of kernel ridge regression learning algorithm, Alexei Pozdnoukhov, Idiap-RR-54-2002

The Auxiliary Variable Trick for deriving Kalman Smoothers, David Barber, Idiap-RR-87-2004

The BANCA Database and Experimental Protocol for Speaker Verification, F. Porée, Johnny Mariéthoz, Samy Bengio and Frédéric Bimbot, Idiap-RR-13-2002

The COLD Database, Muhammad Muneeb Ullah, Andrzej Pronobis, Barbara Caputo, Jie Luo and Patric Jensfelt, Idiap-RR-49-2007

The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, Joern Anemueller, Joerg-Henrik Back, Barbara Caputo, Michal Havlena, Jie Luo, Hendrik Kayser, Bastian Leibe, Petr Motlicek, Tomas Pajdla, Misha Pavel, Akihiko Torii, Luc Van Gool, Hynek Hermansky and Alon Zweig, Idiap-RR-41-2010

The Expected Performance Curve, Samy Bengio, Mikaela Keller and Johnny Mariéthoz, Idiap-RR-85-2003

The Expected Performance Curve: a New Assessment Measure for Person Authentication, Samy Bengio and Johnny Mariéthoz, Idiap-RR-84-2003

The High-Quality Wide Multi-Channel Attack (HQ-WMCA) database, Zohreh Mostaani, Anjith George, Guillaume Heusch, David Geissbuhler and Sébastien Marcel, Idiap-RR-22-2020

The Kaldi Speech Recognition Toolkit, Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer and Karel Vesely, Idiap-RR-04-2012

The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, Andrzej Pronobis, Jie Luo and Barbara Caputo, Idiap-RR-08-2010

The more you learn, the less you store: memory-controlled incremental SVM, Andrzej Pronobis and Barbara Caputo, Idiap-RR-51-2006

The Multi-Channel Wall Street Journal Audio Visual Corpus (MC-WSJ-AV): Specification and Initial Experiments, Mike Lincoln, Iain A. McCowan, Jithendra Vepa and Hari Krishna Maganti, Idiap-RR-69-2005

The Projectron: a Bounded Kernel-Based Perceptron, Francesco Orabona, Joseph Keshet and Barbara Caputo, Idiap-RR-30-2008

The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), Hynek Hermansky, Petr Fousek and Mikko Lehtonen, Idiap-RR-63-2005

The segmentation of multi-channel meeting recordings for automatic speech recognition, John Dines, Jithendra Vepa and Thomas Hain, Idiap-RR-22-2006

The SIWIS database: a multilingual speech database with acted emphasis, Jean-Philippe Goldman, Pierre-Edouard Honnet, Rob Clark, Philip N. Garner, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Tiago Macedo, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli and Junichi Yamagishi, Idiap-RR-13-2016

The SIWIS French Speech Synthesis Database – Design and recording of a high quality French database for speech synthesis, Pierre-Edouard Honnet, Alexandros Lazaridis, Philip N. Garner and Junichi Yamagishi, Idiap-RR-03-2017

The Speed Submission to DIHARD II: Contributions & Lessons Learned, Md Sahidullah, Jose Patino, Samuele Cornell, Ruiqing Yin, Sunit Sivasankaran, Herve Bredin, Pavel Korshunov, Alessio Brutti, Romain Serizel, Emmanuel Vincent, Nicholas Evans, Sébastien Marcel, Stefano Squartini and Claude Barras, Idiap-RR-14-2019

https://arxiv.org/abs/1911.02388

The TA2 Database - A Multi-Modal Database from Home Entertainment, Stefan Duffner, Petr Motlicek and Danil Korchagin, Idiap-RR-37-2010

The use of Boolean concepts in general classification contexts, Miguel Moreira, Idiap-RR-46-2000

The use of brain-computer interfacing for ambient intelligence, Gangadhar Garipelli, Ferran Galán, Ricardo Chavarriaga, Pierre W. Ferrez, Eileen Lew and José del R. Millán, Idiap-RR-61-2007

The Vernissage Corpus: A Multimodal Human-Robot-Interaction Dataset, Dinesh Babu Jayagopi, Samira Sheikhi, David Klotz, Johannes Wienke, Jean-Marc Odobez, Sebastian Wrede, Vasil Khalidov, Laurent Son Nguyen, Britta Wrede and Daniel Gatica-Perez, Idiap-RR-33-2012

Thematic Indexing of Spoken Documents by Using Self-Organizing Maps, Mikko Kurimo, Idiap-RR-05-2000

Theme Topic Mixture Model: A Graphical Model for Document Representation, Mikaela Keller and Samy Bengio, Idiap-RR-05-2004

Theoretical Analysis of Euclidean Distance Matrix Completion for Ad hoc Microphone Array Calibration, Mohammad J. Taghizadeh, Idiap-RR-20-2014

Theoretical Foundations for Large-Margin Kernel-Based Continuous Speech Recognition, Joseph Keshet, Idiap-RR-44-2007

Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, Guillaume Lathoud, Mathew Magimai-Doss, Jean-Marc Odobez and Hervé Bourlard, Idiap-RR-52-2005

Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, Nicolas Scaringella, Idiap-RR-46-2008

To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, Ricardo Chavarriaga, Pierre W. Ferrez and José del R. Millán, Idiap-RR-37-2007

TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Nigmatulina Iuliia, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S and Aravind Ganapathiraju, Idiap-RR-07-2024

Tokenwise Contrastive Speech and Text Pre-Training for Speech Emotion Recognition, Eklavya Sarkar and Neha Tarigopula, Idiap-RR-07-2025

Topic and Sentiment in Phrase-Based Statistical Machine Translation, Maryam Habibi, Nikolaos Pappas and Andrei Popescu-Belis, Idiap-RR-10-2017

Topic-Level Extractive Summarization of Lectures and Meetings Using a Snippet Similarity Graph, Chidansh A. Bhatt and Andrei Popescu-Belis, Idiap-RR-09-2014

Topickr: Flickr Groups and Users Reloaded, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-61-2008

Torch: a modular machine learning software library, Ronan Collobert, Samy Bengio and Johnny Mariéthoz, Idiap-RR-46-2002

Towards a breakthrough speaker identification approach for law enforcement agencies, Khaled Khelif, Yann Mombrun, Petr Motlicek, Gerhard Backfried, Damien Kelly, Farhan Sahito, Gideon Hazzani, Luca Scarpatto, Emmanouil Chatzigavriil and Srikanth Madikeri, Idiap-RR-29-2017

Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, Gelareh Mohammadi and Alessandro Vinciarelli, Idiap-RR-05-2012

Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition, Petr Fousek and Hynek Hermansky, Idiap-RR-64-2005

Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, S. Pavankumar Dubagunta, Edoardo Moneta, Eleni Theocharopoulos and Mathew Magimai-Doss, Idiap-RR-11-2021

Towards Computer Understanding of Human Interactions, Iain A. McCowan, Daniel Gatica-Perez, Samy Bengio and Hervé Bourlard, Idiap-RR-45-2003

Towards directly modeling raw speech signal for speaker verification using CNNs, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-30-2017

Towards Document-Level Neural Machine Translation, Lesly Miculicich, Idiap-RR-25-2017

Towards Dynamic Skeleton-based Handshape Subunits for Sign Language Assessment, Sandrine Tornay and Mathew Magimai-Doss, Idiap-RR-09-2024

Towards Explaining the Success (Or Failure) of Fusion in Biometric Authentication, Norman Poh and Samy Bengio, Idiap-RR-43-2005

Towards introducing long-term statistics in MUSE for robust speech recognition, Christopher Kermorvant and Chafic Mokbel, Idiap-RR-18-1999

Towards mixed language speech recognition systems, David Imseng, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-15-2010

TOWARDS MULTILINGUAL SIGN LANGUAGE RECOGNITION, Sandrine Tornay, Marzieh Razavi and Mathew Magimai-Doss, Idiap-RR-16-2019

Towards Multiple Pronunciation Generation in Acoustic G2P Conversion Framework, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-34-2015

Towards Predicting Optimal Subsets of Base-Experts in Biometric Authentication Task, Norman Poh and Samy Bengio, Idiap-RR-17-2004

Towards Robust and Adaptive Speech Recognition Models, Hervé Bourlard, Samy Bengio and Katrin Weber, Idiap-RR-47-2002

Towards Robust and Adaptive Speech Recognition Models, Hervé Bourlard, Samy Bengio and Katrin Weber, Idiap-RR-01-2002

Towards Robust Place Recognition for Robot Localization, Muhammad Muneeb Ullah, Andrzej Pronobis, Barbara Caputo, Jie Luo, Patric Jensfelt and Henrik I. Christensen, Idiap-RR-40-2010

Towards semi-supervised learning of semantic spatial concepts, Jesus Martinez-Gomez and Barbara Caputo, Idiap-RR-03-2011

Towards using hierarchical posteriors for flexible automatic speech recognition systems, Hervé Bourlard, Samy Bengio, Mathew Magimai-Doss, Qifeng Zhu, Bertrand Mesot and Nelson Morgan, Idiap-RR-58-2004

Towards using slide information to enhance speech transcription of meetings, Artem Peregoudov, Alessandro Vinciarelli and Hervé Bourlard, Idiap-RR-01-2006

Towards Weakly Supervised Acoustic Subword Unit Discovery and Lexicon Development Using Hidden Markov Models, Marzieh Razavi, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-15-2017

Tracking Attention for Multiple People: Wandering Visual Focus of Attention Estimation, Kevin C. Smith, Silèye O. Ba, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-40-2006

Tracking People in Meetings with Particles, Daniel Gatica-Perez, Jean-Marc Odobez, Silèye O. Ba, Kevin C. Smith and Guillaume Lathoud, Idiap-RR-71-2004

Tracking the Multi Person Wandering Visual Focus of Attention, Kevin C. Smith, Silèye O. Ba, Daniel Gatica-Perez and Jean-Marc Odobez, Idiap-RR-80-2005

Tracter: A Lightweight Dataflow Framework, Philip N. Garner and John Dines, Idiap-RR-10-2010

TRACY Canvas: A Criminal Network Visualization Tool, Alejandra Sanchez Lara, Petr Motlicek, Dairazalia Sanchez-Cortes, Pradeep Rangappa, Srikanth Madikeri and Driss Khalil, Idiap-RR-03-2025

Transfer Learning of Visual Concepts across Robots: a Discriminative Approach, Sriram Prasath Elango, Tatiana Tommasi and Barbara Caputo, Idiap-RR-06-2012

Transfer Learning through Greedy Subset Selection, Ilja Kuzborskij, Francesco Orabona and Barbara Caputo, Idiap-RR-26-2015

Transforming the feature vectors to improve HMM based cursive word recognition systems, Alessandro Vinciarelli and Samy Bengio, Idiap-RR-32-2002

Translation Error Spotting from a User's Point of View, Thomas Meyer, Idiap-RR-31-2012

TRAP-TANDEM: Data-driven extraction of temporal features from speech, Hynek Hermansky, Idiap-RR-50-2003

Truncation Confusion Patterns in Onset Consonants, Andrew Lovitt, Idiap-RR-05-2007

Tuning-Robust Initialization Methods for Speaker Diarization, David Imseng and Gerald Friedland, Idiap-RR-35-2010

Twitter Sentiment Analysis (Almost) from Scratch, Rémi Lebret, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-15-2016

Two-Handed Gesture Recognition, Agnès Just and Sébastien Marcel, Idiap-RR-24-2005

Two-Handed Gestures for Human-Computer Interaction, Agnès Just, Idiap-RR-73-2006

Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar and Hema A Murthy, Idiap-RR-09-2018

Ultra-Fast Optimization Algorithm for Sparse Multi Kernel Learning, Francesco Orabona and Jie Luo, Idiap-RR-11-2011

Un environnement d'analyse linguistique robuste: CPD, version 1.7, Jean-Luc Cochard, Idiap-RR-03-1992

Un interface d'indexation documentaire: I d'i, version 1.4, Jean-Luc Cochard, Idiap-RR-01-1993

Un interface d'indexation documentaire: I d'i, version 2.0, Jean-Luc Cochard, Idiap-RR-03-1993

Un interface de recherche documentaire: I de r, version 2.0, Jean-Luc Cochard, Idiap-RR-04-1993

Understanding Factors in Emotion Perception, Lakshmi Saheer and Blaise Potard, Idiap-RR-28-2013

Understanding metro station usage using closed circuit television cameras analysis, C. Carincotte, M. Hick, Xavier Naturel, Jean-Marc Odobez, Jian Yao, A. Bastide and B. Corbucci, Idiap-RR-38-2008

Understanding Raw Waveform based CNN through Low-rank Spectro-Temporal Decoupling, Vinayak Abrol, S. Pavankumar Dubagunta and Mathew Magimai-Doss, Idiap-RR-11-2019

Une technique efficace de traitement en Prolog de la morphologie flexionnelle du français, Jean-Luc Cochard, Idiap-RR-04-1992

Unified Inference for Variational Bayesian Linear Gaussian State-Space Models, David Barber and Silvia Chiappa, Idiap-RR-50-2006

Unknown-Multiple Speaker clustering using HMM, Jitendra Ajmera, Hervé Bourlard, I. Lapidot and Iain A. McCowan, Idiap-RR-07-2002

Unsupervised Learning for Information Distillation, Kamand Kamangar, Idiap-RR-47-2007

Unsupervised Methods for Activity Analysis and Detection of Abnormal Events, Remi Emonet and Jean-Marc Odobez, Idiap-RR-21-2013

Unsupervised Spectral Subtraction for Noise-Robust ASR, Guillaume Lathoud, Mathew Magimai-Doss, Bertrand Mesot and Hervé Bourlard, Idiap-RR-42-2005

Unsupervised Spectral Subtraction for Noise-Robust ASR on Unknown Transmission Channels, Guillaume Lathoud, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-09-2006

Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, Hari Krishna Maganti, Petr Motlicek and Daniel Gatica-Perez, Idiap-RR-57-2006

User Authentication via Adapted Statistical Models of Face Images, Fabien Cardinaux, Conrad Sanderson and Samy Bengio, Idiap-RR-38-2004

User Customized HMM/ANN Based Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-32-2001

User Interface Design in a Just-in-time Retrieval System for Meetings, Andrei Popescu-Belis, Peter Poller, Jonathan Kilgour, Mike Flynn, Sebastian Germesin, Alexandre Nanchen and Majid Yazdani, Idiap-RR-38-2009

User-Customized Password HMM Based Speaker Verification, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-35-2002

User-Customized Password Speaker Verification based on HMM/ANN and GMM Models, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-10-2002

User-Customized Password Speaker Verification Using Multiple Reference and Background Models, Mohamed Faouzi BenZeghiba and Hervé Bourlard, Idiap-RR-41-2004

Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, Hayley Hung, Dinesh Babu Jayagopi, Chuohao Yeo, Gerald Friedland, Silèye O. Ba, Jean-Marc Odobez, Kannan Ramchandran, Nikki Mirghafori and Daniel Gatica-Perez, Idiap-RR-29-2007

Using Auxiliary Sources of Knowledge for Automatic Speech Recognition, Mathew Magimai-Doss, Idiap-RR-90-2005

Using Chimeric Users to Construct Fusion Classifiers in Biometric Authentication Tasks: An Investigation, Norman Poh and Samy Bengio, Idiap-RR-59-2005

Using Coreference Links to Improve Spanish-to-English Machine Translation, Lesly Miculicich and Andrei Popescu-Belis, Idiap-RR-07-2017

Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, Maryam Habibi and Andrei Popescu-Belis, Idiap-RR-14-2012

Using KL-based Acoustic Models in a Large Vocabulary Recognition Task, Guillermo Aradilla, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-14-2008

Using more informative posterior probabilities for speech recognition, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, Idiap-RR-91-2005

Using Multiple Time Scales in the Framework of Multi-Stream Speech Recognition, Astrid Hagen and Hervé Bourlard, Idiap-RR-22-2000

Using out-of-language data to improve an under-resourced speech recognizer, David Imseng, Petr Motlicek, Hervé Bourlard and Philip N. Garner, Idiap-RR-09-2013

Using Phonological Phrase Segmentation to Improve Automatic Keyword Spotting for the Highly Agglutinating Hungarian Language, Gyorgy Szaszak and Andras Beke, Idiap-RR-23-2013

Using Pitch as Prior Knowledge in Template-Based Speech Recognition, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, Idiap-RR-65-2005

Using pitch frequency information in speech recognition, Mathew Magimai-Doss, Todd Andrew Stephenson and Hervé Bourlard, Idiap-RR-23-2003

Using posterior probabilities for speech/music discrimination, Maja Popović, Idiap-RR-08-2001

Using Posterior-Based Features in Template Matching for Speech Recognition, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, Idiap-RR-23-2006

Using RASTA in task independent TANDEM feature extraction, Guillermo Aradilla, John Dines and Sunil Sivadas, Idiap-RR-22-2004

Using self-context for multimodal detection of head nods in face-to-face interactions, Laurent Son Nguyen, Jean-Marc Odobez and Daniel Gatica-Perez, Idiap-RR-27-2012

Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition, Stéphane Dupont and Juergen Luettin, Idiap-RR-14-1997

Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT), Lesly Miculicich and Andrei Popescu-Belis, Idiap-RR-29-2016

Variance Reduction Techniques in Biometric Authentication, Norman Poh and Samy Bengio, Idiap-RR-17-2003

Variational Information Maximization for Population Coding, David Barber, Idiap-RR-85-2004

Variational Information Maximization in Gaussian Channels, Felix Agakov and David Barber, Idiap-RR-88-2004

Vessel Segmentation and Branching Detection using an Adaptive Profile Kalman Filter in Retinal Blood Vessel Structure Analysis, Pedro Quelhas and James Boyce, Idiap-RR-58-2003

Video Indexing and Similarity Retrieval by Largest Common Subgraph Detection using Decision Trees, Kim Shearer, Horst Bunke and Svetha Venkatesh, Idiap-RR-15-2000

Video OCR for Sport Video Annotation and Retrieval, Datong Chen and Hervé Bourlard, Idiap-RR-28-2001

Video sequence matching via decision tree path following, Kim Shearer, Svetha Venkatesh and Horst Bunke, Idiap-RR-12-2000

Video Text Recognition Based on Markov Random Field and Grayscale Consistency Constraint, Jean-Marc Odobez and Datong Chen, Idiap-RR-18-2002

Video Text Segmentation Using Particle Filters, Datong Chen and Jean-Marc Odobez, Idiap-RR-43-2003

View-Based Recognition, Thomas M. Breuel, Idiap-RR-09-1993

Virtual High-Framerate Microscopy of the Beating Heart via Sorting of Still Images, Olivia Mariani, Kevin G. Chan, Alexander Ernst, Nadia Mercader and Michael Liebling, Idiap-RR-04-2019

Visual activity context for focus of attention estimation in dynamic meetings, Silèye O. Ba, Hayley Hung and Jean-Marc Odobez, Idiap-RR-02-2009

Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, Silèye O. Ba and Jean-Marc Odobez, Idiap-RR-75-2007

Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, Anindya Roy and Sébastien Marcel, Idiap-RR-29-2009

Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Hynek Hermansky and Mathew Magimai-Doss, Idiap-RR-69-2008

VP-STO: Via-point-based Stochastic Trajectory Optimization for Reactive Robot Behavior, Julius Jankowski, Lara Brudermuller, Nick Hawes and Sylvain Calinon, Idiap-RR-04-2023

VRBiom: A New Periocular Dataset for Biometric Applications of HMD, Ketan Kotwal, Gökhan Özbulak and Sébastien Marcel, Idiap-RR-03-2024

VTLN Adaptation for Statistical Speech Synthesis, Lakshmi Saheer, Philip N. Garner, John Dines and Hui Liang, Idiap-RR-41-2009

VTLN-Based Rapid Cross-Lingual Adaptation for Statistical Parametric Speech Synthesis, Lakshmi Saheer, Hui Liang, John Dines and Philip N. Garner, Idiap-RR-12-2012

Vulnerability Analysis of Face Morphing Attacks from Landmarks and Generative Adversarial Networks, Eklavya Sarkar, Pavel Korshunov, Laurent Colbois and Sébastien Marcel, Idiap-RR-38-2020

Weakly Supervised Object Segmentation with Convolutional Neural Networks, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-13-2014

Weighting schemes for audio-visual fusion in speech recognition, Hervé Glotin, D. Vergyri, C. Neti, G. Potamianos and Juergen Luettin, Idiap-RR-44-2000

What Did You Do Today? Discovering Daily Routines from Large-Scale Mobile Data, Katayoun Farrahi and Daniel Gatica-Perez, Idiap-RR-49-2008

What is Better: GMM of Two Gaussians or Two Clusters With One Gaussian?, I. Lapidot, Idiap-RR-56-2002

When Differential Privacy Meets Graph Neural Networks, Sina Sajadmanesh and Daniel Gatica-Perez, Idiap-RR-06-2023

When Users Meet Technology: The Meeting Browser Development Helix, Andrei Popescu-Belis, Denis Lalanne and Hervé Bourlard, Idiap-RR-05-2011

Why Do Multi-Stream, Multi-Band and Multi-Modal Approaches Work on Biometric User Authentication Tasks?, Norman Poh and Samy Bengio, Idiap-RR-59-2003

Wide-Band Audio Coding based on Frequency Domain Linear Prediction, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-32-2009

Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, Petr Motlicek, Vijay Ullal and Hynek Hermansky, Idiap-RR-58-2006

Word Embeddings through Hellinger PCA, Rémi Lebret and Ronan Collobert, Idiap-RR-29-2013

Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations, Sree Hari Krishnan Parthasarathi, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-28-2012

Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, Alessandro Vinciarelli and Samy Bengio, Idiap-RR-15-2001

Writer Identification for Smart Meeting Room Systems, Marcus Liwicki, Andreas Schlapbach, Horst Bunke, Samy Bengio, Johnny Mariéthoz and Jonas Richiardi, Idiap-RR-70-2005

XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Nigmatulina Iuliia, Petr Motlicek, Manjunath K E and Aravind Ganapathiraju, Idiap-RR-08-2024
