All research reports
2024
Investigating Semantic Segmentation Models to Assist Visually Impaired People, , and , Idiap-RR-13-2024 |
|
Estimating Breathing Pattern from Raw Speech Waveform and Short-term Speech Spectrum using Neural Networks, , , , and , Idiap-RR-12-2024 |
|
Posterior-based analysis of spatio-temporal features for Sign Language Assessment, , , , and , Idiap-RR-11-2024 |
|
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, , , , , , , , and , Idiap-RR-10-2024 |
|
Towards Dynamic Skeleton-based Handshape Subunits for Sign Language Assessment, and , Idiap-RR-09-2024 |
|
XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models, , , , , , , and , Idiap-RR-08-2024 |
[URL] |
TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR, , , , , , , , and , Idiap-RR-07-2024 |
[URL] |
Feature Representations for Automatic Meerkat Vocalization Classification, , , and , Idiap-RR-06-2024 |
|
Sentiment Analysis using pretrained LLMs, , and , Idiap-RR-05-2024 |
|
Assessing the Reliability of Biometric Authentication on Virtual Reality Devices, , and , Idiap-RR-04-2024 |
|
VRBiom: A New Periocular Dataset for Biometric Applications of HMD, , and , Idiap-RR-03-2024 |
|
Deep Privacy Funnel Model: From a Discriminative to a Generative Approach with an Application to Face Recognition, , and , Idiap-RR-02-2024 |
|
EdgeFace: Efficient Face Recognition Model for Edge Devices, , , , and , Idiap-RR-01-2024 |
|
2023
Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation, and , Idiap-RR-09-2023 |
|
Attacking Face Recognition with T-shirts: Database, Vulnerability Assessment and Detection, and , Idiap-RR-08-2023 |
|
Approximating Optimal Morphing Attacks using Template Inversion, , and , Idiap-RR-07-2023 |
|
When Differential Privacy Meets Graph Neural Networks, and , Idiap-RR-06-2023 |
|
Idiap Scientific Report 2022, , , , , , , , , , , , , , , , , and , Idiap-RR-05-2023 |
|
VP-STO: Via-point-based Stochastic Trajectory Optimization for Reactive Robot Behavior, , , and , Idiap-RR-04-2023 |
|
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews, , , and , Idiap-RR-03-2023 |
|
Implementing contextual biasing in GPU decoder for online ASR, , , , , , and , Idiap-RR-02-2023 |
|
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, , , and , Idiap-RR-01-2023 |
[URL] |
2022
IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach, , , , , , and , Idiap-RR-13-2022 |
|
IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model, , , , , , and , Idiap-RR-12-2022 |
|
SPEECH MODELING USING SPARSE AUTOENCODERS, and , Idiap-RR-11-2022 |
|
SPARSE AUTOENCODERS TO ENHANCE SPEECH RECOGNITION, and , Idiap-RR-10-2022 |
|
Eight Years of Face Recognition Research: Reproducibility, Achievements and Open Issues, , , , , and , Idiap-RR-09-2022 |
[URL] |
An anomaly detection approach for backdoored neural networks: face recognition as a case study, and , Idiap-RR-08-2022 |
[URL] |
On the detection of morphing attacks generated by GANs, and , Idiap-RR-07-2022 |
|
Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings, , , , and , Idiap-RR-06-2022 |
|
Efficient Wind Speed Nowcasting with GPU-Accelerated Nearest Neighbors Algorithm, , and , Idiap-RR-05-2022 |
|
End-to-end Accented Speech Recognition, , and , Idiap-RR-04-2022 |
|
Robust Face Presentation Attack Detection with Multi-channel Neural Networks, and , Idiap-RR-03-2022 |
|
A Comprehensive Evaluation on Multi-channel Biometric Face Presentation Attack Detection, , and , Idiap-RR-02-2022 |
|
Applying Attention Based Models for Detecting Cognitive Processes and Mental Health Conditions, , , and , Idiap-RR-01-2022 |
|
2021
Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR, , , , , , and , Idiap-RR-22-2021 |
|
Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation, and , Idiap-RR-21-2021 |
Improving callsign recognition with air-surveillance data in air-traffic communication, , , and , Idiap-RR-20-2021 |
[URL] |
Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection, , , , and , Idiap-RR-19-2021 |
Test time Adaptation through Perturbation Robustness, and , Idiap-RR-17-2021 |
BertOdia: BERT pre-training for low resource Odia language, , , , , and , Idiap-RR-16-2021 |
|
BERTraffic: A Robust BERT-Based Approach for Speaker Change Detection and Role Identification of Air-Traffic Communications, , , , , , and , Idiap-RR-15-2021 |
Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems, , , , , , and , Idiap-RR-14-2021 |
[URL] |
Multimodal Neural Machine Translation System for English to Bengali, , , , , , and , Idiap-RR-13-2021 |
|
Adjustable Deterministic Pseudonymization of Speech, , and , Idiap-RR-12-2021 |
|
Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings, , , and , Idiap-RR-11-2021 |
|
NLPHut’s Participation at WAT2021, , , , , , , and , Idiap-RR-10-2021 |
|
Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition, , , , , and , Idiap-RR-09-2021 |
|
Supervised Speech Representation Learning for Parkinson's Disease Classification, and , Idiap-RR-08-2021 |
|
Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution), , , , , , , , and , Idiap-RR-07-2021 |
|
Broadcast Media Content Categorization Using Low-Resolution Concepts, , , , and , Idiap-RR-06-2021 |
|
Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features, , , , and , Idiap-RR-05-2021 |
[URL] |
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model, , and , Idiap-RR-04-2021 |
|
An Attention Mechanism for Deep Q-Networks with Applications in Robotic Pushing, , and , Idiap-RR-03-2021 |
|
CHALLENGES IN BROADCAST MEDIA CONTENT CATEGORIZATION, , and , Idiap-RR-02-2021 |
|
Probabilistic Symbol Sequence Matching and its Application to Pathological Speech Intelligibility Assessment, , and , Idiap-RR-01-2021 |
|
2020
LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS, , and , Idiap-RR-40-2020 |
[URL] |
Vulnerability Analysis of Face Morphing Attacks from Landmarks and Generative Adversarial Networks, , , and , Idiap-RR-38-2020 |
|
Deepfake detection: humans vs. machines, and , Idiap-RR-36-2020 |
|
COMPARISON OF SUBWORD SEGMENTATION METHODS FOR OPEN-VOCABULARY END-TO-END SPEECH RECOGNITION, , , and , Idiap-RR-34-2020 |
|
AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS, , and , Idiap-RR-32-2020 |
|
On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing, and , Idiap-RR-30-2020 |
|
Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition System, , , , , and , Idiap-RR-28-2020 |
|
Novel Architectures for Unsupervised Information Bottleneck based Speaker Diarization of Meetings, , , and , Idiap-RR-26-2020 |
|
Plug and Play Autoencoders for Conditional Text Generation, , , , and , Idiap-RR-24-2020 |
|
The High-Quality Wide Multi-Channel Attack (HQ-WMCA) database, , , , and , Idiap-RR-22-2020 |
|
Taming GANs with Lookahead, , , and , Idiap-RR-20-2020 |
[URL] |
Face Recognition Systems Under Spoofing Attacks, , , and , Idiap-RR-18-2020 |
|
Smartphone Multi-modal Biometric Authentication: Database and Evaluation, , , , , , , , and , Idiap-RR-17-2020 |
[URL] |
Learning One Class Representations for Presentation Attack Detection using Multi-channel Convolutional Neural Networks, and , Idiap-RR-15-2020 |
|
Gradient Alignment in Deep Neural Networks, and , Idiap-RR-14-2020 |
|
Can Your Face Detector Do Anti-spoofing? Face Presentation Attack Detection with a Multi-Channel Face Detector, and , Idiap-RR-12-2020 |
|
Idiap Submission to Swiss-German Language Detection Shared Task, , , , and , Idiap-RR-11-2020 |
|
CNN Patch Pooling for Detecting 3D Mask Presentation Attacks in NIR, and , Idiap-RR-10-2020 |
|
German News Article Classification: A Multichannel CNN Approach, , and , Idiap-RR-09-2020 |
|
OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation, , , , , and , Idiap-RR-08-2020 |
|
A BAYESIAN APPROACH TO INTER-TASK FUSION FOR SPEAKER RECOGNITION, , and , Idiap-RR-07-2020 |
|
Language model domain adaptation for automatic speech recognition, , and , Idiap-RR-05-2020 |
|
Idiap NMT System for WAT 2019 Multimodal Translation Task, and , Idiap-RR-04-2020 |
|
Idiap Abstract Text Summarization System for German Text Summarization Task, and , Idiap-RR-03-2020 |
|
Extractive Odia Text Summarization System: An OCR based Approach, , Idiap-RR-02-2020 |
|
AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS, , , , and , Idiap-RR-01-2020 |
|
Comparison of Subword Segmentation Methods for Open-vocabulary ASR using a Difficulty Metric, , , and |
|
2019
Learning Entailment-Based Sentence Embeddings from Natural Language Inference, , and , Idiap-RR-20-2019 |
[URL] |
On the Tunability of Optimizers in Deep Learning, , , , and , Idiap-RR-19-2019 |
[URL] |
Reconstruction of image sequences from ungated and scanning-aberrated laser scanning microscopy images of the beating heart, , , and , Idiap-RR-18-2019 |
|
Idiap submission to the NIST SRE 2018 Speaker Recognition Evaluation, , , and , Idiap-RR-17-2019 |
|
TOWARDS MULTILINGUAL SIGN LANGUAGE RECOGNITION, , and , Idiap-RR-16-2019 |
|
Idiap submission to the NIST SRE 2019 Speaker Recognition Evaluation, , , , and , Idiap-RR-15-2019 |
|
The Speed Submission to DIHARD II: Contributions & Lessons Learned, , , , , , , , , , , , , and , Idiap-RR-14-2019 |
|
INVESTIGATING TIME DELAY NEURAL NETWORK (TDNN) FOR LANGUAGE MODELING IN LOW RESOURCE AUTOMATIC SPEECH RECOGNITION, , , and , Idiap-RR-13-2019 |
|
STACKED NEURAL NETWORKS WITH PARAMETER SHARING FOR MULTILINGUAL LANGUAGE MODELING, , , , , and , Idiap-RR-12-2019 |
|
Understanding Raw Waveform based CNN through Low-rank Spectro-Temporal Decoupling, , and , Idiap-RR-11-2019 |
|
Domain Adaptation and Investigation of Robustness of DNN-based Embeddings for Text-Independent Speaker Verification Using Dilated Residual Networks, , and , Idiap-RR-10-2019 |
|
A Comprehensive Experimental and Reproducible Study on Selfie Biometrics in Multistream and Heterogeneous Settings, , and , Idiap-RR-09-2019 |
|
SPOKEN LANGUAGE IDENTIFICATION USING LANGUAGE BOTTLENECK FEATURES, , , , , and , Idiap-RR-08-2019 |
|
Processing Megapixel Images with Deep Attention-Sampling Models, and , Idiap-RR-07-2019 |
[URL] |
Estimating The Degree of Sleepiness by Integrating Articulatory Feature Knowledge In Raw Waveform Based CNNs, , and , Idiap-RR-06-2020 |
|
CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, , and , Idiap-RR-06-2019 |
[URL] |
AN END-TO-END NETWORK TO SYNTHESIZE INTONATION USING A GENERALIZED COMMAND RESPONSE MODEL, , , , and , Idiap-RR-05-2019 |
|
Virtual High-Framerate Microscopy of the Beating Heart via Sorting of Still Images, , , , and , Idiap-RR-04-2019 |
|
Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis, , and , Idiap-RR-03-2019 |
|
Data-Driven Movement Subunit Extraction from Skeleton Information for Modeling Signs and Gestures, , and , Idiap-RR-02-2019 |
|
EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, and , Idiap-RR-01-2019 |
|
2018
DeepFakes: a New Threat to Face Recognition? Assessment and Detection, and , Idiap-RR-18-2018 |
|
Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network, , and , Idiap-RR-17-2018 |
|
Designing second order recurrent neural networks for prosody modelling, , Idiap-RR-16-2018 |
|
Analysis of Posterior Estimation Approaches to I-vector Extraction for Speaker Recognition, , , and , Idiap-RR-15-2018 |
|
Combining the SNR Spectrum with a Cochlear Model, , Idiap-RR-14-2018 |
|
Modelling glottal source information for depression detection, , and , Idiap-RR-13-2018 |
|
Not All Samples Are Created Equal: Deep Learning with Importance Sampling, and , Idiap-RR-12-2018 |
|
Gradient-based spectral visualization of CNNs using raw waveforms, , , and , Idiap-RR-11-2018 |
|
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , Idiap-RR-10-2018 |
|
Two-Pass IB based Speaker Diarization System using Meeting-Specific ANN based Features, , , and , Idiap-RR-09-2018 |
|
Semi-blind spatially-variant deconvolution in optical microscopy with local Point Spread Function estimation by use of Convolutional Neural Networks, and , Idiap-RR-07-2018 |
|
DNN based speaker embedding using content information for text-dependent speaker verification, , , and , Idiap-RR-06-2018 |
|
Knowledge Transfer with Jacobian Matching, and , Idiap-RR-04-2018 |
[URL] |
Implémentation d'un algorithme de réduction de taille des réseaux de neurones, , Idiap-RR-03-2018 |
|
Deep Neural Networks for Multiple Speaker Detection and Localization, , and , Idiap-RR-02-2018 |
|
Multilingual Training and Cross-lingual Adaptation on CTC-based Acoustic Model, , and , Idiap-RR-01-2018 |
|
2017
Template-matching for Text-dependent Speaker Verification, , , and , Idiap-RR-32-2017 |
|
CONTENT NORMALIZATION FOR TEXT-INDEPENDENT SPEAKER VERIFICATION, , , and , Idiap-RR-31-2017 |
|
Towards directly modeling raw speech signal for speaker verification using CNNs, , and , Idiap-RR-30-2017 |
|
Towards a breakthrough speaker identification approach for law enforcement agencies, , , , , , , , , and , Idiap-RR-29-2017 |
|
NASAL SPEECH SOUNDS DETECTION USING CONNECTIONIST TEMPORAL CLASSIFICATION, and , Idiap-RR-28-2017 |
|
Evaluating Attention Networks for Anaphora Resolution, , , and , Idiap-RR-27-2017 |
|
Cross-lingual Transfer for News Article Labeling: Benchmarking Statistical and Neural Models, , and , Idiap-RR-26-2017 |
|
Towards Document-Level Neural Machine Translation, , Idiap-RR-25-2017 |
|
Supervised Gaze Bias Correction for Gaze Coding in Interactions, and , Idiap-RR-23-2017 |
|
Semi-supervised Learning with Semantic Knowledge Extraction for Improved Speech Recognition in Air Traffic Control, , , , , and , Idiap-RR-21-2017 |
|
Perceptual Information Loss due to Impaired Speech Production, , and , Idiap-RR-20-2017 |
|
A Sub-Quadratic Exact Medoid Algorithm, and , Idiap-RR-19-2017 |
|
Comparative Study on Sentence Boundary Prediction for German and English Broadcast News, , , , and , Idiap-RR-18-2017 |
|
Multilingual Hierarchical Attention Networks for Document Classification, and , Idiap-RR-17-2017 |
[URL] |
Characterisation of voice quality of Parkinson's disease using differential phonological posterior features, , , , , and , Idiap-RR-16-2017 |
|
Towards Weakly Supervised Acoustic Subword Unit Discovery and Lexicon Development Using Hidden Markov Models, , and , Idiap-RR-15-2017 |
|
BEAT: An Open-Source Web-Based Open-Science Platform, , and , Idiap-RR-14-2017 |
|
2D Face Recognition: An Experimental and Reproducible Research Survey, , and , Idiap-RR-13-2017 |
|
From Research to Reality: Evaluation of a Single-Computer Real-Time LVCSR System for Speech-Based Retrieval, , , and , Idiap-RR-12-2017 |
|
Long Term Spectral Statistics for Voice Presentation Attack Detection, , , and , Idiap-RR-11-2017 |
|
Topic and Sentiment in Phrase-Based Statistical Machine Translation, , and , Idiap-RR-10-2017 |
|
Robust and Accurate 3D Head Pose Estimation through 3DMM and Online Head Model Reconstruction, , and , Idiap-RR-09-2017 |
|
Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues, , and , Idiap-RR-08-2017 |
|
Using Coreference Links to Improve Spanish-to-English Machine Translation, and , Idiap-RR-07-2017 |
|
Machine translation of Spanish personal and possessive pronouns using anaphora probabilities, and , Idiap-RR-06-2017 |
|
INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION, , , and , Idiap-RR-05-2017 |
|
EXPLOITING SEQUENCE INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION, , , and , Idiap-RR-04-2017 |
|
The SIWIS French Speech Synthesis Database – Design and recording of a high quality French database for speech synthesis, , , and , Idiap-RR-03-2017 |
|
Real-time Multiple Head Tracking Using Texture and Colour Cues, and , Idiap-RR-02-2017 |
|
Maya Codical Glyph Segmentation: A Crowdsourcing Approach, , and , Idiap-RR-01-2017 |
|
2016
IDIAP SUBMISSION TO THE NIST SRE 2016 SPEAKER RECOGNITION EVALUATION, , , , and , Idiap-RR-32-2016 |
|
Redundant Hash Addressing for Large-Scale Query by Example Spoken Query Detection, , and , Idiap-RR-31-2016 |
|
Information Theoretic Analysis of Production-Perception Efficiency: Case Study of Speech Pathology, , and , Idiap-RR-30-2016 |
|
Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT), and , Idiap-RR-29-2016 |
|
On the impact of non-modal phonation on phonological features, , , , , , , , , , , , , and , Idiap-RR-28-2016 |
|
Cognitive speech coding, and , Idiap-RR-27-2016 |
|
Implementation of the Standard I-vector System for the Kaldi Speech Recognition Toolkit, , , and , Idiap-RR-26-2016 |
|
Joint Operation of Voice Biometrics and Presentation Attack Detection, and , Idiap-RR-25-2016 |
[URL] |
Overview of BTAS 2016 Speaker Anti-spoofing Competition, , , , , , , , , , , , , , , and , Idiap-RR-24-2016 |
[URL] |
Cross-database evaluation of audio-based spoofing detection systems, and , Idiap-RR-23-2016 |
[URL] |
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, , , and , Idiap-RR-22-2016 |
|
Explicit Suggestion of Query Terms for News Search using Topic Models and Word Embeddings, and , Idiap-RR-21-2016 |
|
Feature mapping using far-field microphones for distant speech recognition, , , and , Idiap-RR-20-2016 |
|
Sparse Hidden Markov Models for Exemplar-based Speech Recognition Using Deep Neural Network Posterior Features, , and , Idiap-RR-19-2016 |
|
End-to-End Acoustic Modeling using Convolutional Neural Networks for Automatic Speech Recognition, , and , Idiap-RR-18-2016 |
|
Fast K-Means with Accurate Bounds, and , Idiap-RR-17-2016 |
|
Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information, , and , Idiap-RR-16-2016 |
|
Twitter Sentiment Analysis (Almost) from Scratch, , and , Idiap-RR-15-2016 |
|
Intonation atom based emphasis transfer, and , Idiap-RR-14-2016 |
|
The SIWIS database: a multilingual speech database with acted emphasis, , , , , , , , , , , and , Idiap-RR-13-2016 |
|
Probabilistic Amplitude Demodulation features in Speech Synthesis for Improving Prosody, , and , Idiap-RR-12-2016 |
|
Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, , , and , Idiap-RR-11-2016 |
|
Phonetic and Phonological Posterior Search Space Hashing Exploiting Class-Specific Sparsity Structures, , , and , Idiap-RR-10-2016 |
|
INFORMATION THEORETIC CLUSTERING FOR UNSUPERVISED DOMAIN-ADAPTATION, , and , Idiap-RR-09-2016 |
|
DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION, , , and , Idiap-RR-08-2016 |
|
On Structured Sparsity of Phonological Posteriors for Linguistic Parsing, , and , Idiap-RR-07-2016 |
[URL] |
Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, , and , Idiap-RR-06-2016 |
|
Low-Rank Representation For Enhanced Deep Neural Network Acoustic Models, , Idiap-RR-05-2016 |
|
Low-Rank Representation of Nearest Neighbor Phone Posterior Probabilities to Enhance DNN Acoustic Modeling, , , and , Idiap-RR-04-2016 |
|
Sound Pattern Matching for Automatic Prosodic Event Detection, , , , and , Idiap-RR-03-2016 |
|
An Analysis of Rhythmic Staccato-Vocalization Based on Frequency Demodulation for Laughter Detection in Conversational Meetings, , , and , Idiap-RR-02-2016 |
|
Sparse Subspace Modeling for Query by Example Spoken Term Detection, , and , Idiap-RR-01-2016 |
|
2015
A New Identity for the Least-square Solution of Overdetermined Set of Linear Equations, , and , Idiap-RR-35-2015 |
|
Towards Multiple Pronunciation Generation in Acoustic G2P Conversion Framework, , and , Idiap-RR-34-2015 |
|
Posterior-Based Multi-Stream Formulation To Combine Multiple Grapheme-to-Phoneme Conversion Techniques, and , Idiap-RR-33-2015 |
|
HMM-based Non-native Accent Assessment using Posterior Features, , and , Idiap-RR-32-2015 |
|
Syllabic Pitch Tuning for Neutral-to-Emotional Voice Conversion, , and , Idiap-RR-31-2015 |
|
Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition; Comparison with the Envelope-Variance Measure, , , , and , Idiap-RR-30-2015 |
|
Joint Similarity Learning for Predicting Links in Networks with Multiple-type Links, and , Idiap-RR-29-2015 |
|
Exploiting foreign resources for DNN-based ASR, , , , and , Idiap-RR-27-2015 |
|
Transfer Learning through Greedy Subset Selection, , and , Idiap-RR-26-2015 |
|
Syntactic Parsing of Morphologically Rich Languages Using Deep Neural Networks, and , Idiap-RR-25-2015 |
|
Learning linearly separable features for speech recognition using convolutional neural networks, , and , Idiap-RR-24-2015 |
[URL] |
Analysis of CNN-based Speech Recognition System using Raw Speech as Input, , and , Idiap-RR-23-2015 |
|
Simple Image Description Generator via a Linear Phrase-based Model, , and , Idiap-RR-22-2015 |
|
"The Sum of Its Parts": Joint Learning of Word and Phrase Representations with Autoencoders, and , Idiap-RR-21-2015 |
|
Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, , , and , Idiap-RR-20-2015 |
|
KL-HMM BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, and , Idiap-RR-19-2015 |
|
Improving Real Time Factor of Information Bottleneck-based Speaker Diarization System, , and , Idiap-RR-18-2015 |
|
COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, , and , Idiap-RR-17-2015 |
|
EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, , , and , Idiap-RR-16-2015 |
|
Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, and , Idiap-RR-14-2015 |
|
On the Application of Automatic Subword Unit Derivation and Pronunciation Generation for Under-Resourced Language ASR: A Study on Scottish Gaelic, , and , Idiap-RR-13-2015 |
|
Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, , , and , Idiap-RR-12-2015 |
|
An Empirical Model of Emphatic Word Detection, and , Idiap-RR-11-2015 |
|
Acoustic Data-Driven Grapheme-to-Phoneme Conversion in the Probabilistic Lexical Modeling Framework, , and , Idiap-RR-10-2015 |
|
Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German, , , , , and , Idiap-RR-09-2015 |
|
Phrase-based Image Captioning, , and , Idiap-RR-08-2015 |
|
Speech vocoding for laboratory phonology, , and , Idiap-RR-07-2015 |
|
Objective Intelligibility Assessment of Text-to-Speech Systems Through Utterance Verification, , , and , Idiap-RR-06-2015 |
|
Incremental Syllable-Context Phonetic Vocoding, , , , and , Idiap-RR-05-2015 |
|
Phonological vocoding using artificial neural networks, , and , Idiap-RR-04-2015 |
|
A simple continuous excitation model for parametric vocoding, , and , Idiap-RR-03-2015 |
|
Preliminary Work on Speaker Adaptation for DNN-Based Speech Synthesis, , and , Idiap-RR-02-2015 |
|
2014
LETHA: Learning from High Quality Inputs for 3D Pose Estimation in Low Quality Images., , , and , Idiap-RR-22-2014 |
|
Development of Bilingual ASR System for MediaParl Corpus, , , and , Idiap-RR-21-2014 |
|
Theoretical Analysis of Euclidean Distance Matrix Completion for Ad hoc Microphone Array Calibration, , Idiap-RR-20-2014 |
|
Articulatory Feature based Continuous Speech Recognition using Probabilistic Lexical Modeling, and , Idiap-RR-19-2014 |
|
Convolutional Neural Networks-based Continuous Speech Recognition using Raw Speech Signal, , and , Idiap-RR-18-2014 |
|
Feature Mapping of Multiple Beamformed Sources for Robust Overlapping Speech Recognition Using a Microphone Array, , , , , , and , Idiap-RR-17-2014 |
|
Objective Speech Intelligibility Assessment through Comparison of Phoneme Class Conditional Probability Sequences, , and , Idiap-RR-16-2014 |
|
Raw Speech Signal-based Continuous Speech Recognition using Convolutional Neural Networks, , and , Idiap-RR-15-2014 |
|
Weakly Supervised Object Segmentation with Convolutional Neural Networks, and , Idiap-RR-13-2014 |
|
Biometrics Evaluation under Spoofing Attacks, , and , Idiap-RR-12-2014 |
|
Exemplar-based Sparse Representation for Posterior Features, , and , Idiap-RR-11-2014 |
|
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, , , and , Idiap-RR-10-2014 |
|
Topic-Level Extractive Summarization of Lectures and Meetings Using a Snippet Similarity Graph, and , Idiap-RR-09-2014 |
|
EYEDIAP Database: Data Description and Gaze Tracking Evaluation Benchmarks, , and , Idiap-RR-08-2014 |
|
Sparse Gammatone Signal Model Predicts Perceived Noise Intrusiveness, and , Idiap-RR-07-2014 |
|
Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, , and , Idiap-RR-06-2014 |
|
Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, , and , Idiap-RR-05-2014 |
|
Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, , , and , Idiap-RR-04-2014 |
|
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, , and , Idiap-RR-03-2014 |
|
Acoustic and Lexical Resource Constrained ASR using Language-Independent Acoustic Model and Language-Dependent Probabilistic Lexical Model, and , Idiap-RR-02-2014 |
|
Score Calibration in Face Recognition, , , , , and , Idiap-RR-01-2014 |
|
2013
Is Deep Learning Really Necessary for Word Embeddings?, , and , Idiap-RR-44-2013 |
|
On Modeling Context-dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, , and , Idiap-RR-43-2013 |
|
Spoofing Attacks To 2D Face Recognition Systems With 3D Masks, and , Idiap-RR-42-2013 |
|
Recurrent Convolutional Neural Networks for Scene Labeling, and , Idiap-RR-41-2013 |
|
End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks, , and , Idiap-RR-40-2013 |
|
Crosslingual Tandem-SGMM: Exploiting Out-Of-Language Data for Acoustic Model and Feature Level Adaptation, , and , Idiap-RR-39-2013 |
|
ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS, , , and , Idiap-RR-38-2013 |
|
FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSIANS IN LVCSR TASK, , and , Idiap-RR-37-2013 |
|
The 2013 Face Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , and , Idiap-RR-36-2013 |
|
On the Improvements of Uni-modal and Bi-modal Fusions of Speaker and Face Recognition for Mobile Biometrics, , , and , Idiap-RR-35-2013 |
|
I4U Submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , Idiap-RR-34-2013 |
|
An Open-source State-of-the-art Toolbox for Broadcast News Diarization, , , , , and , Idiap-RR-33-2013 |
|
The 2013 Speaker Recognition Evaluation in Mobile Environment, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , Idiap-RR-32-2013 |
|
Fusing Matching and Biometric Similarity Measures for Face Diarization in Video, , and , Idiap-RR-31-2013 |
|
Bi-Modal Biometric Authentication on Mobile Phones in Challenging Conditions, , , , and , Idiap-RR-30-2013 |
|
Word Embeddings through Hellinger PCA, and , Idiap-RR-29-2013 |
|
Understanding Factors in Emotion Perception, and , Idiap-RR-28-2013 |
|
Spoofing in 2D Face Recognition with 3D Masks and Anti-spoofing with Kinect, and , Idiap-RR-27-2013 |
|
Investigating time-sensitive topic model approaches for action recognition, , and , Idiap-RR-26-2013 |
|
Automatic Speech Indexing System of Bilingual Video Parliament Interventions, , , , , and , Idiap-RR-25-2013 |
|
Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, , and , Idiap-RR-24-2013 |
|
Using Phonological Phrase Segmentation to Improve Automatic Keyword Spotting for the Highly Agglutinating Hungarian Language, and , Idiap-RR-23-2013 |
|
Recurrent Convolutional Neural Networks for Scene Parsing, and , Idiap-RR-22-2013 |
|
Unsupervised Methods for Activity Analysis and Detection of Abnormal Events, and , Idiap-RR-21-2013 |
|
Analyse non supervisée d'activités en vidéo surveillance pour l'analyse de scène et la détection d'événements anormaux, and , Idiap-RR-20-2013 |
[URL] |
Anti-spoofing in action: joint operation with a verification system, , and , Idiap-RR-19-2013 |
|
The 2nd Competition on Counter Measures to 2D Face Spoofing Attacks, , and , Idiap-RR-18-2013 |
|
Session Variability Modelling for Face Authentication, , , , and , Idiap-RR-17-2013 |
|
Learning Categories from Few Examples with Multi Model Knowledge Transfer, , and , Idiap-RR-16-2013 |
|
Probabilistic Lexical Modeling and Grapheme-based Automatic Speech Recognition, and , Idiap-RR-15-2013 |
|
Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach, and , Idiap-RR-14-2013 |
|
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, , and , Idiap-RR-13-2013 |
|
Bias Adaptation for Vocal Tract Length Normalization, , , and , Idiap-RR-12-2013 |
|
Statistical models for HMM/ANN hybrids, and , Idiap-RR-11-2013 |
|
Adaptation Experiments on French MediaParl ASR, , Idiap-RR-10-2013 |
|
Using out-of-language data to improve an under-resourced speech recognizer, , , and , Idiap-RR-09-2013 |
|
Enhancing State Mapping-Based Cross-Lingual Speaker Adaptation using Phonological Knowledge in a Data-Driven Manner, and , Idiap-RR-08-2013 |
|
A Scalable Formulation of Probabilistic Linear Discriminant Analysis: Applied to Face Recognition, , , and , Idiap-RR-07-2013 |
[URL] |
ON THE (UN)IMPORTANCE OF THE CONTEXTUAL FACTORS IN HMM-BASED SPEECH SYNTHESIS AND CODING, , and , Idiap-RR-06-2013 |
|
Convolutional Pitch Target Approximation Model for Speech Synthesis, and , Idiap-RR-05-2013 |
|
KL-HMM and Probabilistic Lexical Modeling, and , Idiap-RR-04-2013 |
|
MediaParl: Bilingual mixed language accented speech database, , , , , and , Idiap-RR-03-2013 |
|
Robust triphone mapping for acoustic modeling, , and , Idiap-RR-02-2013 |
|
Comparing different acoustic modeling techniques for multilingual boosting, , , , and , Idiap-RR-01-2013 |
|
2012
Data-Driven Enhancement of State Mapping-Based Cross-Lingual Speaker Adaptation, , Idiap-RR-38-2012 |
|
A Probabilistic Framework for Multiple Speaker Localization, , , and , Idiap-RR-37-2012 |
|
IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, , and , Idiap-RR-36-2012 |
|
Automatic Social Role Recognition In Professional Meetings, and , Idiap-RR-35-2012 |
|
Grapheme and Multilingual Posterior Features For Under-Resource Speech Recognition: A Study on Scottish Gaelic, , and , Idiap-RR-34-2012 |
|
The Vernissage Corpus: A Multimodal Human-Robot-Interaction Dataset, , , , , , , , , and , Idiap-RR-33-2012 |
|
A Survey on Language Modeling using Neural Networks, and , Idiap-RR-32-2012 |
|
Translation Error Spotting from a User's Point of View, , Idiap-RR-31-2012 |
|
Improving Object Classification using Pose Information, , , and , Idiap-RR-30-2012 |
|
An Open Source Framework for Standardized Comparisons of Face Recognition Algorithms, , and , Idiap-RR-29-2012 |
|
Wordless Sounds: Robust Speaker Diarization using Privacy-Preserving Audio Representations, , and , Idiap-RR-28-2012 |
|
Using self-context for multimodal detection of head nods in face-to-face interactions, , and , Idiap-RR-27-2012 |
|
Baseline System for Automatic Speech Recognition with French GlobalPhone Database, and , Idiap-RR-26-2012 |
|
Bob: a free signal processing and machine learning toolbox for researchers, , , , , and , Idiap-RR-25-2012 |
|
Integrating Language Identification to improve Multilingual Speech Recognition, , Idiap-RR-24-2012 |
|
Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, , , and , Idiap-RR-23-2012 |
|
Supervised and unsupervised Web-based language model domain adaptation, , , and , Idiap-RR-22-2012 |
|
Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, and , Idiap-RR-21-2012 |
|
Application of Subspace Gaussian Mixture Models in Contrastive Acoustic Scenarios, , , and , Idiap-RR-20-2012 |
|
On the Effectiveness of Local Binary Patterns in Face Anti-spoofing, , and , Idiap-RR-19-2012 |
|
Bi-Modal Authentication in Mobile Environments Using Session Variability Modelling, , , , and , Idiap-RR-18-2012 |
|
Integrating Posterior Features and Self-Organizing Maps for Isolated Word Recognition without Dynamic Programming, and , Idiap-RR-17-2012 |
|
Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, and , Idiap-RR-16-2012 |
|
Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, , and , Idiap-RR-15-2012 |
|
Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, and , Idiap-RR-14-2012 |
|
Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data, , , , , , , , , , , , , and , Idiap-RR-13-2012 |
|
VTLN-Based Rapid Cross-Lingual Adaptation for Statistical Parametric Speech Synthesis, , , and , Idiap-RR-12-2012 |
|
Combining Vocal Tract Length Normalization with Linear Transformations in a Bayesian Framework, , , and , Idiap-RR-11-2012 |
|
A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, , , and , Idiap-RR-10-2012 |
|
A Multiple Hypothesis Gaussian Mixture Filter for Acoustic Source Localization and Tracking, , and , Idiap-RR-09-2012 |
|
Progress report of a project in very low bit-rate speech coding, , and , Idiap-RR-08-2012 |
|
Improving Control of Dexterous Hand Prostheses Using Adaptive Learning, , , and , Idiap-RR-07-2012 |
|
Transfer Learning of Visual Concepts across Robots: a Discriminative Approach, , and , Idiap-RR-06-2012 |
|
Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, and , Idiap-RR-05-2012 |
|
The Kaldi Speech Recognition Toolkit, , , , , , , , , , , , and , Idiap-RR-04-2012 |
|
Cross-pollination of normalisation techniques from speaker to face authentication using Gaussian mixture models, , , and , Idiap-RR-03-2012 |
|
Face detection using boosted Jaccard distance-based regression, , and , Idiap-RR-02-2012 |
|
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, , and , Idiap-RR-01-2012 |
|
2011
IMPROVING MICROPHONE ARRAY SPEECH RECOGNITION WITH COCHLEAR IMPLANT-LIKE SPECTRALLY REDUCED SPEECH, , and , Idiap-RR-40-2011 |
|
BROADBAND BEAMPATTERN FOR MULTI-CHANNEL SPEECH ACQUISITION AND DISTANT SPEECH RECOGNITION, , and , Idiap-RR-39-2011 |
|
Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM, and , Idiap-RR-38-2011 |
|
Face Verification using Gabor Filtering and Adapted Gaussian Mixture Models, , and , Idiap-RR-37-2011 |
|
Robustness of Group Delay Representations for Noisy Speech Signals, , and , Idiap-RR-36-2011 |
|
Continuous Speech Recognition using Boosted Binary Features, , and , Idiap-RR-35-2011 |
|
Multimodal Cue Detection Engine for Orchestrated Entertainment, , , and , Idiap-RR-34-2011 |
|
HEAT: Iterative Relevance Feedback with One Million Images, and , Idiap-RR-33-2011 |
|
Finding Information in Multimedia Records of Meetings, , and , Idiap-RR-32-2011 |
|
A Speech-based Just-in-Time Retrieval System using Semantic Search, , , and , Idiap-RR-31-2011 |
|
Learning from Images with Captions Using the Maximum Margin Set Algorithm, , , and , Idiap-RR-30-2011 |
|
Inter-session Variability Modelling and Joint Factor Analysis for Face Authentication, , , and , Idiap-RR-28-2011 |
|
Learning from Candidate Labeling Sets, and , Idiap-RR-27-2011 |
|
A Large-Scale Database of Images and Captions for Automatic Face Naming, , , and , Idiap-RR-26-2011 |
|
Multiclass Transfer Learning from Unconstrained Priors, , and , Idiap-RR-25-2011 |
|
Speech Enhancement using Beta-order MMSE Spectral Amplitude Estimator with Laplacian Prior, , , and , Idiap-RR-24-2011 |
|
Intuitive Recipes for Uncertainty Decoding with SNR Features for Noise Robust ASR, and , Idiap-RR-23-2011 |
|
Multi-party Speech Recovery Exploiting Structured Sparsity Models, , , and , Idiap-RR-22-2011 |
|
Multitask Learning to Improve Articulatory Feature Estimation and Phoneme Recognition, and , Idiap-RR-21-2011 |
|
Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, , Idiap-RR-20-2011 |
|
Improving non-native ASR through stochastic multilingual phoneme space transformations, , , , and , Idiap-RR-19-2011 |
|
Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech, and , Idiap-RR-18-2011 |
|
Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation, and , Idiap-RR-17-2011 |
|
AN INTEGRATED FRAMEWORK FOR MULTI-CHANNEL MULTI-SOURCE LOCALIZATION AND VOICE ACTIVITY DETECTION, , , , and , Idiap-RR-16-2011 |
|
Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition, , Idiap-RR-15-2011 |
|
LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization, , and , Idiap-RR-14-2011 |
|
Language dependent universal phoneme posterior estimation for mixed language speech recognition, , , and , Idiap-RR-13-2011 |
|
Privacy-Sensitive Audio Features for Speech/Nonspeech Detection, , , and , Idiap-RR-12-2011 |
|
Ultra-Fast Optimization Algorithm for Sparse Multi Kernel Learning, and , Idiap-RR-11-2011 |
|
Just-in-Time Multimodal Association and Fusion from Home Entertainment, , , and , Idiap-RR-10-2011 |
|
Social Focus of Attention as a Time Function Derived from Multimodal Signals, and , Idiap-RR-09-2011 |
|
Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, , Idiap-RR-08-2011 |
|
On-line unsupervised adaptation for face verification using Gaussian Mixture Models with multiple user models, , and , Idiap-RR-07-2011 |
|
Parts-Based Face Verification using Local Frequency Bands, and , Idiap-RR-06-2011 |
|
When Users Meet Technology: The Meeting Browser Development Helix, , and , Idiap-RR-05-2011 |
|
Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition, , and , Idiap-RR-04-2011 |
|
Towards semi-supervised learning of semantic spatial concepts, and , Idiap-RR-03-2011 |
|
Integrating Articulatory Features using Kullback-Leibler Divergence based Acoustic Model for Phoneme Recognition, and , Idiap-RR-02-2011 |
|
Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking, and , Idiap-RR-01-2011 |
|
2010
On Improving Face Detection Performance by Modelling Contextual Information, , and , Idiap-RR-43-2010 |
|
Automatic Time Skew Detection and Correction, , Idiap-RR-42-2010 |
|
The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, , , , , , , , , , , , , and , Idiap-RR-41-2010 |
|
Towards Robust Place Recognition for Robot Localization, , , , , and , Idiap-RR-40-2010 |
|
Hierarchical Tandem Features for ASR in Mandarin, , and , Idiap-RR-39-2010 |
|
Fast Bounding Box Estimation based Face Detection, and , Idiap-RR-38-2010 |
|
The TA2 Database - A Multi-Modal Database from Home Entertainment, , and , Idiap-RR-37-2010 |
|
A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining, , and , Idiap-RR-36-2010 |
|
Tuning-Robust Initialization Methods for Speaker Diarization, and , Idiap-RR-35-2010 |
|
Measuring the gap between HMM-based ASR and TTS, , and , Idiap-RR-34-2010 |
|
Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes, , and , Idiap-RR-33-2010 |
|
Implementation of VTLN for Statistical Speech Synthesis, , , and , Idiap-RR-32-2010 |
|
MOBIO: Mobile Biometric Face and Speaker Authentication, , , , , , , , and , Idiap-RR-31-2010 |
|
On the Results of the First Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation, , , , , and , Idiap-RR-30-2010 |
|
Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, and , Idiap-RR-29-2010 |
|
Mining Human Location-Routines using a Multi-Level Topic Model, and , Idiap-RR-28-2010 |
|
Hands Free Audio Analysis from Home Entertainment, , and , Idiap-RR-27-2010 |
|
The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites, , , and , Idiap-RR-26-2010 |
|
Study of Jacobian Normalization for VTLN, , and , Idiap-RR-25-2010 |
|
KL Realignment for Speaker Diarization with Multiple Feature Streams, , and , Idiap-RR-24-2010 |
|
Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, , and , Idiap-RR-23-2010 |
|
An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, , and , Idiap-RR-22-2010 |
|
English Spoken Term Detection in Multilingual Recordings, , and , Idiap-RR-21-2010 |
|
Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, , and , Idiap-RR-20-2010 |
|
Modeling and Understanding Flickr Communities through Topic-based Analysis, and , Idiap-RR-19-2010 |
|
Flickr Groups: Multimedia Communities for Multimedia Analysis, and , Idiap-RR-18-2010 |
|
Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, and , Idiap-RR-17-2010 |
|
An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, and , Idiap-RR-16-2010 |
|
Towards mixed language speech recognition systems, , and , Idiap-RR-15-2010 |
|
Hierarchical Multilayer Perceptron based Language Identification, , and , Idiap-RR-14-2010 |
|
Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, and , Idiap-RR-13-2010 |
|
Estimating Cohesion in Small Groups using Audio-Visual Nonverbal Behavior, and , Idiap-RR-12-2010 |
|
Investigation of kNN Classifier on Posterior Features Towards Application in Automatic Speech Recognition, , and , Idiap-RR-11-2010 |
|
Tracter: A Lightweight Dataflow Framework, and , Idiap-RR-10-2010 |
|
Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation, , , , and , Idiap-RR-09-2010 |
|
The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, , and , Idiap-RR-08-2010 |
|
Online-Batch Strongly Convex Multi Kernel Learning, , and , Idiap-RR-07-2010 |
|
OM-2: An Online Multi-class Multi-kernel Learning Algorithm, , , , and , Idiap-RR-06-2010 |
|
A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, , and , Idiap-RR-05-2010 |
|
Application of Out-Of-Language Detection To Spoken-Term Detection, and , Idiap-RR-04-2010 |
|
AMIDA/Klewel Mini-Project, , , and , Idiap-RR-03-2010 |
|
An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, and , Idiap-RR-02-2010 |
|
Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, , , and , Idiap-RR-01-2010 |
|
2009
VTLN Adaptation for Statistical Speech Synthesis, , , and , Idiap-RR-41-2009 |
|
Automatic Temporal Alignment of AV Data with Confidence Estimation, , and , Idiap-RR-40-2009 |
|
Automatic Temporal Alignment of AV Data, , and , Idiap-RR-39-2009 |
|
User Interface Design in a Just-in-time Retrieval System for Meetings, , , , , , and , Idiap-RR-38-2009 |
|
On MLP-based Posterior Features for Template-based ASR, , , and , Idiap-RR-37-2009 |
|
Memoirs of Togetherness from Audio Logs, , Idiap-RR-36-2009 |
|
APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , Idiap-RR-35-2009 |
|
MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , Idiap-RR-34-2009 |
|
Autoregressive Models of Amplitude Modulations in Audio Compression, , and , Idiap-RR-33-2009 |
|
Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , and , Idiap-RR-32-2009 |
|
Out-of-Scene AV Data Detection, , Idiap-RR-31-2009 |
|
Analysis of F0 and Cepstral Features for Robust Automatic Gender Recognition, and , Idiap-RR-30-2009 |
|
Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, and , Idiap-RR-29-2009 |
|
Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, and , Idiap-RR-28-2009 |
|
Bayesian Networks to Combine Intensity and Color Information in Face Recognition, and , Idiap-RR-27-2009 |
|
Robust Speaker Diarization for Short Speech Recordings, and , Idiap-RR-26-2009 |
|
SNR Features for Automatic Speech Recognition, , Idiap-RR-25-2009 |
|
On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR, , and , Idiap-RR-24-2009 |
|
Speaker Change Detection with Privacy-Preserving Audio Cues, , , and , Idiap-RR-23-2009 |
|
Co-occurrence Models for Image Annotation and Retrieval, , Idiap-RR-22-2009 |
|
Tagging and Retrieving Images with Co-Occurrence Models: from Corel to Flickr, and , Idiap-RR-21-2009 |
|
Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features, and , Idiap-RR-20-2009 |
|
Fast Human Detection in Videos using Joint Appearance and Foreground Learning from Covariances of Image Feature Subsets, and , Idiap-RR-19-2009 |
|
Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity, , Idiap-RR-18-2009 |
|
Speech recognition with speech synthesis models by marginalising over decision tree leaves, , and , Idiap-RR-17-2009 |
|
Measuring the gap between HMM-based ASR and TTS, , and , Idiap-RR-16-2009 |
|
Real-Time ASR from Meetings, , , , , , , , and , Idiap-RR-15-2009 |
|
Robustness of Phase based Features for Speaker Recognition, , and , Idiap-RR-14-2009 |
|
Automatic vs. human question answering over multimedia meeting recordings, and , Idiap-RR-13-2009 |
|
Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, , , and , Idiap-RR-12-2009 |
|
Comparing meeting browsers using a task-based evaluation method, , Idiap-RR-11-2009 |
|
Multiple Object Tracking using Flow Linear Programming, , and , Idiap-RR-10-2009 |
|
ClusterRank: A Graph Based Method for Meeting Summarization, , , and , Idiap-RR-09-2009 |
|
A MAP Approach to Noise Compensation of Speech, , Idiap-RR-08-2009 |
|
Novel initialization methods for Speaker Diarization, , Idiap-RR-07-2009 |
|
Automatic Out-of-Language Detection based on Confidence Measures derived from LVCSR Word and Phone Lattices, , Idiap-RR-06-2009 |
|
Model Adaptation with Least-Squares SVM for Adaptive Hand Prosthetics, , , , and , Idiap-RR-05-2009 |
|
Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features, , and , Idiap-RR-04-2009 |
|
Parts-Based Face Verification using Local Frequency Bands, and , Idiap-RR-03-2009 |
|
Visual activity context for focus of attention estimation in dynamic meetings, , and , Idiap-RR-02-2009 |
|
Support Vector Machines with a Reject Option, , , and , Idiap-RR-01-2009 |
|
2008
CLEF2008 Image Annotation Task: an SVM Confidence-Based Approach, , and , Idiap-RR-77-2008 |
|
Multi-layer Boosting for Pattern Recognition, , Idiap-RR-76-2008 |
|
Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, , and , Idiap-RR-75-2008 |
|
MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, , and , Idiap-RR-74-2008 |
|
Integrating audio and vision for robust automatic gender recognition, and , Idiap-RR-73-2008 |
|
How does a dictation machine recognize speech?, , and , Idiap-RR-72-2008 |
|
Entropy coding of Quantized Spectral Components in FDLP audio codec, , and , Idiap-RR-71-2008 |
|
Modulation Frequency Features For Phoneme Recognition In Noisy Speech, , and , Idiap-RR-70-2008 |
|
Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, , , and , Idiap-RR-69-2008 |
|
Kernel Based Text-Independent Speaker Verification, , and , Idiap-RR-68-2008 |
|
Acoustic Models for Posterior Features in Speech Recognition, , Idiap-RR-67-2008 |
|
Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, , , and , Idiap-RR-66-2008 |
|
Identifying Dominant People in Meetings from Audio-Visual Sensors, and , Idiap-RR-65-2008 |
|
Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, , and , Idiap-RR-64-2008 |
|
Calibration from statistical properties of the visual world, , and , Idiap-RR-63-2008 |
|
Topickr: Flickr Groups and Users Reloaded, and , Idiap-RR-61-2008 |
|
Composite Kernel Learning, , and , Idiap-RR-59-2008 |
|
An Information Theoretic Approach to Speaker Diarization of Meeting Data, , and , Idiap-RR-58-2008 |
|
Role Recognition for Meeting Participants: an Approach Based on Lexical Information and Social Network Analysis, , , , and , Idiap-RR-57-2008 |
|
Continuous Brain-Actuated Control of an Intelligent Wheelchair by Human EEG, , , , , , and , Idiap-RR-53-2008 |
|
Recognition of Anticipatory Behavior from Human EEG, , and , Idiap-RR-52-2008 |
|
Predictive Models for Music, , and , Idiap-RR-51-2008 |
|
Probabilistic Models for Melodic Prediction, , and , Idiap-RR-50-2008 |
|
What Did You Do Today? Discovering Daily Routines from Large-Scale Mobile Data, and , Idiap-RR-49-2008 |
|
Analyzing Interactions Between Navigation Strategies Using a Computational Model of Action Selection, , , , and , Idiap-RR-48-2008 |
|
Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, and , Idiap-RR-47-2008 |
|
Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, , Idiap-RR-46-2008 |
|
Fast Approximate Spoken Term Detection from Sequence of Phonemes, , , and , Idiap-RR-45-2008 |
|
Hilbert Envelope Based Features for Far-Field Speech Recognition, , and , Idiap-RR-42-2008 |
|
Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction, , and , Idiap-RR-41-2008 |
|
Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding, , , and , Idiap-RR-40-2008 |
|
Enhanced Phone Posteriors for Improving Speech Recognition Systems, and , Idiap-RR-39-2008 |
|
understanding metro station usage using closed circuit television cameras analysis, , , , , , and , Idiap-RR-38-2008 |
|
Asynchronous detection and classification of oscillatory brain activity, , and , Idiap-RR-36-2008 |
|
Inference in Switching Linear Dynamical Systems Applied to Noise Robust Speech Recognition of Isolated Digits, , Idiap-RR-35-2008 |
|
Machine Learning for Information Retrieval, , Idiap-RR-34-2008 |
|
A Distance Model for Rhythms, , , and , Idiap-RR-33-2008 |
|
Discovering Human Routines from Cell Phone Data with Topic Models, and , Idiap-RR-32-2008 |
|
Discriminative Keyword Spotting, , and , Idiap-RR-31-2008 |
|
The Projectron: a Bounded Kernel-Based Perceptron, , and , Idiap-RR-30-2008 |
|
Adaptive Beamforming with a Maximum Negentropy Criterion, , , , , and , Idiap-RR-29-2008 |
|
Characterizing the EEG Correlates of Exploratory Behavior, , , and , Idiap-RR-28-2008 |
|
Recognition and Understanding of Meetings Overview of the European AMI and AMIDA Projects, and , Idiap-RR-27-2008 |
|
Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, , and , Idiap-RR-26-2008 |
|
Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition, and , Idiap-RR-25-2008 |
|
Emulating Temporal Receptive Fields of Auditory Mid-Brain Neurons for Automatic Speech Recognition, and , Idiap-RR-24-2008 |
|
A Data-driven Approach to Speech/Non-speech Detection, and , Idiap-RR-23-2008 |
|
Exploiting contextual information for speech/non-speech detection, and , Idiap-RR-22-2008 |
|
Exploiting temporal context for speech/non-speech detection, , and , Idiap-RR-21-2008 |
|
Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition, and , Idiap-RR-20-2008 |
|
Silence Models in Weighted Finite-State Transducers, , Idiap-RR-19-2008 |
|
Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech, , and , Idiap-RR-18-2008 |
|
Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction, , and , Idiap-RR-17-2008 |
|
Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain, , , and , Idiap-RR-16-2008 |
|
Posterior Features Applied to Speech Recognition Tasks with Limited Training Data, , and , Idiap-RR-15-2008 |
|
Using KL-based Acoustic Models in a Large Vocabulary Recognition Task, , and , Idiap-RR-14-2008 |
|
Reverse Correlation for analyzing MLP Posterior Features in ASR, , and , Idiap-RR-13-2008 |
|
On the Combination of Auditory and Modulation Frequency Channels for ASR applications, and , Idiap-RR-12-2008 |
|
A Neural Network based Regression Approach for Recognizing Simultaneous Speech, , , , and , Idiap-RR-10-2008 |
|
Neural Network based Regression for Robust Overlapping Speech Recognition using Microphone Arrays, , , and , Idiap-RR-09-2008 |
|
Predicting the dominant clique in meetings through fusion of nonverbal cues, , , and , Idiap-RR-08-2008 |
|
Maximum Negentropy Beamforming, , , , and , Idiap-RR-07-2008 |
|
Adaptive Beamforming with a Maximum Negentropy Criterion, , , , and , Idiap-RR-06-2008 |
|
Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain, , and , Idiap-RR-05-2008 |
|
Detecting queues at vending machines: a statistical layered approach, and , Idiap-RR-04-2008 |
|
Analyzing Flickr Groups, and , Idiap-RR-03-2008 |
|
Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition, , , , , and , Idiap-RR-02-2008 |
|
2007
A Comparative Psychophysical and EEG Study of Different Feedback Modalities for HRI, , , , and , Idiap-RR-78-2007 |
|
Filter Bank Design based on Minimization of Individual Aliasing Terms for Minimum Mutual Information Subband Adaptive Beamforming, , , , , and , Idiap-RR-77-2007 |
|
Hierarchical Penalization, , and , Idiap-RR-76-2007 |
|
Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, and , Idiap-RR-75-2007 |
|
Adaptive Beamforming with a Minimum Mutual Information Criterion, , , , , and , Idiap-RR-74-2007 |
|
Minimum Mutual Information Beamforming for Simultaneous Active Speakers, , , , , and , Idiap-RR-73-2007 |
|
Effective post-processing for single-channel frequency-domain speech enhancement, , Idiap-RR-71-2007 |
|
A Generative Model for Rhythms, , , and , Idiap-RR-70-2007 |
|
Classifying Materials in the Real World, , , and , Idiap-RR-69-2007 |
|
Fast Human Detection from Videos Using Covariance Features, and , Idiap-RR-68-2007 |
|
Multi-Layer Background Subtraction Based on Color and Texture, and , Idiap-RR-67-2007 |
|
LP-TRAPs in all senses, , Idiap-RR-66-2007 |
|
Exploiting Contextual Information for Improved Phoneme Recognition, , , and , Idiap-RR-65-2007 |
|
Discriminative Cue Integration for Medical Image Annotation, , and , Idiap-RR-64-2007 |
|
On-line Independent Support Vector Machines for Cognitive Systems, , , , and , Idiap-RR-63-2007 |
|
Daily Routine Classification from Mobile Phone Data, and , Idiap-RR-62-2007 |
|
The use of brain-computer interfacing for ambient intelligence, , , , , and , Idiap-RR-61-2007 |
|
ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIONS USING SPEAKER DIARIZATION STRATEGIES, , , and , Idiap-RR-60-2007 |
|
Object Category Detection using Audio-visual Cues, , , , and , Idiap-RR-58-2007 |
|
Human-Centered Computing: Toward a Human Revolution, , , and , Idiap-RR-57-2007 |
|
Stationary Features and Cat Detection, and , Idiap-RR-56-2007 |
|
Robust overlapping speech recognition based on neural networks, , and , Idiap-RR-55-2007 |
|
MLP-based Log Spectral Energy Mapping for Robust Overlapping Speech Recognition, , , and , Idiap-RR-54-2007 |
|
Non-linear Spectral Contrast Stretching for In-car Speech Recognition, and , Idiap-RR-53-2007 |
|
A Bayesian Switching Linear Dynamical System for Scale-Invariant robust speech extraction, and , Idiap-RR-52-2007 |
|
COMBINATION OF AGGLOMERATIVE AND SEQUENTIAL CLUSTERING FOR SPEAKER DIARIZATION, , and , Idiap-RR-51-2007 |
|
Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues, and , Idiap-RR-50-2007 |
|
The COLD Database, , , , and , Idiap-RR-49-2007 |
|
Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction, , , and , Idiap-RR-48-2007 |
|
Unsupervised Learning for Information Distillation, , Idiap-RR-47-2007 |
|
Recognition and Understanding of Meetings The AMI and AMIDA Projects, , and , Idiap-RR-46-2007 |
|
Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications, and , Idiap-RR-45-2007 |
|
Theoretical Foundations for Large-Margin Kernel-Based Continuous Speech Recognition, , Idiap-RR-44-2007 |
|
Non-uniform QMF Decomposition for Wide-band Audio Coding based on Frequency Domain Linear Prediction, , , and , Idiap-RR-43-2007 |
|
Detection and Recognition of Number Sequences in Spoken Utterances, and , Idiap-RR-42-2007 |
|
Posterior-Based Features and Distances in Template Matching for Speech Recognition, and , Idiap-RR-41-2007 |
|
Role Recognition in Radio Programs using Social Affiliation Networks and Mixtures of Discrete Distributions: an Approach Inspired by Social Cognition, and , Idiap-RR-40-2007 |
|
A Novel Statistical Generative Model Dedicated To Face Recognition, and , Idiap-RR-39-2007 |
|
A Discriminative Kernel-based Model to Rank Images from Text Queries, and , Idiap-RR-38-2007 |
|
To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, , and , Idiap-RR-37-2007 |
|
Mapping Nonverbal Communication into Social Status: Automatic Recognition of Journalists and Non-journalists in Radio News, , Idiap-RR-33-2007 |
|
Comparing Different Word Lattice Rescoring Approaches Towards Keyword Spotting, , , and , Idiap-RR-32-2007 |
|
AGGLOMERATIVE INFORMATION BOTTLENECK FOR SPEAKER DIARIZATION OF MEETINGS DATA, , and , Idiap-RR-31-2007 |
|
Broadcast News Story Segmentation Using Social Network Analysis and Hidden Markov Models, and , Idiap-RR-30-2007 |
|
Using Audio and Video Features to Classify the Most Dominant Person in a Group Meeting, , , , , , , , and , Idiap-RR-29-2007 |
|
Significance of Contextual Information in Phoneme Recognition, , , and , Idiap-RR-28-2007 |
|
Analysis of Confusion Matrix to Combine Evidence for Phoneme Recognition, , , and , Idiap-RR-27-2007 |
|
Bayesian Controller for a Novel Semi-Autonomous Navigation Concept, , , and , Idiap-RR-26-2007 |
|
Feature Extraction for Multi-class BCI using Canonical Variates Analysis, , , , and , Idiap-RR-23-2007 |
|
Keyword Spotting on Word Lattices, and , Idiap-RR-22-2007 |
|
Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, and , Idiap-RR-21-2007 |
|
A Cognitive and Unsupervised MAP Adaptation Approach to the Recognition of the Focus of Attention from Head Pose, and , Idiap-RR-20-2007 |
|
Sparse Probabilistic Classifiers, and , Idiap-RR-19-2007 |
|
More Efficiency in Multiple Kernel Learning, , , and , Idiap-RR-18-2007 |
|
Confidence-based Cue Integration for Visual Place Recognition, and , Idiap-RR-17-2007 |
|
Scalable Wide-band Audio Codec based on Frequency Domain Linear Prediction, , , and , Idiap-RR-16-2007 |
|
Learning the Inter-frame Distance for Discriminative Template-based Keyword Detection, and , Idiap-RR-15-2007 |
|
Joint Bi-Modal Face and Speaker Authentication using Explicit Polynomial Expansion, , Idiap-RR-14-2007 |
|
Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics, and , Idiap-RR-13-2007 |
|
A study of phoneme and grapheme based context-dependent ASR systems, and , Idiap-RR-12-2007 |
|
Exploiting Phoneme Similarities in Hybrid HMM-ANN Keyword Spotting, , and , Idiap-RR-11-2007 |
|
On Confusions in a Phoneme Recognizer, , and , Idiap-RR-10-2007 |
|
Multi-stream Features Combination based on Dempster-Shafer Rule for LVCSR System, , and , Idiap-RR-09-2007 |
|
Hierarchical Neural Networks Feature Extraction for LVCSR system, , , , , and , Idiap-RR-08-2007 |
|
Learning the structure of image collections with latent aspect models, , Idiap-RR-06-2007 |
|
Truncation Confusion Patterns in Onset Consonants, , Idiap-RR-05-2007 |
|
Face Authentication with Salient Local Features and Static Bayesian Network, and , Idiap-RR-04-2007 |
|
Biometric Person Authentication IS A Multiple Classifier Problem, and , Idiap-RR-03-2007 |
|
Dynamical Dirichlet Mixture Model, , and , Idiap-RR-02-2007 |
|
2006
Face Detection and Verification using Local Binary Patterns, , Idiap-RR-79-2006 |
|
Probabilistic Graphical Models for Human Interaction Analysis, , Idiap-RR-78-2006 |
|
Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays, , Idiap-RR-77-2006 |
|
Machine Learning Approaches to Text Representation using Unlabeled Data, , Idiap-RR-76-2006 |
|
Semantic Segmentation of Radio Programs Using Social Network Analysis and Duration Distribution Modeling, , and , Idiap-RR-75-2006 |
|
Observations on Multi-Band Asynchrony in Distant Speech Recordings, , Idiap-RR-74-2006 |
|
Two-Handed Gestures for Human-Computer Interaction, , Idiap-RR-73-2006 |
|
Discriminant Models for Text-independent Speaker Verification, , Idiap-RR-70-2006 |
|
Master Thesis: Integration of the Harmonic plus Noise Model (HNM) into the Hidden Markov Model-Based Speech Synthesis System (HTS), , Idiap-RR-69-2006 |
|
Identifying unexpected words using in-context and out-of-context phoneme posteriors, and , Idiap-RR-68-2006 |
|
Posterior Based Keyword Spotting with A Priori Thresholds, , , and , Idiap-RR-67-2006 |
|
SVM-based Transfer of Visual Knowledge Across Robotic Platforms, , and , Idiap-RR-65-2006 |
|
Model Adaptation for Sentence Unit Segmentation from Speech, , Idiap-RR-64-2006 |
|
Analyzing Group Interactions in Conversations: a Review, , Idiap-RR-63-2006 |
|
A Generalized Dynamic Composition Algorithm of Weighted Finite State Transducers for Large Vocabulary Speech Recognition, , and , Idiap-RR-62-2006 |
|
Combination of Acoustic Classifiers based on Dempster-Shafer Theory of evidence, and , Idiap-RR-61-2006 |
|
An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, , and , Idiap-RR-60-2006 |
|
Wide-Band Perceptual Audio Coding based on Frequency-Domain Linear Prediction, , and , Idiap-RR-58-2006 |
|
Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, , and , Idiap-RR-57-2006 |
|
Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations, , and , Idiap-RR-56-2006 |
|
A Bayesian Alternative to Gain Adaptation in Autoregressive Hidden Markov Models, and , Idiap-RR-55-2006 |
|
A supervised learning approach based on STDP and polychronization in spiking neuron networks, , and , Idiap-RR-54-2006 |
|
Melanoma Recognition using Kernel Classifiers, , and , Idiap-RR-53-2006 |
|
Incremental Learning for Place Recognition in Dynamic Environments, , , and , Idiap-RR-52-2006 |
|
The more you learn, the less you store: memory-controlled incremental SVM, and , Idiap-RR-51-2006 |
|
Unified Inference for Variational Bayesian Linear Gaussian State-Space Models, and , Idiap-RR-50-2006 |
|
Detection and Application of Influence Rankings in Small Group Meetings, , , and , Idiap-RR-49-2006 |
|
Analysis and Classification of EEG Signals using Probabilistic Models for Brain Computer Interfaces, , Idiap-RR-48-2006 |
[URL] |
Robust-to-Illumination Face Localisation using Active Shape Models and Local Binary Patterns, , and , Idiap-RR-47-2006 |
|
Audio Coding Based on Long Temporal Segments: Experiments With Quantization of Excitation Signal, and , Idiap-RR-46-2006 |
|
A Multitask Learning Approach to Document Representation using Unlabeled Data, and , Idiap-RR-44-2006 |
|
Detecting Intentional Mental Transitions in an Asynchronous BCI, , , , and , Idiap-RR-43-2006 |
|
Recognizing People's Focus of Attention from Head Poses: a Study, and , Idiap-RR-42-2006 |
|
Exploring Contextual Information in a Layered Framework for Group Action Recognition, , and , Idiap-RR-41-2006 |
|
Tracking Attention for Multiple People: Wandering Visual Focus of Attention Estimation, , , and , Idiap-RR-40-2006 |
|
Detecting Abandoned Luggage Items in a Public Space, , and , Idiap-RR-39-2006 |
|
Multi-Person Tracking in Meetings: A Comparative Study, , , , , and , Idiap-RR-38-2006 |
|
2D Multi-Person Tracking: A Comparative Study in AMI Meetings, , , , , and , Idiap-RR-37-2006 |
|
Investigating Lexical Substitution Scoring for Subtitle Generation, , , , and , Idiap-RR-36-2006 |
|
Role Recognition in Broadcast News Using Social Network Analysis and Duration Distribution Modeling, , Idiap-RR-35-2006 |
|
On the Recent Use of Local Binary Patterns for Face Authentication, , and , Idiap-RR-34-2006 |
|
A Neural Network to Retrieve Images from Text Queries, and , Idiap-RR-33-2006 |
|
Learning to Retrieve Images from Text Queries with a Discriminative Model, , and , Idiap-RR-32-2006 |
|
Indexation de Documents Manuscrits, , Idiap-RR-31-2006 |
|
Audio Coding Based on Long Temporal Contexts, , , and , Idiap-RR-30-2006 |
|
Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech, and , Idiap-RR-29-2006 |
|
Multi-stream Processing for Noise Robust Speech Recognition, , Idiap-RR-28-2006 |
|
Sociometry Based Multiparty Audio Recordings Summarization, , Idiap-RR-27-2006 |
|
Further Applications of Sector-Based Detection and Short-Term Clustering, , Idiap-RR-26-2006 |
|
Estimating the Confidence Interval of Expected Performance Curve in Biometric Authentication Using Joint Bootstrap, and , Idiap-RR-25-2006 |
|
Speech Enhancement and Recognition in Meetings with an Audio-Visual Sensor Array, , and , Idiap-RR-24-2006 |
|
Using Posterior-Based Features in Template Matching for Speech Recognition, , and , Idiap-RR-23-2006 |
|
The segmentation of multi-channel meeting recordings for automatic speech recognition, , and , Idiap-RR-22-2006 |
|
Juicer: A Weighted Finite-State Transducer speech decoder, , , , , and , Idiap-RR-21-2006 |
|
Discriminant linear processing of time-frequency plane, and , Idiap-RR-20-2006 |
|
Infinite Models for Speaker Clustering, , Idiap-RR-19-2006 |
|
Bi-Modal Face and Speech Authentication: a BioLogin Demonstration System, , , and , Idiap-RR-18-2006 |
|
Natural Scene Image Modeling using Color and Texture Visterms, and , Idiap-RR-17-2006 |
|
Online Classifier Adaptation in Brain-Computer Interfaces, and , Idiap-RR-16-2006 |
|
A Discriminative Approach for the Retrieval of Images from Text Queries, , and , Idiap-RR-15-2006 |
|
Discriminative Kernel-Based Phoneme Sequence Recognition, , , , and , Idiap-RR-14-2006 |
|
Online statistical estimation for vehicle control, , Idiap-RR-13-2006 |
|
Nearly optimal exploration-exploitation decision thresholds, , Idiap-RR-12-2006 |
|
Spiking Neuron Networks: A survey, , Idiap-RR-11-2006 |
|
A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, and , Idiap-RR-10-2006 |
|
Unsupervised Spectral Subtraction for Noise-Robust ASR on Unknown Transmission Channels, , and , Idiap-RR-09-2006 |
|
Switching Linear Dynamical Systems for Noise Robust Speech Recognition, and , Idiap-RR-08-2006 |
|
Active Shape Models Using Local Binary Patterns, and , Idiap-RR-07-2006 |
|
Face Authentication Using Adapted Local Binary Pattern Histograms, and , Idiap-RR-06-2006 |
|
Speech Coding based on Spectral Dynamics, , , and , Idiap-RR-05-2006 |
|
Revisiting Doddington's Zoo: A Systematic Method to Assess User-dependent Variabilities, , and , Idiap-RR-04-2006 |
|
Hand Posture Classification and Recognition using the Modified Census Transform, , and , Idiap-RR-02-2006 |
|
Towards using slide information to enhance speech transcription of meetings, , and , Idiap-RR-01-2006 |
|
2005
Using more informative posterior probabilities for speech recognition, , , and , Idiap-RR-91-2005 |
|
Using Auxiliary Sources of Knowledge for Automatic Speech Recognition, , Idiap-RR-90-2005 |
|
A Generative Model for Music Transcription, , and , Idiap-RR-89-2005 |
|
Optimal Spike-Timing Dependent Plasticity for Precise Action Potential Firing in Supervised Learning, , , and , Idiap-RR-88-2005 |
|
Efficient Kalman Smoothing for Harmonic State-Space Models, , Idiap-RR-87-2005 |
|
Probabilistic Tagging of Unstructured Genealogical Records, and , Idiap-RR-86-2005 |
|
Face Authentication Based on Local Features and Generative Models, , Idiap-RR-85-2005 |
|
Bayesian Factorial Linear Gaussian State-Space Models for Biosignal Decomposition, and , Idiap-RR-84-2005 |
|
The AMI meeting corpus: a pre-announcement, , , , , , , , , , , , , , , , and , Idiap-RR-82-2005 |
|
Person Authentication using Brainwaves (EEG) and Maximum A Posteriori Model Adaptation, and , Idiap-RR-81-2005 |
|
Tracking the Multi Person Wandering Visual Focus of Attention, , , and , Idiap-RR-80-2005 |
|
Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition, , and , Idiap-RR-79-2005 |
|
Sociometry Based Multiparty Audio Recordings Segmentation, , Idiap-RR-78-2005 |
|
A Kernel Trick For Sequences Applied to Text-Independent Speaker Verification Systems, and , Idiap-RR-77-2005 |
|
Local Binary Patterns as an Image Preprocessing for Face Authentication, , and , Idiap-RR-76-2005 |
|
Kernelized Infomax Clustering, and , Idiap-RR-73-2005 |
|
Stable Directed Belief Propagation in Gaussian DAGs using the auxiliary variable trick, and , Idiap-RR-72-2005 |
|
Construction and comparison of approximations for switching linear gaussian state space models, , Idiap-RR-71-2005 |
|
Writer Identification for Smart Meeting Room Systems, , , , , and , Idiap-RR-70-2005 |
|
The Multi-Channel Wall Street Journal Audio Visual Corpus (MC-WSJ-AV): Specification and Initial Experiments, , , and , Idiap-RR-69-2005 |
|
Finding groups of people in Google news, and , Idiap-RR-68-2005 |
|
A Discriminative Decoder for the Recognition of Phoneme Sequences, and , Idiap-RR-67-2005 |
|
Improving Speech Recognition Using a Data-Driven Approach, , and , Idiap-RR-66-2005 |
|
Using Pitch as Prior Knowledge in Template-Based Speech Recognition, , and , Idiap-RR-65-2005 |
|
Towards ASR Based on Hierarchical Posterior-Based Keyword Recognition, and , Idiap-RR-64-2005 |
|
The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input), , and , Idiap-RR-63-2005 |
|
Multi-stream ASR: Oracle Test and Embedded Training, , and , Idiap-RR-62-2005 |
|
Can a Professional Imitator Fool a GMM-Based Speaker Verification System?, and , Idiap-RR-61-2005 |
|
Performance Generalization in Biometric Authentication Using Joint User-Specific and Sample Bootstraps, , and , Idiap-RR-60-2005 |
|
Using Chimeric Users to Construct Fusion Classifiers in Biometric Authentication Tasks: An Investigation, and , Idiap-RR-59-2005 |
|
Chord Representations for Probabilistic Models, , and , Idiap-RR-58-2005 |
|
A Probabilistic Model for Chord Progressions, , and , Idiap-RR-57-2005 |
|
Modeling semantic aspects for cross-media image indexing, and , Idiap-RR-56-2005 |
|
Measuring the Performance of Face Localization Systems, , , and , Idiap-RR-53-2005 |
|
Threshold Selection for Unsupervised Detection, with an Application to Microphone Arrays, , , and , Idiap-RR-52-2005 |
|
Modeling Interactions from Email Communication, , , and , Idiap-RR-51-2005 |
|
Extracting Information from Multimedia Meeting Collections, , and , Idiap-RR-50-2005 |
|
Multiview Face Detection, , and , Idiap-RR-49-2005 |
|
Learning influence among interacting Markov chains, , , and , Idiap-RR-48-2005 |
|
Continuous Microphone Array Speech Recognition on Wall Street Journal Corpus, , and , Idiap-RR-47-2005 |
|
Efficient Diffusion-based Illumination Normalization for Face Verification, , and , Idiap-RR-46-2005 |
|
Spectral Entropy Feature in Multi-stream for Robust ASR, and , Idiap-RR-45-2005 |
|
Compensating User-Specific Information with User-Independent Information in Biometric Authentication Tasks, and , Idiap-RR-44-2005 |
|
Towards Explaining the Success (Or Failure) of Fusion in Biometric Authentication, and , Idiap-RR-43-2005 |
|
Unsupervised Spectral Subtraction for Noise-Robust ASR, , , and , Idiap-RR-42-2005 |
|
Hierarchical approach for spotting keywords, , Idiap-RR-41-2005 |
|
A Thousand Words in a Scene, , , and , Idiap-RR-40-2005 |
|
Benchmarking Non-Parametric Statistical Tests, , and , Idiap-RR-38-2005 |
|
Harmonic Plus Noise Model for Concatenative Speech Synthesis, , Idiap-RR-37-2005 |
|
Application of Information Retrieval Technologies to Presentation Slides, and , Idiap-RR-36-2005 |
|
A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking, and , Idiap-RR-35-2005 |
|
Subjective Evaluation of Join Cost and Smoothing Methods for Unit Selection Speech Synthesis, and , Idiap-RR-34-2005 |
|
A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space, , , and , Idiap-RR-33-2005 |
|
A Kernel Classifier for Distributions, and , Idiap-RR-32-2005 |
|
Multimodal Integration for Meeting Group Action Segmentation and Recognition, , , , , and , Idiap-RR-31-2005 |
|
Integrating co-occurrence and spatial contexts on patch-based scene segmentation, , , and , Idiap-RR-30-2005 |
|
Gradient estimates of return, and , Idiap-RR-29-2005 |
|
Joint Speech and Speaker Recognition, , Idiap-RR-28-2005 |
|
Audio-visual probabilistic tracking of multiple speakers in meetings, , , and , Idiap-RR-27-2005 |
|
A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification, , and , Idiap-RR-26-2005 |
|
Hierarchical Multi-Stream Posterior Based Speech Recognition System, , and , Idiap-RR-25-2005 |
|
Two-Handed Gesture Recognition, and , Idiap-RR-24-2005 |
|
Developing and Enhancing Posterior Based Speech Recognition Systems, , , and , Idiap-RR-23-2005 |
|
Joint Training of Multi-Stream HMMs, , Idiap-RR-22-2005 |
|
Inferring Document Similarity from Hyper-links, and , Idiap-RR-21-2005 |
|
Can Chimeric Persons Be Used in Multimodal Biometric Authentication Experiments?, and , Idiap-RR-20-2005 |
|
On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, , and , Idiap-RR-19-2005 |
|
Multi-resolution RASTA filtering for TANDEM-based ASR, and , Idiap-RR-18-2005 |
|
Local Features and 1D-HMMs for Fast and Robust Face Authentication, , Idiap-RR-17-2005 |
|
Improving Continuous Speech Recognition System Performance with Grapheme Modelling, , , and , Idiap-RR-16-2005 |
|
Semi-supervised Meeting Event Recognition with Adapted HMMs, , and , Idiap-RR-15-2005 |
|
Constructing visual models with a latent space approach, , , and , Idiap-RR-14-2005 |
|
A Frequency-Domain Silence Noise Model, , and , Idiap-RR-13-2005 |
|
A Neural Network for Text Representation, and , Idiap-RR-12-2005 |
|
OCR Based Slide Retrieval, , and , Idiap-RR-11-2005 |
|
Spectral Entropy Feature in Full-Combination Multi-stream for Robust ASR, and , Idiap-RR-10-2005 |
|
On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, , and , Idiap-RR-09-2005 |
|
Generative Temporal ICA for Classification in Asynchronous BCI Systems, and , Idiap-RR-08-2005 |
|
Sports Event Recognition using Layered HMMs, and , Idiap-RR-07-2005 |
|
Construction and comparison of approximations for switching linear gaussian state space models, and , Idiap-RR-06-2005 |
|
Evaluation of Multiple Cues Head Pose Tracking Algorithm in Indoor Environments, and , Idiap-RR-05-2005 |
|
Multi Channel Sequence Processing, and , Idiap-RR-04-2005 |
|
Speech Acquisition in Meetings with an Audio-Visual Sensor Array, , , , and , Idiap-RR-03-2005 |
|
A Meeting Browser Evaluation Test, , , and , Idiap-RR-02-2005 |
|
EER of Fixed and Trainable Fusion Classifiers: A Theoretical Study with Application to Biometric Authentication Tasks, and , Idiap-RR-01-2005 |
|
2004
A Stable Switching Kalman Smoother, , Idiap-RR-89-2004 |
|
Variational Information Maximization in Gaussian Channels, and , Idiap-RR-88-2004 |
|
The Auxiliary Variable Trick for deriving Kalman Smoothers, , Idiap-RR-87-2004 |
|
An Auxiliary Variational Method, and , Idiap-RR-86-2004 |
|
Variational Information Maximization for Population Coding, , Idiap-RR-85-2004 |
|
Stochastic techniques in deriving perceptual knowledge, , Idiap-RR-84-2004 |
|
Effect of Segmentation Method on Video Retrieval Performance, and , Idiap-RR-83-2004 |
|
Effect of Recognition Errors on Text Clustering, and , Idiap-RR-82-2004 |
|
Semi-supervised Adapted HMMs for Unusual Event Detection, , and , Idiap-RR-80-2004 |
|
Modeling Scenes with Local Descriptors and Latent Aspects, , , , , and , Idiap-RR-79-2004 |
|
Face Authentication using Client-specific Matching Pursuit, , , and , Idiap-RR-78-2004 |
|
EEG Classification using Generative Independent Component Analysis, and , Idiap-RR-77-2004 |
|
On Performance / Robustness / Complexity Trade-Offs in Face Verification, , and , Idiap-RR-74-2004 |
|
On the Use of Information Retrieval Measures for Speech Recognition Evaluation, , , , , , and , Idiap-RR-73-2004 |
|
Estimates of Parameter Distributions for Optimal Action Selection, and , Idiap-RR-72-2004 |
|
Tracking People in Meetings with Particles, , , , and , Idiap-RR-71-2004 |
|
Nonlinear Feature Transformations for Noise Robust Speech Recognition, , Idiap-RR-70-2004 |
|
A Study of the Effects of Score Normalisation Prior to Fusion in Biometric Authentication Tasks, and , Idiap-RR-69-2004 |
|
A Novel Approach to Combining Client-Dependent and Confidence Information in Multimodal Biometric, and , Idiap-RR-68-2004 |
|
Sector-Based Detection for Hands-Free Speech Enhancement in Cars, , and , Idiap-RR-67-2004 |
|
Multimodal Multispeaker Probabilistic Tracking in Meetings, , , and , Idiap-RR-66-2004 |
|
Improving Fusion with Margin-Derived Confidence In Biometric Authentication Tasks, and , Idiap-RR-63-2004 |
|
A Unified Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification, and , Idiap-RR-62-2004 |
|
Motion likelihood and proposal modeling in Model-Based Stochastic Tracking, and , Idiap-RR-61-2004 |
|
PLP²: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns, , and , Idiap-RR-60-2004 |
|
LP-TRAP: Linear predictive temporal patterns, , and , Idiap-RR-59-2004 |
|
Towards using hierarchical posteriors for flexible automatic speech recognition systems, , , , , and , Idiap-RR-58-2004 |
|
Are two Classifiers performing equally? A treatment using Bayesian Hypothesis Testing, , Idiap-RR-57-2004 |
|
Invariances in Kernel Methods: From Samples to Objects, and , Idiap-RR-56-2004 |
|
Pseudo-Syntactic Language Modeling for Disfluent Speech Recognition, , Idiap-RR-55-2004 |
|
A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers, and , Idiap-RR-54-2004 |
|
A Meeting Browser Evaluation Test, , , and , Idiap-RR-53-2004 |
|
Improving Single Modal and Multimodal Biometric Authentication Using F-ratio Client-Dependent Normalisation, and , Idiap-RR-52-2004 |
|
Detecting Group Interest-level in Meetings, , , and , Idiap-RR-51-2004 |
|
HMM/ANN Based Spectral Peak Location Estimation for Noise Robust Speech Recognition, , and , Idiap-RR-50-2004 |
|
Boosting word error rates, and , Idiap-RR-49-2004 |
|
Phoneme vs Grapheme Based Automatic Speech Recognition, , , and , Idiap-RR-48-2004 |
|
An Investigation of F-ratio Client-Dependent Normalisation on Biometric Authentication Tasks, and , Idiap-RR-46-2004 |
|
Database, Protocol and Tools for Evaluating Score-Level Fusion Algorithms in Biometric Authentication, and , Idiap-RR-44-2004 |
|
Evidences of Equal Error Rate Reduction in Biometric Authentication Fusion, and , Idiap-RR-43-2004 |
|
Large Scale Machine Learning, , Idiap-RR-42-2004 |
|
User-Customized Password Speaker Verification Using Multiple Reference and Background Models, and , Idiap-RR-41-2004 |
|
Phase AutoCorrelation (PAC) Features for Noise Robust ASR, , , and , Idiap-RR-40-2004 |
|
HMM and IOHMM for the Recognition of Mono- and Bi-Manual 3D Hand Gestures, , and , Idiap-RR-39-2004 |
|
User Authentication via Adapted Statistical Models of Face Images, , and , Idiap-RR-38-2004 |
|
Multi-resolution Spectral Entropy Based Feature for Robust ASR, , , and , Idiap-RR-37-2004 |
|
On Local Features for Face Verification, and , Idiap-RR-36-2004 |
|
Robust Audio Segmentation, , and , Idiap-RR-35-2004 |
|
Significance Tests for Bizarre Measures in 2-Class Classification Tasks, , and , Idiap-RR-34-2004 |
|
Modeling Individual and Group Actions in Meetings With Layered HMMs, , , , and , Idiap-RR-33-2004 |
|
Browsing Recorded Meetings with Ferret, , and , Idiap-RR-32-2004 |
|
Noisy Text Clustering, and , Idiap-RR-31-2004 |
|
PLSA-based Image Auto-Annotation: Constraining the Latent Space, and , Idiap-RR-30-2004 |
|
New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments, , , and , Idiap-RR-29-2004 |
|
AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking, , and , Idiap-RR-28-2004 |
|
On the Adequacy of Baseform Pronunciations and Pronunciation Variants, and , Idiap-RR-27-2004 |
|
Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis, and , Idiap-RR-26-2004 |
|
Order Matters: A Distributed Sampling Method for Multi-Object Tracking, , Idiap-RR-25-2004 |
|
Multimodal Group Action Clustering in Meetings, , , , and , Idiap-RR-24-2004 |
|
Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition, and , Idiap-RR-23-2004 |
|
Using RASTA in task independent TANDEM feature extraction, , and , Idiap-RR-22-2004 |
|
Modelling Auxiliary Features in Tandem Systems, , , and , Idiap-RR-21-2004 |
|
Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR, , , and , Idiap-RR-20-2004 |
|
Entropy Based Combination of Tandem Representations for Noise Robust ASR, , , , and , Idiap-RR-19-2004 |
|
How Do Correlation and Variance of Base-Experts Affect Fusion in Biometric Authentication Tasks?, and , Idiap-RR-18-2004 |
|
Towards Predicting Optimal Subsets of Base-Experts in Biometric Authentication Task, and , Idiap-RR-17-2004 |
|
A New Speech Recognition Baseline System for Numbers 95 Version 1.3 Based on Torch, and , Idiap-RR-16-2004 |
|
A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays, and , Idiap-RR-15-2004 |
|
Short-Term Spatio-Temporal Clustering of Sporadic and Concurrent Events, , and , Idiap-RR-14-2004 |
|
Sequence Classification with Input-Output Hidden Markov Models, and , Idiap-RR-13-2004 |
|
Application of Information Retrieval Techniques to Single Writer Documents, , Idiap-RR-12-2004 |
|
Assessing Scene Structuring in Consumer Videos, , , , and , Idiap-RR-11-2004 |
|
On the Use of Speech and Face Information for Identity Verification, and , Idiap-RR-10-2004 |
|
Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, , , , and , Idiap-RR-09-2004 |
|
Effect of Recognition Errors on Information Retrieval Performance, , Idiap-RR-08-2004 |
|
Estimating the Quality of Face Localization for Face Verification, , , and , Idiap-RR-07-2004 |
|
Links between Perceptrons, MLPs and SVMs, and , Idiap-RR-06-2004 |
|
Theme Topic Mixture Model: A Graphical Model for Document Representation, and , Idiap-RR-05-2004 |
|
Statistical Transformation Techniques for Face Verification Using Faces Rotated in Depth, and , Idiap-RR-04-2004 |
|
Noisy Text Categorization, , Idiap-RR-03-2004 |
|
Making Retrieval Faster Through Document Clustering, and , Idiap-RR-02-2004 |
|
Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication, and , Idiap-RR-01-2004 |
|
2003
The Expected Performance Curve, , and , Idiap-RR-85-2003 |
|
The Expected Performance Curve: a New Assessment Measure for Person Authentication, and , Idiap-RR-84-2003 |
|
A Statistical Significance Test for Person Authentication, and , Idiap-RR-83-2003 |
|
Some Emerging Concepts in Speech Recognition, and , Idiap-RR-82-2003 |
|
Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research, and , Idiap-RR-81-2003 |
|
On Performance Evaluation of Face Detection and Localization Algorithms, , , and , Idiap-RR-80-2003 |
|
Reconnaissance de gestes 3D bi-manuels, , , and , Idiap-RR-79-2003 |
|
A Probabilistic Framework for Joint Head Tracking and Pose Estimation, and , Idiap-RR-78-2003 |
|
Adapted Generative Models For Face Verification, , and , Idiap-RR-76-2003 |
|
Tangent Vector Kernels for Invariant Image Classification with SVMs, and , Idiap-RR-75-2003 |
|
Textual Data Representation, and , Idiap-RR-74-2003 |
|
Embedding Motion in Model-Based Stochastic Tracking, , and , Idiap-RR-72-2003 |
|
A Color and Gradient Local Descriptor Fusion Scheme For Object Recognition, and , Idiap-RR-71-2003 |
|
Real-Time Face Detection Using Boosting Learning in Hierarchical Feature Spaces, , and , Idiap-RR-70-2003 |
|
Online Policy Adaptation for Ensemble Classifiers, and , Idiap-RR-69-2003 |
|
Improving Face Verification using Symmetric Transformation, , Idiap-RR-68-2003 |
|
A Symmetric Transformation for LDA-based Face Verification, , Idiap-RR-67-2003 |
|
Face Verification using LDA and MLP on the BANCA database, , Idiap-RR-66-2003 |
|
Boosting Pixel-based Classifiers for Face Verification, and , Idiap-RR-65-2003 |
|
EEG-based BCI Systems and IDIAP EEG Database, and , Idiap-RR-64-2003 |
|
Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures, , and , Idiap-RR-63-2003 |
|
An Investigation of Spectral Subband Centroids for Speaker Authentication, , and , Idiap-RR-62-2003 |
|
Noisy Text Categorization, , Idiap-RR-61-2003 |
|
Face Verification Using Synthesized Non-Frontal Models, and , Idiap-RR-60-2003 |
|
Why Do Multi-Stream, Multi-Band and Multi-Modal Approaches Work on Biometric User Authentication Tasks?, and , Idiap-RR-59-2003 |
|
Vessel Segmentation and Branching Detection using an Adaptive Profile Kalman Filter in Retinal Blood Vessel Structure Analysis, and , Idiap-RR-58-2003 |
|
On Use of Task Independent Training Data in Tandem Feature Extraction, and , Idiap-RR-57-2003 |
|
Spectral Entropy Based Feature for Robust ASR, , , and , Idiap-RR-56-2003 |
|
Clustering And Segmenting Speakers And Their Locations In Meetings, , and , Idiap-RR-55-2003 |
|
Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition, , , and , Idiap-RR-54-2003 |
|
Confidence Measures in Multiple pronunciations Modeling For Speaker Verification, and , Idiap-RR-53-2003 |
|
Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition, , and , Idiap-RR-52-2003 |
|
An Alternative To Silence Removal For Text-Independent Speaker Verification, and , Idiap-RR-51-2003 |
|
TRAP-TANDEM: Data-driven extraction of temporal features from speech, , Idiap-RR-50-2003 |
|
HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems, and , Idiap-RR-49-2003 |
|
Comparison and Combination of Features in a Hybrid HMM/MLP and a HMM/GMM Speech Recognition System, , , , and , Idiap-RR-48-2003 |
|
Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR, , , and , Idiap-RR-47-2003 |
|
Audio-Video Person Clustering in Video Databases, and , Idiap-RR-46-2003 |
|
Towards Computer Understanding of Human Interactions, , , and , Idiap-RR-45-2003 |
|
Text detection and recognition in images and video sequences, , Idiap-RR-44-2003 |
|
Video Text Segmentation Using Particle Filters, and , Idiap-RR-43-2003 |
|
A Localization/Verification Scheme for Finding Text in Images and Video Frames Based on Contrast Independent Features and Machine Learning Methods, and , Idiap-RR-42-2003 |
|
Boosting HMMs with an application to speech recognition, and , Idiap-RR-41-2003 |
|
Noise Robust Discriminative Models, and , Idiap-RR-40-2003 |
|
An Online Audio Indexing System, , and , Idiap-RR-39-2003 |
|
A Robust Speaker Clustering Algorithm, and , Idiap-RR-38-2003 |
|
Phoneme-Grapheme Based Speech Recognition System, , , and , Idiap-RR-37-2003 |
|
Nonlinear Spectral Transformations for Robust Speech Recognition, , and , Idiap-RR-36-2003 |
|
Adaptive Importance Sampling to Accelerate Training of a Neural Probabilistic Language Model, and , Idiap-RR-35-2003 |
|
HMM Mixtures (HMM2) for Robust Speech Recognition, , Idiap-RR-34-2003 |
|
On Multi-scale Fourier Transform Analysis of Speech Signals, and , Idiap-RR-33-2003 |
|
On Factorizing Spectral Dynamics for Robust Speech Recognition, , , and , Idiap-RR-32-2003 |
|
On Automatic Annotation of Images with Latent Space Models, and , Idiap-RR-31-2003 |
|
On the Need for On-Line Learning in Brain-Computer Interfaces, , Idiap-RR-30-2003 |
|
From Samples to Objects in Kernel Methods, and , Idiap-RR-29-2003 |
|
Speech Recognition with Auxiliary Information, , Idiap-RR-28-2003 |
|
Automatic Analysis of Multimodal Group Actions in Meetings, , , , , and , Idiap-RR-27-2003 |
|
Non-Linear Variance Reduction Techniques in Biometric Authentication, and , Idiap-RR-26-2003 |
|
A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, , , and , Idiap-RR-25-2003 |
|
Offline Cursive Handwriting: From Word To Text Recognition, , Idiap-RR-24-2003 |
|
Using pitch frequency information in speech recognition, , and , Idiap-RR-23-2003 |
|
Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models, , and , Idiap-RR-22-2003 |
|
Segmenting Multiple Concurrent Speakers Using Microphone Arrays, , and , Idiap-RR-21-2003 |
|
Face Processing & Frontal Face Verification, , Idiap-RR-20-2003 |
|
On the Combination of Speech and Speaker Recognition, and , Idiap-RR-19-2003 |
|
Automatic Speech Recognition using Dynamic Bayesian Networks with the Energy as an Auxiliary Variable, and , Idiap-RR-18-2003 |
|
Variance Reduction Techniques in Biometric Authentication, and , Idiap-RR-17-2003 |
|
A New Margin-Based Criterion for Efficient Gradient Descent, and , Idiap-RR-16-2003 |
|
An Implicit Motion Likelihood for Tracking with Particle Filters, , and , Idiap-RR-15-2003 |
|
Nonlinear Analysis of Cognitive and Motor-related EEG Signals, and , Idiap-RR-14-2003 |
|
Speech & Face Based Biometric Authentication at IDIAP, , , , , , , and , Idiap-RR-13-2003 |
|
Multi-Modal Audio-Visual Event Recognition for Football Analysis, , and , Idiap-RR-12-2003 |
|
Conditional Gaussian Mixtures, , Idiap-RR-11-2003 |
|
Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS, , and , Idiap-RR-10-2003 |
|
Object Localization in Metric Spaces for Video Linking, and , Idiap-RR-09-2003 |
|
Evaluation of formant-like features for automatic speech recognition, , , , , and , Idiap-RR-08-2003 |
|
Monte Carlo Video Text Segmentation, and , Idiap-RR-07-2003 |
|
On automatic annotation of meeting databases, , , , and , Idiap-RR-06-2003 |
|
Robust Features for Frontal Face Authentication in Difficult Image Conditions, and , Idiap-RR-05-2003 |
|
Scalability Analysis of Audio-Visual Person Identity Verification, , , and , Idiap-RR-04-2003 |
|
Client Dependent GMM-SVM Models for Speaker Verification, and , Idiap-RR-03-2003 |
|
Multimodal Authentication using Asynchronous HMMs, , Idiap-RR-02-2003 |
|
Offline Recognition of Large Vocabulary Cursive Handwritten Text, , and , Idiap-RR-01-2003 |
|
2002
Modelling auxiliary information (pitch frequency) in hybrid HMM/ANN based ASR systems, , and , Idiap-RR-62-2002 |
|
Text Detection and Recognition in Images and Videos, , and , Idiap-RR-61-2002 |
|
Self-Organizing-Maps With BIC For Speaker Clustering, , Idiap-RR-60-2002 |
|
Modeling Human Interaction in Meetings, , , , , , , and , Idiap-RR-59-2002 |
|
Speech recognition with auxiliary information, , and , Idiap-RR-58-2002 |
|
Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR, and , Idiap-RR-57-2002 |
|
What is Better: GMM of Two Gaussians or Two Clusters With One Gaussian?, , Idiap-RR-56-2002 |
|
On Spectral Methods and the Structuring of Home Videos, , and , Idiap-RR-55-2002 |
|
The analysis of kernel ridge regression learning algorithm, , Idiap-RR-54-2002 |
|
Confusion matrix based posterior probabilities correction, and , Idiap-RR-53-2002 |
|
Multiscale Facial Expression Recognition using Convolutional Neural Networks, , Idiap-RR-52-2002 |
|
Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks, , Idiap-RR-51-2002 |
|
Evaluation Protocols and Comparative Results for the Triesch Hand Posture Database, , Idiap-RR-50-2002 |
|
Robust Face Verification using Skin Color and Neural Networks, , Idiap-RR-49-2002 |
|
Dichotomy Between Clustering Performance and Minimum Distortion in Piecewise-Dependent-Data (PDD) Clustering, and , Idiap-RR-48-2002 |
|
Towards Robust and Adaptive Speech Recognition Models, , and , Idiap-RR-47-2002 |
|
Torch: a modular machine learning software library, , and , Idiap-RR-46-2002 |
|
Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification, and , Idiap-RR-45-2002 |
|
Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks, , and , Idiap-RR-44-2002 |
|
Location Based Speaker Segmentation, and , Idiap-RR-43-2002 |
|
Extended BIC Criterion for Model Selection, and , Idiap-RR-42-2002 |
|
Microphone Array Speech Recognition: Experiments on Overlapping Speech in Meetings, and , Idiap-RR-41-2002 |
|
Improving Face Authentication Using Virtual Samples, , and , Idiap-RR-40-2002 |
|
Robust Speaker Change Detection, , and , Idiap-RR-39-2002 |
|
Phase AutoCorrelation (PAC) derived Robust Speech Features, , and , Idiap-RR-38-2002 |
|
Audio-Visual Speaker Tracking with Importance Particle Filters, , , , and , Idiap-RR-37-2002 |
|
A State-of-the-art Neural Network for Robust Face Verification, , and , Idiap-RR-36-2002 |
|
User-Customized Password HMM Based Speaker Verification, and , Idiap-RR-35-2002 |
|
Gestures for Multi-Modal Interfaces: A Review, , Idiap-RR-34-2002 |
|
Information Fusion and Person Verification Using Speech & Face Information, and , Idiap-RR-33-2002 |
|
Transforming the feature vectors to improve HMM based cursive word recognition systems, and , Idiap-RR-32-2002 |
|
Entropy-based Multi-stream Combination, , and , Idiap-RR-31-2002 |
|
SOM-Based Clustering for On-Line Fraud Behavior Classification: a Case Study, and , Idiap-RR-30-2002 |
|
Noise PDF transformation in secondary feature processing, , Idiap-RR-29-2002 |
|
Online Policy Adaptation for Ensemble Algorithms, and , Idiap-RR-28-2002 |
|
Bagging Using the VMSE Cost Function, , Idiap-RR-27-2002 |
|
An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, , Idiap-RR-26-2002 |
|
Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition, , and , Idiap-RR-25-2002 |
|
Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables, , , and , Idiap-RR-24-2002 |
|
Improved Unknown-Multiple Speaker clustering using HMM, , and , Idiap-RR-23-2002 |
|
Finding Structure in Consumer Videos by Probabilistic Hierarchical Clustering, , and , Idiap-RR-22-2002 |
|
Face Verification using MLP and SVM, and , Idiap-RR-21-2002 |
|
Linking Objects in Videos by Importance Sampling, and , Idiap-RR-20-2002 |
|
Comparison of Support Vector Machine and Neural Network for Text Texture Verification, and , Idiap-RR-19-2002 |
|
Video Text Recognition Based on Markov Random Field and Grayscale Consistency Constraint, and , Idiap-RR-18-2002 |
|
Text Segmentation and Recognition in Complex Background Based on Markov Random Field, , and , Idiap-RR-17-2002 |
|
A New Method of Contrast Normalization for Verification of Extracted Video Text Having Complex Backgrounds, and , Idiap-RR-16-2002 |
|
Speaker Normalization using HMM2, , and , Idiap-RR-15-2002 |
|
A Multi-sample Multi-source Model for Biometric Authentication, , and , Idiap-RR-14-2002 |
|
The BANCA Database and Experimental Protocol for Speaker Verification, , , and , Idiap-RR-13-2002 |
|
Conditional Gaussian Mixture Models for Environmental Risk Mapping, , and , Idiap-RR-12-2002 |
|
Probabilistic Home Video Structuring: Feature Selection and Performance Evaluation, , and , Idiap-RR-11-2002 |
|
User-Customized Password Speaker Verification based on HMM/ANN and GMM Models, and , Idiap-RR-10-2002 |
|
Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach, , and , Idiap-RR-09-2002 |
|
Low cost duration modelling for noise robust speech recognition, , and , Idiap-RR-08-2002 |
|
Unknown-Multiple Speaker clustering using HMM, , , and , Idiap-RR-07-2002 |
|
Hybrid generative-discriminative models for speech and speaker recognition, and , Idiap-RR-06-2002 |
|
Experimental Protocol on the BANCA Database, , , , , , , and , Idiap-RR-05-2002 |
|
Evaluation of Formant-Like Features for ASR, , , , , and , Idiap-RR-04-2002 |
|
Estimation of Conditional Distributions using Gaussian Mixture Models, , and , Idiap-RR-03-2002 |
|
Estimating the Intrinsic Dimension of Data with a Fractal-Based Method, and , Idiap-RR-02-2002 |
|
Towards Robust and Adaptive Speech Recognition Models, , and , Idiap-RR-01-2002 |
|
2001
Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, , Idiap-RR-49-2001 |
|
Robust Face Analysis using Convolutional Neural Networks, , Idiap-RR-48-2001 |
|
Offline Cursive Word Recognition using Continuous Density Hidden Markov Models trained with PCA or ICA Features, and , Idiap-RR-46-2001 |
|
Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition, , and , Idiap-RR-45-2001 |
|
Improving Face Verification using Skin Color Information, and , Idiap-RR-44-2001 |
|
Robust Speech Recognition and Feature Extraction Using HMM2, , , and , Idiap-RR-42-2001 |
|
Robust speech recognition based on multi-stream processing, , Idiap-RR-41-2001 |
|
Microphone Array Post-filter based on Noise Field Coherence, and , Idiap-RR-40-2001 |
|
Microphone Array Post-filter for Diffuse Noise Field, and , Idiap-RR-39-2001 |
|
Confidence Measures for Multimodal Identity Verification, , , and , Idiap-RR-38-2001 |
|
Hidden Markov Models and other Finite State Automata for Sequence Processing, and , Idiap-RR-37-2001 |
|
Increasing Speech Recognition Noise Robustness with HMM2, , and , Idiap-RR-36-2001 |
|
PhD Thesis: Speech Analysis with Production Constraints, , Idiap-RR-35-2001 |
|
A Comparative Study of Adaptation Methods for Speaker Verification, and , Idiap-RR-34-2001 |
|
Robust HMM-Based Speech/Music Segmentation, , and , Idiap-RR-33-2001 |
|
User Customized HMM/ANN Based Speaker Verification, and , Idiap-RR-32-2001 |
|
EEG pattern recognition through multi-stream evidence combination, , and , Idiap-RR-31-2001 |
|
Data utility modelling for mismatch reduction, , Idiap-RR-30-2001 |
|
Pronunciation models and their evaluation using confidence measures, and , Idiap-RR-29-2001 |
|
Video OCR for Sport Video Annotation and Retrieval, and , Idiap-RR-28-2001 |
|
IDIAP HMM/HMM2 System: Theoretical Basis and Software Specifications, , , and , Idiap-RR-27-2001 |
|
Speech/Music Discrimination using Entropy and Dynamism Features in a HMM Classification Framework, , and , Idiap-RR-26-2001 |
|
Comparison of Client Model Adaptation Schemes, and , Idiap-RR-25-2001 |
|
Speech Recognition Using Advanced HMM2 Features, , and , Idiap-RR-24-2001 |
|
A Pragmatic View of the Application of HMM2 for ASR, , and , Idiap-RR-23-2001 |
|
Confidence Evaluation for Risk Prediction, , and , Idiap-RR-22-2001 |
|
Evaluation of Biometric Technology on XM2VTS, , and , Idiap-RR-21-2001 |
|
Text Identification in Complex Background using SVM, , and , Idiap-RR-20-2001 |
|
Text Enhancement with Asymmetric Filter for Video OCR, , and , Idiap-RR-19-2001 |
|
Combining Neural Gas and Learning Vector Quantization for Cursive Character Recognition, and , Idiap-RR-18-2001 |
|
Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model, and , Idiap-RR-17-2001 |
|
Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition, and , Idiap-RR-15-2001 |
|
MAP Combination of Multi-Stream HMM or HMM/ANN Experts, , and , Idiap-RR-14-2001 |
|
Speaker Verification Based On User-Customized Password, , and , Idiap-RR-13-2001 |
|
A Parallel Mixture of SVMs for Very Large Scale Problems, , and , Idiap-RR-12-2001 |
|
Modeling Auxiliary Information in Bayesian Network Based ASR, , and , Idiap-RR-11-2001 |
|
Error Correcting Posterior Combination for Robust Multi-Band Speech Recognition, and , Idiap-RR-10-2001 |
|
Neural Networks in Automatic Speech Recognition, , , and , Idiap-RR-09-2001 |
|
Using posterior probabilities for speech/music discrimination, , Idiap-RR-08-2001 |
|
Evaluation of SVM Binary Classification with Nonparametric Stochastic Simulations, , Idiap-RR-07-2001 |
|
From missing data to maybe useful data: soft data modelling for noise robust ASR, , and , Idiap-RR-06-2001 |
|
Adaptive ML-Weighting in Multi-Band Recombination of Gaussian Mixture ASR, , and , Idiap-RR-05-2001 |
|
Support Vector Machines for Classification and Mapping of Reservoir Data, , , , , and , Idiap-RR-04-2001 |
|
Detection of Narrative Structure for Annotation of News Broadcasts, , and , Idiap-RR-03-2001 |
|
Artifacts of the colour coherence vector and an alternative similarity measure, and , Idiap-RR-02-2001 |
|
New Approaches Towards Robust and Adaptive Speech Recognition, , and , Idiap-RR-01-2001 |
|
2000
A MAP approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification, , and , Idiap-RR-48-2000 |
|
Cursive Character Recognition by Learning Vector Quantization, and , Idiap-RR-47-2000 |
|
The use of Boolean concepts in general classification contexts, , Idiap-RR-46-2000 |
|
Approches génératives pour le traitement de séquences d'images: application à la reconnaissance dynamique des gestes de la main, , Idiap-RR-45-2000 |
|
Weighting schemes for audio-visual fusion in speech recognition, , , , and , Idiap-RR-44-2000 |
|
A survey on Off-Line Cursive Word Recognition, , Idiap-RR-43-2000 |
|
HMM2- Extraction of Formant Features and their Use for Robust ASR, , and , Idiap-RR-42-2000 |
|
Automatic Speech Recognition using Pitch Information in Dynamic Bayesian Networks, , and , Idiap-RR-41-2000 |
|
Learning the Decision Function for Speaker Verification, and , Idiap-RR-40-2000 |
|
A Survey of Text Detection and Recognition in Images and Videos, and , Idiap-RR-38-2000 |
|
ASYMMETRIC FILTER FOR TEXT RECOGNITION IN VIDEO, and , Idiap-RR-37-2000 |
|
Robust multi-stream speech recognition based on the combined reliabilities of the speech signal and phonemes estimates, , Idiap-RR-36-2000 |
Audio visual speech recognition, , , , , , , and , Idiap-RR-35-2000 |
|
Local Machine Learning Models for Spatial Data Analysis, and , Idiap-RR-34-2000 |
|
Intrinsic dimension estimation of data: an approach based on Grassberger-Procaccia's algorithm, and , Idiap-RR-33-2000 |
|
A new normalization technique for cursive handwritten words, and , Idiap-RR-32-2000 |
|
Advanced Spatial Data Analysis and Modelling with Support Vector Machines, , , and , Idiap-RR-31-2000 |
|
HMM2- A Novel Approach to HMM Emission Probability Estimation, , and , Idiap-RR-30-2000 |
|
Multiple Timescale Feature Combination towards Robust Speech Recognition, , Idiap-RR-29-2000 |
|
Multiple Hypotheses Video OCR, and , Idiap-RR-28-2000 |
|
Test of several external posterior weighting functions for multiband Full Combination ASR, and , Idiap-RR-27-2000 |
|
Recent Developments in Speaker Verification at IDIAP, and , Idiap-RR-26-2000 |
|
Mixtures of latent variable models for density estimation and classification, , Idiap-RR-25-2000 |
|
On the Convergence of SVMTorch, an Algorithm for Large-Scale Regression Problems, and , Idiap-RR-24-2000 |
|
A neural network for classification with incomplete data, , Idiap-RR-23-2000 |
|
Using Multiple Time Scales in the Framework of Multi-Stream Speech Recognition, and , Idiap-RR-22-2000 |
|
Comparison of HMM experts with MLP experts in the Full Combination Multi-Band Approach to Robust ASR, and , Idiap-RR-21-2000 |
|
From Multi-Band Full Combination to Multi-Stream Full Combination Processing in Robust ASR, , and , Idiap-RR-20-2000 |
|
Automatic Speech Recognition using Dynamic Bayesian Networks with both Acoustic and Articulatory Variables, , , and , Idiap-RR-19-2000 |
|
Mixture Models for Unsupervised and Supervised Learning, , Idiap-RR-18-2000 |
|
Support Vector Machines for Large-Scale Regression Problems, and , Idiap-RR-17-2000 |
|
Auto-Association by Multilayer Perceptrons and Singular Value Decomposition, , Idiap-RR-16-2000 |
|
Video Indexing and Similarity Retrieval by Largest Common Subgraph Detection using Decision Trees, , and , Idiap-RR-15-2000 |
|
Incorporating Domain Knowledge with Video and Voice Data Analysis in News Broadcasts, , and , Idiap-RR-14-2000 |
|
Combining multiple tracking algorithms for improved general performance, , and , Idiap-RR-13-2000 |
|
Video sequence matching via decision tree path following, , and , Idiap-RR-12-2000 |
|
An EM Algorithm for HMMs with Emission Distributions Represented by HMMs, , and , Idiap-RR-11-2000 |
|
Environmental Data Mapping with Support Vector Regression and Geostatistics, , and , Idiap-RR-10-2000 |
|
Spatial Data Mapping with Support Vector Regression, and , Idiap-RR-09-2000 |
|
Adaptation robuste de modèles HMM pour la vérification du locuteur dépendante du texte, and , Idiap-RR-08-2000 |
|
Handwritten Digits Recognition, , Idiap-RR-07-2000 |
|
Indexing spoken audio by LSA and SOMs, , Idiap-RR-06-2000 |
|
Thematic Indexing of Spoken Documents by Using Self-Organizing Maps, , Idiap-RR-05-2000 |
|
An Introduction to Bayesian Network Theory and Usage, , Idiap-RR-03-2000 |
|
Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification, , , , , and , Idiap-RR-02-2000 |
|
Taking on the Curse of Dimensionality in Joint Distributions Using Neural Networks, and , Idiap-RR-01-2000 |
|
1999
Iterative Posterior-Based Keyword Spotting Without Filler Models: Iterative Viterbi Decoding and One-Pass Approach, and , Idiap-RR-27-1999 |
Multi-stream adaptive evidence combination for noise robust ASR, , , and , Idiap-RR-26-1999 |
|
Off-Line Cursive Script Recognition Based on Continuous Density HMM, and , Idiap-RR-25-1999 |
|
An Overview of the PICASSO Project Research Activities in Speaker Verification for Telephone Applications, , , , , , , , , , , and , Idiap-RR-24-1999 |
CLIENT / WORLD MODEL SYNCHRONOUS ALIGNMENT FOR SPEAKER VERIFICATION, , , and , Idiap-RR-23-1999 |
|
Recognition of Asymmetric Facial Action Unit Activities and Intensities, and , Idiap-RR-22-1999 |
|
INtegrating SPEech acoustic and linguistic Constraints: Baseline System Development, , , and , Idiap-RR-21-1999 |
|
Fast latent semantic indexing of spoken documents by using self-organizing maps, , Idiap-RR-20-1999 |
|
Automatic Facial Expression Analysis: A Survey, and , Idiap-RR-19-1999 |
|
Towards introducing long-term statistics in MUSE for robust speech recognition, and , Idiap-RR-18-1999 |
|
A comparison of two strategies for ASR in additive noise: Missing Data and Spectral Subtraction, and , Idiap-RR-17-1999 |
|
Iterative Posterior-Based Keyword Spotting Without Filler Models, and , Idiap-RR-16-1999 |
Numerical Experiments with Support Vector Machines, and , Idiap-RR-15-1999 |
|
Combining Wavelet-domain Hidden Markov Trees with Hidden Markov Models, , and , Idiap-RR-14-1999 |
|
Indexing Audio Documents by using Latent Semantic Analysis and SOM, , Idiap-RR-13-1999 |
|
Latent Semantic Indexing by Self-Organizing Map, and , Idiap-RR-12-1999 |
|
A comparison of noise reduction techniques for robust speech recognition, , Idiap-RR-10-1999 |
|
DynaBoost: Combining Boosted Hypotheses in a Dynamic Way, and , Idiap-RR-09-1999 |
|
Combinatorial Approach for Data Binarization, and , Idiap-RR-08-1999 |
|
Environmental spatial data classification with Support Vector Machines, , , and , Idiap-RR-07-1999 |
|
Synchronous Alignment, and , Idiap-RR-06-1999 |
|
Data binarization by discriminant elimination, , and , Idiap-RR-04-1999 |
|
Fusion of Face and Speech Data for Person Identity Verification, , and , Idiap-RR-03-1999 |
|
Speaker verification experiments on the XM2VTS database, , Idiap-RR-02-1999 |
|
Segmentation of X-ray Image Sequences Showing the Vocal Tract, , Idiap-RR-01-1999 |
|
Segmentation of X-ray Image Sequences Showing the Vocal Tract (with tool documentation), , Idiap-RR-01-1999 |
|
1998
Audio-Visual Person Verification, , , , and , Idiap-RR-18-1998 |
|
Automatic Speech Recognition: an Auditory Perspective, , and , Idiap-RR-17-1998 |
Acoustico-articulatory inversion of unequal-length tube models through lattice inverse filtering, , Idiap-RR-16-1998 |
|
Subband-Based Speech Recognition in Noisy Conditions: The Full Combination Approach, , and , Idiap-RR-15-1998 |
|
Localized mixtures of experts, , Idiap-RR-14-1998 |
Introduction à la reconnaissance de la parole et du locuteur, , Idiap-RR-13-1998 |
Speaker Verification: A Quick Overview, and , Idiap-RR-12-1998 |
|
Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems, and , Idiap-RR-11-1998 |
|
Evaluating the Complexity of Databases for Person Identification and Verification, , and , Idiap-RR-10-1998 |
|
Illumination-robust Pattern Matching Using Distorted Color Histograms, and , Idiap-RR-09-1998 |
|
Multi-Modal Data Fusion for Person Authentication using SVM, , Idiap-RR-07-1998 |
|
Support Vector Machine for Multiclass Classification, and , Idiap-RR-06-1998 |
|
Combining Linear Dichotomizers to Construct Nonlinear Polychotomizers, and , Idiap-RR-05-1998 |
|
Combined 5x2cv F-Test for Comparing Supervised Classification Learning Algorithms, , Idiap-RR-04-1998 |
|
On the Complexity of Recognizing Regions Computable by Two-Layered Perceptrons, , Idiap-RR-03-1998 |
|
Continuous Audio-Visual Speech Recognition, and , Idiap-RR-02-1998 |
|
Optimal Parameterization of Point Distribution Models, and , Idiap-RR-01-1998 |
|
1997
Investigation of a possible process identity between DRM and Linear Filtering, , Idiap-RR-19-1997 |
|
Reconnaissance de caractères manuscrits à l'aide de réseaux neuromimétiques, , Idiap-RR-18-1997 |
|
Neural Network Adaptations to Hardware Implementations, and , Idiap-RR-17-1997 |
|
An Optical Thresholding Perceptron, , , , and , Idiap-RR-16-1997 |
|
Handwritten Digit Recognition with Binary Optical Perceptron, , , and , Idiap-RR-15-1997 |
|
Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition, and , Idiap-RR-14-1997 |
|
Acoustic-Labial Speaker Verification, , , and , Idiap-RR-13-1997 |
|
Speechreading using Probabilistic Models, and , Idiap-RR-12-1997 |
|
Fast Object Detection using MLP and FFT, , Idiap-RR-11-1997 |
|
On the Complexity of Recognizing Iterated Differences of Polyhedra, , Idiap-RR-10-1997 |
|
Improved Pairwise Coupling Classification With Correcting Classifiers, and , Idiap-RR-09-1997 |
|
Text dependent speaker verification using binary classifiers, , and , Idiap-RR-08-1997 |
|
Mixtures of Experts Estimate A Posteriori Probabilities, , Idiap-RR-07-1997 |
|
Decision fusion in a multi-modal identity verification system using a multi-linear classifier, , and , Idiap-RR-06-1997 |
Likelihood ratio adjustment for the compensation of model mismatch in speaker verification, and , Idiap-RR-05-1997 |
|
Optimal Setting of Weights, Learning Rate, and Gain, and , Idiap-RR-04-1997 |
|
Pruning of Neural Networks, and , Idiap-RR-03-1997 |
|
Discrete All-Positive Multilayer Perceptrons for Optical Implementation, , and , Idiap-RR-02-1997 |
|
Robust Speech Recognition based on Multi-Stream Features, , and , Idiap-RR-01-1997 |
|
1996
Image Classification by Neural Networks for the Quality Control of Watches, , and , Idiap-RR-10-1996 |
|
Speaker-Dependent Speech Recognition Based on Phone-Like Units Models: Application to Voice Dialing, and , Idiap-RR-09-1996 |
|
On the Decomposition of Polychotomies into Dichotomies, and , Idiap-RR-08-1996 |
|
Multi-Stream Speech Recognition, , and , Idiap-RR-07-1996 |
|
On Variations of the Convex Hull Operator, , Idiap-RR-06-1996 |
|
An Implementation of Logical Analysis of Data, , , , , and , Idiap-RR-05-1996 |
|
Secured vocal access to telephone servers, , , , and , Idiap-RR-04-1996 |
|
On the Complexity of the Class of Regions Computable by a Two-Layered Perceptron, , Idiap-RR-03-1996 |
|
Combining methods to improve speaker verification decision, , , and , Idiap-RR-02-1996 |
|
Swiss French PolyPhone and PolyVar: telephone speech databases to model inter- and intra-speaker variability, , , , and , Idiap-RR-01-1996 |
|
1995
Neural Networks with Adaptive Learning Rate and Momentum Terms, and , Idiap-RR-04-1995 |
|
Experiments with robust similarity measures for OCR, , Idiap-RR-03-1995 |
Définition et évaluation d'un protocole de négociation dans un système multi-agents de reconnaissance de la parole, , Idiap-RR-02-1995 |
Apprentissage de prototypes de caractères à partir de l'image d'un texte manuscrit et avec l'aide d'un opérateur, , Idiap-RR-01-1995 |
|
1994
High Order and Multilayer Perceptron Initialization, and , Idiap-RR-07-1994 |
|
Adaptive Multilayer Optical Neural Network Design, and , Idiap-RR-04-1994 |
|
A System for the Off-Line Recognition of Handwritten Text, , Idiap-RR-02-1994 |
|
1993
Finding Lines under Bounded Error, , Idiap-RR-11-1993 |
|
An RBF Network that Learns Some Aspects of Perceptual Organization, , Idiap-RR-10-1993 |
|
View-Based Recognition, , Idiap-RR-09-1993 |
|
The 3D Indexing Problem, , Idiap-RR-08-1993 |
|
Geometric Matching in Computer Vision--Algorithms and Open Problems, , Idiap-RR-07-1993 |
|
Recognition of Handprinted Digits, , Idiap-RR-06-1993 |
|
Un interface de recherche documentaire: I de r, version 2.0, , Idiap-RR-04-1993 |
|
Un interface d'indexation documentaire: I d'i, version 2.0, , Idiap-RR-03-1993 |
|
Higher-Order Statistics in Visual Object Recognition, , Idiap-RR-02-1993 |
|
Un interface d'indexation documentaire: I d'i, version 1.4, , Idiap-RR-01-1993 |
|
1992
Une technique efficace de traitement en Prolog de la morphologie flexionnelle du français, , Idiap-RR-04-1992 |
|
Un environnement d'analyse linguistique robuste: CPD, version 1.7, , Idiap-RR-03-1992 |
|
Neural Network Formalization, , Idiap-RR-01-1992 |
|