All research reports
2010
| Implementation of VTLN for Statistical Speech Synthesis, , , and , Idiap-RR-32-2010 |
|
| MOBIO: Mobile Biometric Face and Speaker Authentication, , , , , , , , and , Idiap-RR-31-2010 |
|
| On the Results of the First Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation, , , , , and , Idiap-RR-30-2010 |
|
| Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, and , Idiap-RR-29-2010 |
|
| Mining Human Location-Routines using a Multi-Level Topic Model, and , Idiap-RR-28-2010 |
|
| Hands Free Audio Analysis from Home Entertainment, , and , Idiap-RR-27-2010 |
|
| The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites, , , and , Idiap-RR-26-2010 |
|
| Study of Jacobian Normalization for VTLN, , and , Idiap-RR-25-2010 |
|
| KL Realignment for Speaker Diarization with Multiple Feature Streams, , and , Idiap-RR-24-2010 |
|
| Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, , and , Idiap-RR-23-2010 |
|
| An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, , and , Idiap-RR-22-2010 |
|
| English Spoken Term Detection in Multilingual Recordings, , and , Idiap-RR-21-2010 |
|
| Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media, , and , Idiap-RR-20-2010 |
|
| Modeling and Understanding Flickr Communities through Topic-based Analysis, and , Idiap-RR-19-2010 |
|
| Flickr Groups: Multimedia Communities for Multimedia Analysis, and , Idiap-RR-18-2010 |
|
| Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations, and , Idiap-RR-17-2010 |
|
| An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation, and , Idiap-RR-16-2010 |
|
| Towards mixed language speech recognition systems, , and , Idiap-RR-15-2010 |
|
| Hierarchical Multilayer Perceptron based Language Identification, , and , Idiap-RR-14-2010 |
|
| Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, and , Idiap-RR-13-2010 |
|
| Estimating Cohesion in Small Groups using Audio-Visual Nonverbal Behavior, and , Idiap-RR-12-2010 |
|
| Investigation of kNN Classifier on Posterior Features Towards Application in Automatic Speech Recognition, , and , Idiap-RR-11-2010 |
|
| Tracter: A Lightweight Dataflow Framework, and , Idiap-RR-10-2010 |
|
| Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation, , , , and , Idiap-RR-09-2010 |
|
| The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, , and , Idiap-RR-08-2010 |
|
| Online-Batch Strongly Convex Multi Kernel Learning, , and , Idiap-RR-07-2010 |
|
| OM-2: An Online Multi-class Multi-kernel Learning Algorithm, , , , and , Idiap-RR-06-2010 |
|
| A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, , and , Idiap-RR-05-2010 |
|
| Application of Out-Of-Language Detection To Spoken-Term Detection, and , Idiap-RR-04-2010 |
|
| AMIDA/Klewel Mini-Project, , , and , Idiap-RR-03-2010 |
|
| An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, and , Idiap-RR-02-2010 |
|
| Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, , , and , Idiap-RR-01-2010 |
|
2009
| VTLN Adaptation for Statistical Speech Synthesis, , , and , Idiap-RR-41-2009 |
|
| Automatic Temporal Alignment of AV Data with Confidence Estimation, , and , Idiap-RR-40-2009 |
|
| Automatic Temporal Alignment of AV Data, , and , Idiap-RR-39-2009 |
|
| User Interface Design in a Just-in-time Retrieval System for Meetings, , , , , , and , Idiap-RR-38-2009 |
|
| On MLP-based Posterior Features for Template-based ASR, , , and , Idiap-RR-37-2009 |
|
| Memoirs of Togetherness from Audio Logs, , Idiap-RR-36-2009 |
|
| APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, , , and , Idiap-RR-35-2009 |
|
| MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction, , and , Idiap-RR-34-2009 |
|
| Autoregressive Models of Amplitude Modulations in Audio Compression, , and , Idiap-RR-33-2009 |
|
| Wide-Band Audio Coding based on Frequency Domain Linear Prediction, , and , Idiap-RR-32-2009 |
|
| Out-of-Scene AV Data Detection, , Idiap-RR-31-2009 |
|
| Analysis of F0 and Cepstral Features for Robust Automatic Gender Recognition, and , Idiap-RR-30-2009 |
|
| Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, and , Idiap-RR-29-2009 |
|
| Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, and , Idiap-RR-28-2009 |
|
| Bayesian Networks to Combine Intensity and Color Information in Face Recognition, and , Idiap-RR-27-2009 |
|
| Robust Speaker Diarization for Short Speech Recordings, and , Idiap-RR-26-2009 |
|
| SNR Features for Automatic Speech Recognition, , Idiap-RR-25-2009 |
|
| On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR, , and , Idiap-RR-24-2009 |
|
