All publications
2010
| Tracter: A Lightweight Dataflow Framework, in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
| Audio–Visual Synchronisation for Speaker Diarisation, in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, 2010 |
|
| An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, Idiap-RR-22-2010 |
|
| Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, International Association for Pattern Recognition (IAPR), 2010 |
|
| Flickr Groups: Multimedia Communities for Multimedia Analysis, Idiap-RR-18-2010 |
|
| Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, Idiap-RR-23-2010 |
|
| Estimating Cohesion in Small Groups using Audio-Visual Nonverbal Behavior, Idiap-RR-12-2010 |
|
| Mining Human Location-Routines using a Multi-Level Topic Model, Idiap-RR-28-2010 |
|
| Probabilistic Mining of Socio-Geographic Routines from Mobile Phone Data, in: IEEE Journal of Selected Topics in Signal Processing, 4(4), 2010 |
|
| Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, Idiap-RR-29-2010 |
|
| English Spoken Term Detection in Multilingual Recordings, Idiap-RR-21-2010 |
|
| On the Results of the First Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation, Idiap-RR-30-2010 |
|
| MOBIO: Mobile Biometric Face and Speaker Authentication, Idiap-RR-31-2010 |
|
| Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation, Idiap-RR-09-2010 |
|
| Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes, Idiap-RR-33-2010 |
|
| Hierarchical Multilayer Perceptron based Language Identification, Idiap-RR-14-2010 |
|
| Towards mixed language speech recognition systems, Idiap-RR-15-2010 |
|
| Tracter: A Lightweight Dataflow Framework, Idiap-RR-10-2010 |
|
| Hands Free Audio Analysis from Home Entertainment, Idiap-RR-27-2010 |
|
| Fast Bounding Box Estimation based Face Detection, Idiap-RR-38-2010 |
|
| The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, Idiap-RR-08-2010 |
|
| Online-Batch Strongly Convex Multi Kernel Learning, Idiap-RR-07-2010 |
|
| OM-2: An Online Multi-class Multi-kernel Learning Algorithm, Idiap-RR-06-2010 |
|
| Implementation of VTLN for Statistical Speech Synthesis, Idiap-RR-32-2010 |
|
| Neural conditional random fields, in: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Chia Laguna, Sardinia, Italy, JMLR: W&CP, 2010 |
|
| Online-Batch Strongly Convex Multi Kernel Learning, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010 |
|
| The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition, in: Image and Vision Computing, 2010 |
[DOI] |
| A Multimodal Corpus for Studying Dominance in Small Group Conversations, in: LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010 |
|
| Multilayer Perceptron Based Hierarchical Acoustic Modeling for Automatic Speech Recognition, Ecole polytechnique fédérale de Lausanne, 2010 |
|
| Analysis of Phone Posterior Feature Space Exploiting Class Specific Sparsity and MLP-based Similarity Measure, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
| Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, Idiap-RR-13-2010 |
|
| Finding without searching, Idiap-Com-01-2010 |
|
| Application of Out-Of-Language Detection To Spoken-Term Detection, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
| Boosted Binary Features for Noise-Robust Speaker Verification, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010 |
|
| Multistream Speaker Diarization beyond Two Acoustic Feature Streams, in: International Conference on Acoustics, Speech, and Signal Processing, 2010 |
|
| AMIDA/Klewel Mini-Project, Idiap-RR-03-2010 |
|
| An Alternative Scanning Strategy to Detect Faces, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
| Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, in: ICASSP 2010, 2010 |
|
| Using Audio and Visual Cues for Speaker Diarisation Initialisation, in: International Conference on Acoustics, Speech and Signal Processing, 2010 |
|
| An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, pages 4946-4949, 2010 |
|
| On Improving Face Detection Performance by Modelling Contextual Information, Idiap-RR-43-2010 |
|
| Automatic Temporal Alignment of AV Data with Confidence Estimation, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
| VTLN Adaptation for Statistical Speech Synthesis, in: Proceedings of ICASSP, Dallas, Texas, 2010 |
|
| A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, Idiap-RR-05-2010 |
|
| A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, U.S.A., 2010 |
|
| Autoregressive Models of Amplitude Modulations in Audio Compression, in: IEEE Transactions on Audio, Speech, and Language Processing, 2010 |
[URL] |
| Wide-Band Audio Coding based on Frequency Domain Linear Prediction, in: EURASIP Journal on Audio Speech and Music Processing, 2010(856280), 2010 |
[DOI] [URL] |
| Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification, in: ACM 25th Symposium on Applied Computing, Sierre, Switzerland, Association for Computing Machinery, 2010 |
|
| An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, Idiap-RR-02-2010 |
|
| Application of Out-Of-Language Detection To Spoken-Term Detection, Idiap-RR-04-2010 |
|