All publications sorted by recency
Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, , and , in: Transactions on Image Processing, 2014 |
|
On Recognition of Non-Native Speech Using Probabilistic Lexical Model, and , in: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014), 2014 |
|
Posterior-based Sparse Representation for Automatic Speech Recognition, , , and , in: Proceedings of Interspeech, 2014 |
|
Raw Speech Signal-based Continuous Speech Recognition using Convolutional Neural Networks, , and , Idiap-RR-15-2014 |
|
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, , , and , Idiap-RR-10-2014 |
|
Recurrent Greedy Parsing with Neural Networks, and , in: Proceedings of ECML 2014, pages 130-144, Springer Berlin Heidelberg, 2014 |
[DOI] |
Geometric Generative Gaze Estimation (G3E) for Remote RGB-D Cameras, and , in: IEEE Computer Vision and Pattern Recognition Conference, Columbus, Ohio, USA, pages 1773-1780, IEEE, 2014 |
[DOI] |
Importance of Prosody in Swiss French Accent for Speech Synthesis, and , in: Nouveaux cahiers de linguistique française, 2014 |
|
Topic-Level Extractive Summarization of Lectures and Meetings Using a Snippet Similarity Graph, and , Idiap-RR-09-2014 |
|
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, , , and , in: Interspeech, 2014 |
|
Syllable-based Regional Swiss French Accent Identification using Prosodic Features, , , and , in: Nouveaux cahiers de linguistique française, 2014 |
|
Translation and Prosody in Swiss Languages, , , , , , , , , , and , in: Nouveaux cahiers de linguistique française, 2014 |
|
Dialect Levelling in Finnish: A Universal Speech Attribute Approach, , , , , , and , in: The 15th Annual Conference of the International Speech Communication Association, 2014 |
Introducing I-Vectors for Joint Anti-spoofing and Speaker Verification, , , , and , in: The 15th Annual Conference of the International Speech Communication Association, 2014 |
|
Comparison of Two Methods for Unsupervised Person Identification in TV Shows, , , , and , in: 12th International Workshop on Content-Based Multimedia Indexing, 2014 |
|
Enhanced Diffuse Field Model for Ad Hoc Microphone Array Calibration, , and , in: Signal Processing, 101:242-255, 2014 |
|
Modeling Overlapping Speech using Vector Taylor Series, , and , in: Odyssey: The Speaker and Language Recognition Workshop, Joensuu, Finland, 2014 |
|
MLP-based Factor Analysis for Tandem Speech Recognition, and , in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing, 2013 |
|
Enforcing Topic Diversity in a Document Recommender for Conversations, and , in: Proceedings of the Coling 2014 (25th International Conference on Computational Linguistics), Dublin, Ireland, pages 746-759, IEEE, 2014 |
|
Recurrent Convolutional Neural Networks for Scene Labeling, and , in: 31st International Conference on Machine Learning (ICML), Beijing, China, pages 82-90, JMLR, 2014 |
[URL] |
Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, , , and , in: Proceedings of the 4th Joint Workshop on Hands-free speech communication and Microphone Arrays, Villers-les-Nancy, pages 1 - 5, IEEE, 2014 |
[DOI] |
Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, , , and , in: Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014 |
Face identification from overlaid texts using Local Face Recurrent Patterns and CRF models, , , , and , in: IEEE International Conference on Image Processing 2014, Paris, IEEE, 2014 |
|
Scene Recognition with Naive Bayes Non-linear Learning, and , in: Proceedings of the 22nd International Conference on Pattern Recognition, Stockholm, pages 3404 - 3409, IEEE, 2014 |
[DOI] |
Spoofing Face Recognition with 3D Masks, and , in: IEEE Transactions on Information Forensics and Security:1084-1097, 2014 |
[DOI] |
A task-parameterized probabilistic model with minimal intervention control, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3339 - 3344, IEEE, 2014 |
[DOI] |
Null space redundancy learning for a flexible surgical robot, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 2443 - 2448, IEEE, 2014 |
[DOI] |
Learning from demonstrations with partially observable task parameters, , and , in: Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), Hong Kong, pages 3309 - 3314, IEEE, 2014 |
[DOI] |
Mode of Teaching Based Segmentation and Annotation of Video Lectures, , and , in: International Workshop on Content-Based Multimedia Indexing, 2014 |
|
Improving Speaker Diarization using social role information, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2014 |
|
Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, , , and , in: Speech Prosody, 2014 |
|
Exploiting Long-Term Connectivity and Visual Motion in CRF-based Multi-Person Tracking, , and , Idiap-RR-05-2014 |
|
Improving Real Time Factor of Information Bottleneck-based Speaker Diarization System, , and , Idiap-RR-18-2015 |
|
Hierarchical speaker clustering methods for the NIST i-vector Challenge, , , and , in: Odyssey: The Speaker and Language Recognition Workshop, 2014 |
|
Multi-Source Adaptive Learning for Fast Control of Prosthetics Hand, , and , in: Proceedings of the International Conference on Pattern Recognition, Stockholm, pages 2769 - 2774, 2014 |
[DOI] |
Learning to Learn, from Transfer Learning to Domain Adaptation: A Unifying Perspective, and , in: Proceedings of the Computer Vision and Pattern Recognition, Columbus, OH, pages 1442-1449, IEEE, 2014 |
[DOI] |
Sparse Gammatone Signal Model Predicts Perceived Noise Intrusiveness, and , Idiap-RR-07-2014 |
|
On Modeling Context-Dependent Clustered States: Comparing HMM/GMM, Hybrid HMM/ANN and KL-HMM Approaches, , and , in: International Conference on Acoustics, Speech, and Signal Processing, Florence, IT, pages 7659 - 7663, IEEE, 2014 |
[DOI] |
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, , and , in: Speech Prosody, 2014 |
|
Swiss French Regional Accent Identification, , , , , and , in: Odyssey: The Speaker and Language Recognition Workshop, 2014 |
|
Scalable Probabilistic Models for Face and Speaker Recognition, , École Polytechnique Fédérale de Lausanne (EPFL), 2014 |
[URL] |
Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation, , , , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, pages 7639-7643, IEEE, 2014 |
[DOI] |
Exploiting un-transcribed foreign data for speech recognition in well-resourced languages, , , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, IT, pages 2322 - 2326, IEEE, 2014 |
[DOI] |
A Conditional Random field approach for audio-visual people diarization, , , , and , in: Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 116 - 120, IEEE, 2014 |
[DOI] |
Score Calibration in Face Recognition, , , , , and , in: IET Biometrics:1-11, 2014 |
[DOI] [URL] |
EYEDIAP: A Database for the Development and Evaluation of Gaze Estimation Algorithms from RGB and RGB-D Cameras, , and , in: Proceedings of the ACM Symposium on Eye Tracking Research and Applications, Safety Harbor, Florida, United States of America, ACM, 2014 |
[DOI] |
Cross-linguistic annotation of narrativity for English/French verb tense disambiguation, and , in: 9th Edition of the Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014 |
|
English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling, , and , in: The Ninth Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014 |
|
Multimodal Reranking of Content-based Recommendations for Hyperlinking Video Snippets, , , and , in: ACM International Conference on Multimedia Retrieval, 2014 |
|
Acoustic and Lexical Resource Constrained ASR using Language-Independent Acoustic Model and Language-Dependent Probabilistic Lexical Model, and , Idiap-RR-02-2014 |
|