All publications sorted by recency
| Transfer in Inverse Reinforcement Learning for Multiple Strategies, and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, Tokyo, Japan, pages 3244-3250, IEEE, 2013 |
[DOI] [URL] |
| Autonomous reinforcement learning with experience replay, and , in: Neural Networks, 41:156 - 167, 2013 |
[DOI] [URL] |
| "The Sum of Its Parts": Joint Learning of Word and Phrase Representations with Autoencoders, and , Idiap-RR-21-2015 |
|
| Rehabilitation of Count-based Models for Word Vector Representations, and , in: Computational Linguistics and Intelligent Text Processing, pages 417-429, Springer International Publishing, 2015 |
| From Image-level to Pixel-level Labeling with Convolutional Networks, and , in: Computer Vision and Pattern Recognition (CVPR), Boston, MA, pages 1713-1721, IEEE, 2015 |
[DOI] [URL] |
| Phrase-based Image Captioning, , and , in: International Conference on Machine Learning (ICML), Lille, France, pages 2085–2094, JMLR, 2015 |
[URL] |
| DexROV: Dexterous Undersea Inspection and Maintenance in Presence of Communication Latencies, , , , , , , , , , , , , , , and , in: IFAC Workshop on Navigation, Guidance and Control of Underwater Vehicles (NGCUV), pages 218-223, 2015 |
| Dictionary Learning for Sparse Representation of Neural Network Exemplars in Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, pages 1093, 2015 |
|
| Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-Based Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015 |
|
| DNN-based Speech Synthesis: Importance of input features and training data, , and , in: International Conference on Speech and Computer, SPECOM, pages 193-200, Springer Berlin Heidelberg, 2015 |
[DOI] |
| Objective Intelligibility Assessment of Text-to-Speech Systems Through Utterance Verification, , , and , in: Proceedings of Interspeech, Dresden, Germany, pages 3501-3505, 2015 |
[URL] |
| Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, , , and , in: Proceedings of Interspeech 2015, pages 3105-3109, 2015 |
|
| Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system, , , and , Idiap-RR-20-2015 |
|
| KL-HMM BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS, and , in: Proceedings of ICASSP 2015, pages 4435-4439, 2015 |
|
| COMBINING SGMM SPEAKER VECTORS AND KL-HMM APPROACH FOR SPEAKER DIARIZATION, , and , in: Proceedings of ICASSP 2015, pages 4834-4837, 2015 |
|
| EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION, , , and , in: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Brisbane, Australia, pages 4445-4449, 2015 |
[URL] |
| Exploiting foreign resources for DNN-based ASR, , , , and , Idiap-RR-27-2015 |
|
| Sparse Modeling of Posterior Exemplars for Keyword Detection, , , and , in: Proceedings of Interspeech, pages 3690-3694, 2015 |
|
| Weighted Correlation based Atom Decomposition Intonation Modelling, , , and , in: Proceedings of Interspeech, Dresden, Germany, pages 1601-1605, 2015 |
|
| Automatic Recognition of Emergent Social Roles in Small Group Interactions, and , in: Multimedia, IEEE Transactions, 17(5):746 - 760, 2015 |
[DOI] |
| Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition, , , , and , in: Proceedings of Interspeech, pages 741-745, 2015 |
|
| Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, and , in: Proc. of Interspeech, Dresden, Germany, pages 1191-1195, ISCA, 2015 |
|
| Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, and , Idiap-RR-14-2015 |
|
| Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, , , and , in: Proceedings of Interspeech, 2015 |
|
| Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, , , and , Idiap-RR-12-2015 |
|
| Computational Analysis Of Behavior In Employment Interviews And Video Resumes, , École Polytechnique Fédérale de Lausanne, 2015 |
|
| Probability Occupancy Maps for Occluded Depth Images, , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2829-2837, 2015 |
| Machine learning-based tools to model and to remove the off-target effect, , , and , in: Pattern Analysis and Applications, 20(1):87-100, 2017 |
[DOI] |
| Adaptive relevance feedback for large-scale image retrieval, and , in: Multimedia Tools and Applications, 75(12):6777-6807, 2016 |
[DOI] |
| Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-based Speech Recognition, , and , in: Speech Communication: Special Issue on Advances in Sparse Modeling and Low-rank Modeling for Speech Processing, 76:230–244, 2016 |
[DOI] |
| Dynamic structure and protein expression of the live embryonic heart captured by 2-photon light sheet microscopy and retrospective registration, , , , , and , in: Biomedical Optics Express, 6(6):2056-2066, 2015 |
[DOI] [URL] |
| An Empirical Model of Emphatic Word Detection, and , in: Proc. of Interspeech, Dresden, Germany, pages 573-577, ISCA, 2015 |
|
| Acoustic Data-Driven Grapheme-to-Phoneme Conversion in the Probabilistic Lexical Modeling Framework, , and , Idiap-RR-10-2015 |
|
| Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German, , , , , and , Idiap-RR-09-2015 |
|
| Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German, , , , , and , in: Proceedings of the ACL-IJCNLP 2015 Student Research Workshop, Beijing, China, pages 8-15, 2015 |
|
| Disambiguating Discourse Connectives for Statistical Machine Translation, , and , in: IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(7):1184-1197, 2015 |
[DOI] |
| In the Mood for Vlog: Multimodal Inference in Conversational Social Video, , , and , in: ACM Transactions on Interactive Intelligent Systems, 5(2), 2015 |
[DOI] |
| Speech vocoding for laboratory phonology, , and , Idiap-RR-07-2015 |
|
| Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition; Comparison with the Envelope-Variance Measure, , , , and , Idiap-RR-30-2015 |
|
| Learning Feature Mapping using Deep Neural Network Bottleneck Features for Distant Large Vocabulary Speech Recognition, , , , , and , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 4540-4544, 2015 |
[DOI] |
| Joint Speaker Verification and Anti-Spoofing in the i-Vector Space, , , , and , in: IEEE Transactions on Information Forensics and Security, 10(4):821-832, 2015 |
[DOI] |
| Gender Classification by LUT based boosting of Overlapping Block Patterns, , and , in: Scandinavian Conference on Image Analysis, pages 530-542, Springer International Publishing, 2015 |
[DOI] [URL] |
| An Empirical Model of Emphatic Word Detection, and , Idiap-RR-11-2015 |
|
| Reconstruction of Images from Gabor Graphs with Applications in Facial Image Processing, , , and , in: Journal of Wavelets, Multiresolution and Information Processing, 13(4):25, 2015 |
[DOI] |
| Incremental Syllable-Context Phonetic Vocoding, , , , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(6), 2015 |
[URL] |
| Learning linearly separable features for speech recognition using convolutional neural networks, , and , Idiap-RR-24-2015 |
[URL] |
| Objective Intelligibility Assessment of Text-to-Speech Systems Through Utterance Verification, , , and , Idiap-RR-06-2015 |
|
| Analysis of CNN-based Speech Recognition System using Raw Speech as Input, , and , Idiap-RR-23-2015 |
|
| On the Application of Automatic Subword Unit Derivation and Pronunciation Generation for Under-Resourced Language ASR: A Study on Scottish Gaelic, , and , Idiap-RR-13-2015 |
|