All publications
2016
Modeling Unvoiced Sounds In Statistical Parametric Speech Synthesis with a Continuous Vocoder, , , and , in: Proc. of EUSIPCO, Budapest, Hungary, 2016 |
|
PhonVoc: A Phonetic and Phonological Vocoding Toolkit, and , in: Interspeech, San Francisco, USA, 2016 |
|
Sound Pattern Matching for Automatic Prosodic Event Detection, , , , and , in: Interspeech, San Francisco, USA, 2016 |
|
Improving Pronoun Translation by Modeling Coreference Uncertainty, and , in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, 2016 |
|
The SIWIS database: a multilingual speech database with acted emphasis, , , , , , , , , , , and , in: Proceedings of Interspeech, San Francisco, USA, pages 1532--1535, 2016 |
[DOI] |
Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody, , and , in: Proceedings of Interspeech, San Francisco, USA, 2016 |
|
Simultaneous temporal superresolution and denoising for cardiac fluorescence microscopy, , , and , in: IEEE Transactions on Computational Imaging, 2016 |
[DOI] [URL] |
Predicting the Performance in Decision-Making Tasks: From Individual Cues to Group Interaction, and , in: IEEE Transactions on Multimedia, 18(4):643--658, 2016 |
[DOI] [URL] |
High-slope terrain locomotion for torque-controlled quadruped robots, , , , , and , in: Autonomous Robots, 2016 |
[DOI] [URL] |
Hierarchical Planning of Dynamic Movements without Scheduled Contact Sequences, , , , and , in: Proceedings of the IEEE International Conference of Robotics and Automation, 2016 |
Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information, , and , in: Data & Knowledge Engineering Journal, 2016 |
Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information, , and , Idiap-RR-16-2016 |
|
Towards End-to-End Speech Recognition, , École Polytechnique Fédérale de Lausanne, 2016 |
[DOI] |
Importance Sampling Tree for Large-scale Empirical Expectation, , and , in: Proceedings of the International Conference on Machine Learning (ICML), New York, 2016 |
Heterogeneous Face Recognition using Inter-Session Variability Modelling, and , in: IEEE Computer Society Workshop on Biometrics, Las Vegas, USA, IEEE, 2016 |
|
A Contextual Language Model to Improve Machine Translation of Pronouns by Re-ranking Translation Hypotheses, and , in: European Association for Machine Translation, 2016 |
System Fusion and Speaker Linking for Longitudinal Diarization of TV Shows, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5495-5499, IEEE, 2016 |
|
Quantifying uncertainties on excursion sets under a Gaussian random field prior, , , and , in: SIAM/ASA J. Uncertainty Quantification, 4(1):850-874, 2016 |
[DOI] [URL] |
Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework, , and , in: Speech Communication, 80, 2016 |
[DOI] |
Exploiting Low-dimensional Structures to Enhance DNN based Acoustic Modeling in Speech Recognition, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5690-5694, IEEE, 2016 |
|
Information Theoretic Clustering for Unsupervised Domain-Adaptation, , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5580-5584, IEEE, 2016 |
|
Deep Neural Network Based Posteriors for Text-Dependent Speaker Verification, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5050-5054, IEEE, 2016 |
|
Phonetic and Phonological Posterior Search Space Hashing Exploiting Class-Specific Sparsity Structures, , , and , Idiap-RR-10-2016 |
|
Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, , , and , Idiap-RR-11-2016 |
|
"Can you hear me now?" --- Automatic assessment of background noise intrusiveness and speech intelligibility in telecommunications, , Sciences et Techniques de l’Ingénieur (STI), 2016 |
[DOI] |
Overview of BTAS 2016 Speaker Anti-spoofing Competition, , , , , , , , , , , , , , , and , Idiap-RR-24-2016 |
[URL] |
Joint Operation of Voice Biometrics and Presentation Attack Detection, and , Idiap-RR-25-2016 |
[URL] |
Cross-database evaluation of audio-based spoofing detection systems, and , Idiap-RR-23-2016 |
[URL] |
Audiovisual Summarization of Lectures and Meetings Using a Segment Similarity Graph, , and , in: Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR), ACM, New York, NY, ACM Press, 2016 |
Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation, and , in: Proceedings of the 10th Language Resources and Evaluation Conference (LREC), Portoroz, Slovenia, 2016 |
|
Tracking Interacting Objects Using Intertwined Flows, , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016 |
Principled Parallel Mean-Field Inference for Discrete Random Fields, , , and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016 |
The SIWIS database: a multilingual speech database with acted emphasis, , , , , , , , , , , and , Idiap-RR-13-2016 |
|
Probabilistic Amplitude Demodulation features in Speech Synthesis for Improving Prosody, , and , Idiap-RR-12-2016 |
|
On Structured Sparsity of Phonological Posteriors for Linguistic Parsing, , and , Idiap-RR-07-2016 |
[URL] |
Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, , and , Idiap-RR-06-2016 |
|
Low-Rank Representation For Enhanced Deep Neural Network Acoustic Models, , Idiap-RR-05-2016 |
|
Low-Rank Representation of Nearest Neighbor Phone Posterior Probabilities to Enhance DNN Acoustic Modeling, , , and , Idiap-RR-04-2016 |
|
Sound Pattern Matching for Automatic Prosodic Event Detection, , , , and , Idiap-RR-03-2016 |
|
Multilingual Visual Sentiment Concept Matching, , , , , , and , in: Proceedings of the International Conference on Multimedia Retrieval (ICMR), 2016 |
|
Large Scale Hard Sample Mining with Monte Carlo Tree Search, and , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016 |
|
Assessing a Shape Descriptor for Analysis of Mesoamerican Hieroglyphics: A View Towards Practice in Digital Humanities, , and , in: Digital Humanities Conference (DH), Krakow, 2016 |
|
Deep Dynamic Neural Networks for Multimodal Gesture Segmentation and Recognition, , , , , , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016 |
|
Wiki-LDA: A Mixed-Method Approach for Effective Interest Mining on Twitter Data, , , and , in: Proceedings of CSEDU 2016, 2016 |
|
Learning Robot Manipulation Tasks with Task-Parameterized Semi-Tied Hidden Semi-Markov Model, and , in: IEEE Robotics and Automation Letters, 1(1):235-242, 2016 |
[DOI] [URL] |
Face Recognition in Challenging Environments: An Experimental and Reproducible Research Survey, , and , in: Face Recognition Across the Imaging Spectrum, Springer, 2016 |
|
Learning Explainable User Sentiment and Preferences for Information Filtering, , École Polytechnique Fédérale de Lausanne, 2016 |
[DOI] |
A Point-Spread-Function-Aware Filtered Backprojection Algorithm for Focal-Plane-Scanning Optical Projection Tomography, and , in: 2016 IEEE International Symposium on Biomedical Imaging, 2016 |
An Analysis of Rhythmic Staccato-Vocalization Based on Frequency Demodulation for Laughter Detection in Conversational Meetings, , , and , Idiap-RR-02-2016 |
|
Sparse Subspace Modeling for Query by Example Spoken Term Detection, , and , Idiap-RR-01-2016 |
|