All publications sorted by author
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |
L
Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody, , and , in: Proceedings of Interspeech, San Francisco, USA, 2016 |
|
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, , , and , Idiap-RR-22-2016 |
|
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, , , and , in: 9th ISCA Speech Synthesis Workshop, 2016 |
|
Syllable-based Regional Swiss French Accent Identification using Prosodic Features, , , and , in: Nouveaux cahiers de linguistique francaise, 2014 |
|
Investigating Cross-lingual Multi-level Adaptive Networks: The Importance of the Correlation of Source and Target Languages, , , , and , in: Proceedings of the International Workshop on Spoken Language Translation, Seattle, WA, USA, 2016 |
|
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, , and , Idiap-RR-03-2014 |
|
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, , and , in: Speech Prosody, 2014 |
|
SWISS FRENCH REGIONAL ACCENT IDENTIFICATION, , , , , and , in: Odyssey: The Speaker and Language Recognition Workshop, 2014 |
|
DNN-based Speech Synthesis: Importance of input features and training data, , and , in: International Conference on Speech and Computer , SPECOM, pages 193-200, Springer Berlin Heidelberg, 2015 |
[DOI] |
Multimodal Person Recognition in Audio-Visual Streams, , EPFL, 2019 |
[DOI] |
Long-Term Time-Sensitive Costs for CRF-Based Tracking by Detection, , and , in: 2nd Workshop on Benchmarking Multi-target Tracking: MOTChallenge 2016, Amsterdam, 2016 |
|
Temporally Subsampled Detection for Accurate and Efficient Face Tracking and Diarization, , , and , in: International Conference on Pattern Recognition, Cancun, Mexico, IEEE, 2016 |
|
EUMSSI team at the MediaEval Person Discovery Challenge 2016, , and , in: MediaEval Benchmarking Initiative for Multimedia Evaluation, Hilversum, Netherlands, 2016 |
|
Improving speech embedding using crossmodal transfer learning with audio-visual data, and , in: Multimedia Tools and Applications, 78(11):15681-15704, 2019 |
[DOI] |
Robust and Discriminative Speaker Embedding via Intra-Class Distance Variance Regularization, and , in: Proceedings of Interspeech, Hyderabad, INDIA, pages 2257-2261, 2018 |
[DOI] |
Improving speaker turn embedding by crossmodal transfer learning from face embedding, and , in: ICCV Workshop on Computer Vision for Audio-Visual Media, 2017 |
|
A Domain Adaptation Approach to Improve Speaker Turn Embedding Using Face Representation, and , in: ACM International Conference on Multimodal Interaction, Glasgow, Scotland, ACM, 2017 |
|
Learning Multimodal Temporal Representation for Dubbing Detection in Broadcast Media, and , in: ACM Multimedia, Amsterdam, ACM, 2016 |
|
Towards large scale multimedia indexing: A case study on person discovery in broadcast news, , and , in: 15th International Workshop on Content-Based Multimedia Indexing, 2017 |
|
EUMSSI team at the MediaEval Person Discovery Challenge, , , and , in: Working Notes Proceedings of the MediaEval 2015 Workshop, Wurzen, Germany, 2015 |
[URL] |
Client Dependent GMM-SVM Models for Speaker Verification, and , in: International Conference on Artificial Neural Networks, ICANN/ICONIP 2003, Springer Verlag, 2003 |
|
Noise Robust Discriminative Models, and , Idiap-RR-40-2003 |
|
Client Dependent GMM-SVM Models for Speaker Verification, and , Idiap-RR-03-2003 |
|
Hybrid generative-discriminative models for speech and speaker recognition, and , Idiap-RR-06-2002 |
|
Automatic vs. human question answering over multimedia meeting recordings, and , Idiap-RR-13-2009 |
|
Automatic vs. human question answering over multimedia meeting recordings, and , in: 10th Annual Conference of the International Speech Communication Association, 2009 |
|
Building Word Embeddings for Solving Natural Language Processing, , École Polytechnique Fédérale de Lausanne, 2016 |
[DOI] |
N-gram-Based Low-Dimensional Representation for Document Classification, and , in: International Conference on Learning Representations, 2015 |
[URL] |
Rehabilitation of Count-based Models for Word Vector Representations, and , in: Computational Linguistics and Intelligent Text Processing, pages 417-429, Springer International Publishing, 2015 |
"The Sum of Its Parts": Joint Learning of Word and Phrase Representations with Autoencoders, and , Idiap-RR-21-2015 |
|
Word Embeddings through Hellinger PCA, and , in: 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014 |
|
Word Embeddings through Hellinger PCA, and , Idiap-RR-29-2013 |
|
Is Deep Learning Really Necessary for Word Embeddings?, , and , Idiap-RR-44-2013 |
|
Twitter Sentiment Analysis (Almost) from Scratch, , and , Idiap-RR-15-2016 |
|
Phrase-based Image Captioning, , and , Idiap-RR-08-2015 |
|
Phrase-based Image Captioning, , and , in: International Conference on Machine Learning (ICML), Lille, France, pages 2085–2094, JMLR, 2015 |
[URL] |
Simple Image Description Generator via a Linear Phrase-based Model, , and , Idiap-RR-22-2015 |
|
Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, , , and , in: Actes de la conference conjointe JEP-TALN-RECITAL 2012, Grenoble, France, pages 193-200, ATALA/AFCP, 2012 |
|
Supervised and unsupervised Web-based language model domain adaptation, , , and , Idiap-RR-22-2012 |
|
Supervised and unsupervised Web-based language model domain adaptation, , , and , in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012 |
|
Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web, , , and , Idiap-RR-23-2012 |
|
Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, and , Idiap-RR-21-2012 |
|
Conversion of Recurrent Neural Network Language Models to Weighted Finite State Transducers for Automatic Speech Recognition, and , in: Proceedings of Interspeech, Portland, Oregon, USA, pages to appear, 2012 |
|
Domain-specific language model adaptation: a case study, , and , Idiap-Com-01-2013 |
|
The I4U Submission to the 2012 NIST Speaker Recognition Evaluation, , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: NIST Speaker Recognition Conference, 2012 |
Establishment of CORONET, COVID-19 Risk in Oncology Evaluation Tool, to Identify Cancer Patients at Low Versus High Risk of Severe Complications of COVID-19 Infection Upon Presentation to Hospital, , , and , in: Clinical Cancer Informatics, 2022 |
Longitudinal characterisation of haematological and biochemical parameters in cancer patients prior to and during COVID-19 reveals features associated with outcome, , , and , in: ESMO Open, 2021 |
Tractable Approaches to Learning and Planning in High Dimensions, , EPFL, 2014 |
[DOI] |
Jointly Informative Feature Selection, and , in: Journal of Machine Learning Research, 2016 |
Jointly Informative Feature Selection, and , in: International Conference on Artificial Intelligence and Statistics, pages 567–575, 2014 |
|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 |