Idiap Research Institute
All publications

2019
Vulnerability assessment and detection of Deepfake videos, Pavel Korshunov and Sébastien Marcel, in: IAPR International Conference on Biometrics, 2019
Virtual High-Framerate Microscopy of the Beating Heart via Sorting of Still Images, Olivia Mariani, Kevin G. Chan, Alexander Ernst, Nadia Mercader and Michael Liebling, in: 2019 IEEE 16th International Symposium on Biomedical Imaging, pages 312-315, 2019
CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model, Florian Mai, Lukas Galke and Ansgar Scherp, in: International Conference on Learning Representations, New Orleans, Louisiana, USA, 2019
An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model, François Marelli, Bastian Schnell, Hervé Bourlard, T. Dutoit and Philip N. Garner, in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, pages 7040-7044, IEEE, 2019
BookTubing Across Regions: Examining Differences based on Nonverbal and Verbal Cues, Chinchu Thomas, Dinesh Jayagopi and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Interactive Experiences for Television and Online Video (TVX), Salford, England, 2019
Drinks & Crowds: Characterizing Alcohol Consumption through Crowdsensing and Social Media, Thanh-Trung Phan, Skanda Muralidhar and Daniel Gatica-Perez, in: Journal and Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT '19), 2019
#Drink Or #Drunk: Multimodal Signals and Drinking Practices on Instagram, Thanh-Trung Phan, Skanda Muralidhar and Daniel Gatica-Perez, in: Proceedings of the 13th EAI International Conference on Pervasive Computing Technologies for Healthcare, Trento, Italy, 2019
Deep Residual Output Layers for Neural Language Generation, Nikolaos Pappas and James Henderson, in: Proceedings of the 36th International Conference on Machine Learning (ICML), 2019
Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis, Yu Yu, Gang Liu and Jean-Marc Odobez, in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019
An Investigation of Multilingual ASR Using End-to-End LF-MMI, Sibo Tong, Philip N. Garner and Hervé Bourlard, in: International Conference on Acoustics, Speech and Signal Processing, 2019
A Deep Learning Approach for Robust Head Pose Independent Eye Movements Recognition from Videos, Remy Siegfried, Yu Yu and Jean-Marc Odobez, in: 2019 ACM Symposium on Eye Tracking Research and Applications, pages 5, ACM, 2019
Pathological Speech Intelligibility Assessment Based on the Short-Time Objective Intelligibility Measure, Parvaneh Janbakhshi, Ina Kodrasi and Hervé Bourlard, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, UK, pages 6405-6409, 2019
Analyzing Uncertainties in Speech Recognition Using Dropout, Apoorv Vyas, Pranay Dighe, Sibo Tong and Hervé Bourlard, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, Weipeng He, Petr Motlicek and Jean-Marc Odobez, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, pages 770-774, 2019
Empirical Evaluation and Combination of Punctuation Prediction Models Applied to Broadcast News, Alexandre Nanchen and Philip N. Garner, in: Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019
A Learning-Based Framework for Quantized Compressed Sensing, Rabeeh Karimi Mahabadi, Junhong Lin and Volkan Cevher, in: A Learning-Based Framework for Quantized Compressed Sensing, 2019
Improving Children Speech Recognition through Feature Learning from Raw Speech Signal, S. Pavankumar Dubagunta, Selen Hande Kabil and Mathew Magimai.-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Learning voice source related information for depression detection, S. Pavankumar Dubagunta, Bogdan Vlasenko and Mathew Magimai.-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Segment-level training of ANNs based on acoustic confidence measures for hybrid HMM/ANN Speech Recognition, S. Pavankumar Dubagunta and Mathew Magimai.-Doss, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Automatic Diagnosis of Alzheimer's Disease Using Neural Network Language Models, Julian Fritsch, Sebastian Wankerl and Elmar Nöth, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
GILE: A Generalized Input-Label Embedding for Text Classification, Nikolaos Pappas and James Henderson, in: Transactions of the Association for Computational Linguistics (TACL), 2019