All publications
2019
Learning from demonstration for semi-autonomous teleoperation, and , in: Autonomous Robots, 43(3):713-726, 2019 |
[DOI] [URL] |
Heterogeneous Face Recognition Using Domain Specific Units, , and , in: IEEE Transactions on Information Forensics and Security:13, 2019 |
[DOI] |
Investigating Time Delay Neural Network (TDNN) for Language Modeling in Low Resource Automatic Speech Recognition, , , and , Idiap-RR-13-2019 |
Stacked Neural Networks with Parameter Sharing for Multilingual Language Modeling, , , , , and , Idiap-RR-12-2019 |
Empirical Evaluation and Combination of Punctuation Prediction Models Applied to Broadcast News, and , Idiap-RR-01-2019 |
An End-to-End Network to Synthesize Intonation Using a Generalized Command Response Model, , , , and , Idiap-RR-05-2019 |
Virtual High-Framerate Microscopy of the Beating Heart via Sorting of Still Images, , , , and , Idiap-RR-04-2019 |
Modeling Dyadic and Group Impressions with Inter-Modal and Inter-Person Features, , , and , in: ACM Transactions on Multimedia Computing, Communications, and Applications, 15(1), 2019 |
Mi Casa es su Casa? Examining Airbnb Hospitality Exchange Practices in a Developing Economy, , , , , , and , in: ACM Transactions on Social Computing, 2(1), 2019 |
Learning Control, and , in: Humanoid Robotics: a Reference, pages 1261-1312, Springer, 2019 |
[DOI] [URL] |
2018
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , in: MLSLP-18 Proceedings, Hyderabad, 2018 |
[URL] |
Theory and Algorithms for Hypothesis Transfer Learning, , EPFL, 2018 |
[DOI] |
Low-latency speaker spotting with online diarization and detection, , , , , , , , and , in: The Speaker and Language Recognition Workshop (Odyssey), 2018 |
A Non-Euclidean Gradient Descent Framework for Non-Convex Matrix Factorization, , , , , and , in: IEEE Transactions on Signal Processing, 2018 |
Real-Time DCT Learning-based Reconstruction of Neural Signals, , and , in: EUSIPCO, 2018 |
Learning-Based Compressive MRI, , , , , , and , in: IEEE Transactions on Medical Imaging, 2018 |
Improving the conditioning of the optimization criterion in acoustic multi-channel equalization using shorter reshaping filters, and , in: EURASIP Journal on Advances in Signal Processing(11), 2018 |
Phonetic aware techniques for Speaker Verification, , EPFL, 2018 |
Building Blocks of Assistant Based Speech Recognition for Air Traffic Management Applications, , , , , , , and , in: Conference: SESAR Innovation Days 2018, European Union, Eurocontrol, Salzburg, Austria, SESARJU, 2018 |
[URL] |
Iterative Learning of Speech Recognition Models for Air Traffic Control, , , , , , and , in: Proceedings of Interspeech 2018, ISCA, Hyderabad, India, pages 3519-3523, 2018 |
[DOI] |
Leveraging Convolutional Pose Machines for Fast and Accurate Head Pose Estimation, , and , in: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, pages 1089-1094, IEEE, 2018 |
Multilingual bottleneck features for subword modeling in zero-resource languages, and , in: Proc. Interspeech, pages 2668-2672, 2018 |
[DOI] |
Practical Deep Stereo (PDS): Toward applications-friendly deep stereo matching, , and , in: Proceedings of the international conference on Neural Information Processing Systems, 2018 |
Geodesic Convolutional Shape Optimization, , , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
Kronecker Recurrent Units, , and , in: Proceedings of the International Conference on Machine Learning, 2018 |
Enhancing Trust in eAssessment - the TeSLA System Solution, , , , and , in: Technology Enhanced Assessment Conference, 2018 |
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation, and , in: Proc. Interspeech 2018, pages 3147-3151, 2018 |
[DOI] |
SCALAR - Simultaneous Calibration of 2D Laser And Robot's Kinematic Parameters Using Three Planar Constraints, , and , in: International Conference on Intelligent Robots, 2018 |
DeepFakes: a New Threat to Face Recognition? Assessment and Detection, and , Idiap-RR-18-2018 |
A Cross-database Study of Voice Presentation Attack Detection, and , in: Handbook of Biometric Anti-Spoofing: Presentation Attack Detection, 2nd Edition, Springer, 2018 |
Dexterous Underwater Manipulation from Distant Onshore Locations, , , , , , , , , , , , , , , , , , , , , , , , and , in: IEEE Robotics and Automation Magazine, 2018 |
Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models, , , , , , , and , in: 13th Intl Workshop on the Algorithmic Foundations of Robotics (WAFR), 2018 |
Bimanual Skill Learning with Pose and Joint Space Constraints, , , and , in: Proc. of the IEEE-RAS Intl Conf. on Humanoid Robots (Humanoids), 2018 |
Probabilistic Learning of Torque Controllers from Kinematic and Force Constraints, , , , and , in: Proc. of the IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS), pages 6552-6559, 2018 |
Embedding Context-Dependent Variations of Prosodic Contours using Variational Encoding for Decomposing the Structure of Speech Prosody, , , , and , in: Workshop on Prosody and Meaning: Information Structure and Beyond, Aix-en-Provence, France, 2018 |
[URL] |
Context-Aware Attention Mechanism for Speech Emotion Recognition, , , and , in: IEEE Workshop on Spoken Language Technology, Athens, Greece, pages 126-131, 2018 |
[URL] |
Designing second order recurrent neural networks for prosody modelling, , Idiap-RR-16-2018 |
Fusing TensorFlow with building energy simulation for intelligent energy management in smart cities, , , and , in: Sustainable Cities and Society, 2018 |
[DOI] |
Semi-supervised Adaptation of Assistant Based Speech Recognition Models for different Approach Areas, , , , , , , , , , and , in: 37th AIAA/IEEE Digital Avionics Systems Conference, AIAA/IEEE, London, 2018 |
[URL] |
Words Worth: Verbal Content and Hirability Impressions in YouTube Video Resumes, , and , in: Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, 2018 |
Facing Employers and Customers: What Do Gaze and Expressions Tell About Soft Skills?, , , and , in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, Egypt, pages 121-126, Association for Computing Machinery, 2018 |
[DOI] |
Vlogging Over Time: Longitudinal Impressions and Behavior in YouTube, , , , and , in: Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, Cairo, Egypt, pages 37-46, 2018 |
[DOI] |
UNICITY: A depth maps database for people detection in security airlocks, , , , , , , and , in: IEEE International Conference on Advanced Video and Signal-based Surveillance Workshop, 2018 |
WatchNet: Efficient and Depth-based Network for People Detection in Video Surveillance Systems, , , and , in: IEEE International Conference on Advanced Video and Signal-based Surveillance, Auckland, New Zealand, pages 109-114, 2018 |
Deep Multitask Gaze Estimation with a Constrained Landmark-Gaze Model, , and , in: European Conference on Computer Vision Workshop, 2018 |
HeadFusion: 360 degree Head Pose Tracking Combining 3D Morphable Model and 3D Reconstruction, , and , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(11), 2018 |
[DOI] |
Integrating Weakly Supervised Word Sense Disambiguation into Neural Machine Translation, , , and , in: Transactions of the Association for Computational Linguistics (TACL), 2018 |
Phonetic Subspace Features for Improved Query by Example Spoken Term Detection, , and , in: Speech Communication, 103:27-36, 2018 |
[DOI] |
Investigating Depth Domain Adaptation for Efficient Human Pose Estimation, , , and , in: European Conference on Computer Vision - Workshops, 2018 |
Cross-lingual Adaptation of a CTC-based multilingual Acoustic Model, , and , in: Speech Communication, 104:39-46, 2018 |
[DOI] |