Publications of MUMMER sorted by journal and type
Publications of type Idiap-RR
2019
Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis, , and , Idiap-RR-03-2019 |
|
2018
Deep Neural Networks for Multiple Speaker Detection and Localization, , and , Idiap-RR-02-2018 |
|
Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network, , and , Idiap-RR-17-2018 |
|
2017
Robust and Accurate 3D Head Pose Estimation through 3DMM and Online Head Model Reconstruction, , and , Idiap-RR-09-2017 |
|
Publications of type Idiap-Com
2019
Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training, , and , Idiap-Com-01-2019 |
|
ACM Transactions on Multimedia Computing, Communications, and Applications
Robust Unsupervised Gaze Calibration using Conversation and Manipulation Attention Priors, and , in: ACM Transactions on Multimedia Computing, Communications, and Applications, 18(1):26, 2022 |
[DOI] [URL] |
Frontiers in Robotics and AI
Towards an Engagement-Aware Attentive Artificial Listener for Multi-Party Interactions, , , , , and , in: Frontiers in Robotics and AI, 8:189, 2021 |
[DOI] [URL] |
IEEE Transaction on Pattern Analysis and Machine Intelligence
A Differential Approach for Gaze Estimation, , and , in: IEEE Transaction on Pattern Analysis and Machine Intelligence, 43(3):1092--1098, 2021 |
[DOI] [URL] |
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Neural Network Adaptation and Data Augmentation for Multi-Speaker Direction-of-Arrival Estimation, , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29:1303-1317, 2021 |
[DOI] [URL] |
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY
Efficient Convolutional Neural Networks for Depth-Based Multi-Person Pose Estimation, , , and , in: IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 30(11):4207-4221, 2020 |
[DOI] [URL] |
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
HeadFusion: 360 degree Head Pose Tracking Combining 3D Morphable Model and 3D Reconstruction, , and , in: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 40(11), 2018 |
[DOI] |
Small Group Research
Theories and Models of Teams and Group, , , , and , in: Small Group Research, 48(5):544--567, 2017 |
[DOI] |
Proceedings of Interspeech 2021 (2021)
Multi-task Neural Network for Robust Multiple Speaker Embedding Extraction, , and , in: Proceedings of Interspeech 2021, 2021 |
International Conference in Computer Vision - Workshops (2021)
Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers, , and , in: International Conference in Computer Vision - Workshops, 2021 |
|
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (2021)
Visual Focus of Attention Estimation in 3D Scene with an Arbitrary Number of Targets, and , in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 9, IEEE, 2021 |
|
Symposium on Eye Tracking Research and Applications (2020)
ManiGaze: a Dataset for Evaluating Remote Gaze Estimator in Object Manipulation Situations, , and , in: Symposium on Eye Tracking Research and Applications, Stuttgart, Germany, pages 5, ACM, 2020 |
[DOI] |
IEEE/RSJ International Conference on Intelligent Robots and Systems (2020)
Residual Pose: A Decoupled Approach for Depth-based 3D Human Pose Estimation, , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020 |
|
Proceedings of the 29th IEEE International Conference on Robot & Human Interactive Communication (2020)
The MuMMER data set for Robot Perception in multi-party HRI Scenarios, , , and , in: Proceedings of the 29th IEEE International Conference on Robot & Human Interactive Communication, 2020 |
|
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2020)
Unsupervised Representation Learning for Gaze Estimation, and , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020 |
|
2019 ACM Symposium on Eye Tracking Research and Applications (2019)
A Deep Learning Approach for Robust Head Pose Independent Eye Movements Recognition from Videos, , and , in: 2019 ACM Symposium on Eye Tracking Research and Applications, pages 5, ACM, 2019 |
[DOI] |
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (2019)
Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis, , and , in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019 |
European Conference on Computer Vision Workshop (2018)
Deep Multitask Gaze Estimation with a Constrained Landmark-Gaze Model, , and , in: European Conference on Computer Vision Workshop, 2018 |
|
2018 IEEE International Conference on Robotics and Automation (ICRA) (2018)
Deep Neural Networks for Multiple Speaker Detection and Localization, , and , in: 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, AUSTRALIA, pages 74-79, 2018 |
[DOI] |
European Conference on Computer Vision - Workshops (2018)
Investigating Depth Domain Adaptation for Efficient Human Pose Estimation, , , and , in: European Conference on Computer Vision - Workshops, 2018 |
|
Proceedings of Interspeech (2018)
Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network, , and , in: Proceedings of Interspeech, pages 312--316, 2018 |
[DOI] |
2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2018)
Leveraging Convolutional Pose Machines for Fast and Accurate Head Pose Estimation, , and , in: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, SPAIN, pages 1089-1094, IEEE, 2018 |
|
IEEE/RSJ International Conference on Intelligent Robots and Systems (2018)
Real-time Convolutional Networks for Depth-based Human Pose Estimation, , , and , in: IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018 |
|
Proceedings of Interspeech (2018)
Robust and Discriminative Speaker Embedding via Intra-Class Distance Variance Regularization, and , in: Proceedings of Interspeech, Hyderabad, INDIA, pages 2257-2261, 2018 |
[DOI] |
ACM International Conference on Multimodal Interaction (2017)
A Domain Adaptation Approach to Improve Speaker Turn Embedding Using Face Representation, and , in: ACM International Conference on Multimodal Interaction, Glasgow, Scotland, ACM, 2017 |
|
ICCV Workshop on Computer Vision for Audio-Visual Media (2017)
Improving speaker turn embedding by crossmodal transfer learning from face embedding, and , in: ICCV Workshop on Computer Vision for Audio-Visual Media, 2017 |
|
Proceedings of the 12th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2017) (2017)
Robust and Accurate 3D Head Pose Estimation through 3DMM and Online Head Model Reconstruction, , and , in: Proceedings of the 12th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2017), 2017 |
|
Proceedings of 19th ACM International Conference on Multimodal Interaction (2017)
Towards the Use of Social Interaction Conventions as Prior for Gaze Model Adaptation, , and , in: Proceedings of 19th ACM International Conference on Multimodal Interaction, pages 9, ACM, 2017 |
[DOI] |
Publications of type Phdthesis
2020
Accurate Nod and 3D Gaze Estimation for Social Interaction Analysis, , EDEE, EPFL, 2020 |
|
2019
Multimodal Person Recognition in Audio-Visual Streams, , EPFL, 2019 |
[DOI] |