CONF
fasel02c-conf/IDIAP
Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks
Fasel, B.
EXTERNAL
https://publications.idiap.ch/attachments/reports/2002/rr02-51.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/fasel-rr-02-51
Related documents
International IEEE Conference on Multimodal Interfaces (ICMI 02)
2002
Pittsburgh, USA
October 2002
529-534
IDIAP-RR 02-51
Automatic face analysis has to cope with pose and lighting variations. Especially pose variations are difficult to tackle and many face analysis methods require the use of sophisticated normalization and initialization procedures. We propose a data-driven face analysis approach that is not only capable of extracting features relevant to a given face analysis task, but is also more robust with regard to face location changes and scale variations when compared to classical methods such as e.g. MLPs. Our approach is based on convolutional neural networks that use multi-scale feature extractors, which allow for improved facial expression recognition results with faces subject to in-plane pose variations.
REPORT
fasel-RR-02-51/IDIAP
Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks
Fasel, B.
EXTERNAL
https://publications.idiap.ch/attachments/reports/2002/rr02-51.pdf
PUBLIC
Idiap-RR-51-2002
2002
IDIAP
Published in the Proceedings of the fourth International IEEE Conference on Multimodal Interfaces (ICMI 2002,',','),
Pittsburgh, USA
Automatic face analysis has to cope with pose and lighting variations. Especially pose variations are difficult to tackle and many face analysis methods require the use of sophisticated normalization and initialization procedures. We propose a data-driven face analysis approach that is not only capable of extracting features relevant to a given face analysis task, but is also more robust with regard to face location changes and scale variations when compared to classical methods such as e.g. MLPs. Our approach is based on convolutional neural networks that use multi-scale feature extractors, which allow for improved facial expression recognition results with faces subject to in-plane pose variations.