CONF zhang-rr-06-41b/IDIAP Exploring Contextual Information in a Layered Framework for Group Action Recognition Zhang, Dong Gatica-Perez, Daniel Bengio, Samy EXTERNAL https://publications.idiap.ch/attachments/reports/2006/rr-06-41.pdf PUBLIC https://publications.idiap.ch/index.php/publications/showcite/zhang-rr-06-41 Related documents In the Eighth International Conference on Multimodal Interfaces (ICMI'06) 2006 IDIAP-RR 06-41 Contextual information is important for sequence modeling. Hidden Markov Models (HMMs) and extensions, which have been widely used for sequence modeling, make simplifying, often unrealistic assumptions on the conditional independence of observations given the class labels, thus cannot accommodate overlapping features or long-term contextual information. In this paper, we introduce a principled layered framework with three implementation methods that take into account contextual information (as available in the whole or part of the sequence). The first two methods are based on state {\em alpha} and {\em gamma} posteriors (as usually referred to in the HMM formalism). The third method is based on Conditional Random Fields (CRFs,',','), a conditional model that relaxes the independent assumption on the observations required by HMMs for computational tractability. We illustrate our methods with the application of recognizing group actions in meetings. Experiments and comparison with standard HMM baseline showed the validity of the proposed approach. REPORT zhang-rr-06-41/IDIAP Exploring Contextual Information in a Layered Framework for Group Action Recognition Zhang, Dong Gatica-Perez, Daniel Bengio, Samy EXTERNAL https://publications.idiap.ch/attachments/reports/2006/rr-06-41.pdf PUBLIC Idiap-RR-41-2006 2006 IDIAP Martigny, Switzerland submitted Contextual information is important for sequence modeling. Hidden Markov Models (HMMs) and extensions, which have been widely used for sequence modeling, make simplifying, often unrealistic assumptions on the conditional independence of observations given the class labels, thus cannot accommodate overlapping features or long-term contextual information. In this paper, we introduce a principled layered framework with three implementation methods that take into account contextual information (as available in the whole or part of the sequence). The first two methods are based on state {\em alpha} and {\em gamma} posteriors (as usually referred to in the HMM formalism). The third method is based on Conditional Random Fields (CRFs,',','), a conditional model that relaxes the independent assumption on the observations required by HMMs for computational tractability. We illustrate our methods with the application of recognizing group actions in meetings. Experiments and comparison with standard HMM baseline showed the validity of the proposed approach.