CONF zhang-rr-05-50b/IDIAP Extracting Information from Multimedia Meeting Collections Gatica-Perez, Daniel Zhang, Dong Bengio, Samy EXTERNAL https://publications.idiap.ch/attachments/reports/2005/gatica-mir.pdf PUBLIC https://publications.idiap.ch/index.php/publications/showcite/zhang-rr-05-50 Related documents 7th ACM SIGMM International Workshop on Multimedia Information Retrieval 2005 IDIAP-RR 05-50 Multimedia meeting collections, composed of unedited audio and video streams, handwritten notes, slides, and electronic documents that jointly constitute a raw record of complex human interaction processes in the workplace, have attracted interest due to the increasing feasibility of recording them in large quantities, by the opportunities for information access and retrieval applications derived from the automatic extraction of relevant meeting information, and by the challenges that the extraction of semantic information from real human activities entails. In this paper, we present a succint overview of recent approaches in this field, largely influenced by our own experiences. We first review some of the existing and potential needs for users of multimedia meeting information systems. We then summarize recent work on various research areas addressing some of these requirements. In more detail, we describe our work on automatic analysis of human interaction patterns from audio-visual sensors, discussing open issues in this domain. REPORT zhang-rr-05-50/IDIAP Extracting Information from Multimedia Meeting Collections Gatica-Perez, Daniel Zhang, Dong Bengio, Samy EXTERNAL https://publications.idiap.ch/attachments/reports/2005/rr-05-50.pdf PUBLIC Idiap-RR-50-2005 2005 IDIAP Martigny, Switzerland Published in 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, Nov. 2005 Multimedia meeting collections, composed of unedited audio and video streams, handwritten notes, slides, and electronic documents that jointly constitute a raw record of complex human interaction processes in the workplace, have attracted interest due to the increasing feasibility of recording them in large quantities, by the opportunities for information access and retrieval applications derived from the automatic extraction of relevant meeting information, and by the challenges that the extraction of semantic information from real human activities entails. In this paper, we present a succint overview of recent approaches in this field, largely influenced by our own experiences. We first review some of the existing and potential needs for users of multimedia meeting information systems. We then summarize recent work on various research areas addressing some of these requirements. In more detail, we describe our work on automatic analysis of human interaction patterns from audio-visual sensors, discussing open issues in this domain.