CONF
zhang-rr-05-15b/IDIAP
Semi-supervised Meeting Event Recognition with Adapted HMMs
Zhang, Dong
Gatica-Perez, Daniel
Bengio, Samy
EXTERNAL
https://publications.idiap.ch/attachments/reports/2005/zhang-icme-05.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/zhang-rr-05-15
Related documents
Pro. IEEE ICME
2005
IDIAP-RR 05-15
This paper investigates the use of unlabeled data to help labeled data for audio-visual event recognition in meetings. To deal with situations in which it is difficult to collect enough labeled data to capture event characteristics, but collecting a large amount of unlabeled data is easy, we present a semi-supervised framework using HMM adaptation techniques. Instead of directly training one model for each event, we first train a well-estimated general event model for all events using both labeled and unlabeled data, and then adapt the general model to each specific event model using its own labeled data. We illustrate the proposed approach with a set of eight audio-visual events defined in meetings. Experiments and comparison with the fully-supervised baseline method show the validity of the proposed semi-supervised approach.
REPORT
zhang-rr-05-15/IDIAP
Semi-supervised Meeting Event Recognition with Adapted HMMs
Zhang, Dong
Gatica-Perez, Daniel
Bengio, Samy
EXTERNAL
https://publications.idiap.ch/attachments/reports/2005/rr-05-15.pdf
PUBLIC
Idiap-RR-15-2005
2005
IDIAP
Martigny, Switzerland
Published in ``Prof. IEEE ICME'', July, 2005
This paper investigates the use of unlabeled data to help labeled data for audio-visual event recognition in meetings. To deal with situations in which it is difficult to collect enough labeled data to capture event characteristics, but collecting a large amount of unlabeled data is easy, we present a semi-supervised framework using HMM adaptation techniques. Instead of directly training one model for each event, we first train a well-estimated general event model for all events using both labeled and unlabeled data, and then adapt the general model to each specific event model using its own labeled data. We illustrate the proposed approach with a set of eight audio-visual events defined in meetings. Experiments and comparison with the fully-supervised baseline method show the validity of the proposed semi-supervised approach.