Continuous Audio-Visual Speech Recognition

Type of publication:	Conference paper
Citation:	luettin-eccv98
Booktitle:	Proc. 5th European Conference on Computer Vision
Series:	Lecture Notes in Computer Science
Volume:	II
Year:	1998
Publisher:	Springer Verlag
Note:	IDIAP-RR 98-02
Crossref:	luettin-rr-98-02: Continuous Audio-Visual Speech Recognition, Luettin, Juergen and Dupont, Stéphane, Idiap-RR-02-1998
Abstract:	We address the problem of robust lip tracking, visual speech feature extraction, and sensor integration for audio-visual speech recognition applications. An appearance-based model of the articulators, which represents linguistically important features, is learned from example images and is used to locate, track, and recover visual speech information. We tackle the problem of joint temporal modelling of the acoustic and visual speech signals by applying Multi-Stream hidden Markov models. This approach allows the use of different temporal topologies and levels of stream integration and hence enables temporal dependencies to be modelled more accurately. The system has been evaluated on a continuously spoken digit recognition task with 37 subjects.
Userfields:	ipdmembership={vision},
Keywords:
Projects	Idiap
Authors	Luettin, Juergen; Dupont, Stéphane
Attachments
eccv98.pdf luettin-eccv98.ps.gz