CONF
hynek-rr-05-63a/IDIAP
The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input)
Hermansky, Hynek
Fousek, Petr
Lehtonen, Mikko
EXTERNAL
https://publications.idiap.ch/attachments/reports/2005/rr05-63.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/hynek-rr-05-63
Related documents
Proceedings of 8th International Conference on Text, Speech and Dialogue - TSD 2005
2005
September 2005
IDIAP-RR 2005-63
Natural audio-visual interface between human user and machine requires understanding of user's audio-visual commands. This does not necessarily require full speech and image recognition. It does require, just as the interaction with any working animal does, that the machine is capable of reacting to certain particular sounds and/or gestures while ignoring the rest. Towards this end, we are working on sound identification and classification approaches that would ignore most of the acoustic input and react only to a particular sound (keyword).
REPORT
hynek-rr-05-63/IDIAP
The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input)
Hermansky, Hynek
Fousek, Petr
Lehtonen, Mikko
EXTERNAL
https://publications.idiap.ch/attachments/reports/2005/rr05-63.pdf
PUBLIC
Idiap-RR-63-2005
2005
IDIAP
Published in 8th International Conference on Text, Speech and Dialogue - TSD 2005
Natural audio-visual interface between human user and machine requires understanding of user's audio-visual commands. This does not necessarily require full speech and image recognition. It does require, just as the interaction with any working animal does, that the machine is capable of reacting to certain particular sounds and/or gestures while ignoring the rest. Towards this end, we are working on sound identification and classification approaches that would ignore most of the acoustic input and react only to a particular sound (keyword).