Speech Acquisition in Meetings with an Audio-Visual Sensor Array
| Type of publication: | Idiap-RR |
| Citation: | hari-rr-05-03 |
| Number: | Idiap-RR-03-2005 |
| Year: | 2005 |
| Institution: | IDIAP |
| Address: | Martigny, Switzerland |
| Note: | Published in "Proc. IEEE ICME", July 2005 |
| Abstract: | Close-talking headset microphones have traditionally been used for speech acquisition in a number of applications, as they naturally provide a higher signal-to-noise ratio (needed for recognition tasks) than single distant microphones. However, in multi-party conversational settings such as meetings, microphone arrays represent an important alternative to close-talking microphones, as they allow for localisation and tracking of speakers and signal-independent enhancement, while providing a non-intrusive, hands-free mode of operation. In this article, we investigate the joint use of a small table-top microphone array and a camera array for speaker tracking and speech enhancement in meetings. Our methodology first fuses audio and video for person tracking, and then integrates the output of the tracker with a beamformer for speech enhancement. We compare and discuss the features of the resulting speech signal with respect to those obtained from single close-talking and table-top microphones. |
| Userfields: | ipdinar={2005}, ipdmembership={speech}, language={English}, |
| Keywords: | |
| Projects: | Idiap |
| Authors: | |
| Crossref by: | hari-rr-05-03b |