Automatic Temporal Alignment of AV Data with Confidence Estimation
Type of publication: | Conference paper |
Citation: | Korchagin_ICASSP_2010 |
Booktitle: | Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing |
Year: | 2010 |
Month: | 3 |
Location: | Dallas, USA |
Address: | P.O. Box 592, CH-1920 Martigny, Switzerland |
Crossref: | Korchagin_Idiap-RR-40-2009: |
Abstract: | In this paper, we propose a new approach for the automatic audio-based temporal alignment with confidence estimation of audio-visual data, recorded by different cameras, camcorders or mobile phones during social events. All recorded data is temporally aligned based on ASR-related features with a common master track, recorded by a reference camera, and the corresponding confidence of alignment is estimated. The core of the algorithm is based on perceptual time-frequency analysis with a precision of 10 ms. The results show correct alignment in 99% of cases for a real life dataset and surpass the performance of cross correlation while keeping lower system requirements. |
Keywords: | pattern matching, reliability estimation, time synchronization, time-frequency analysis |
Projects |
Idiap TA2 |
Authors | |
Added by: | [UNK] |
Total mark: | 0 |
Attachments
|
|
Notes
|
|
|