ARTICLE
odobez-prl05/IDIAP
Video Text Recognition using Sequential Monte Carlo and Error Voting Methods
Chen, Datong
Odobez, Jean-Marc
EXTERNAL
https://publications.idiap.ch/attachments/reports/2005/odobez-prl.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/chen-rr0343
Related documents
Pattern Recognition Letters
26
9
1386-1403
2005
July 2005
A shorter version of the paper appeared in the techreport.
This paper addresses the issue of segmentation and recognition of text embedded in video sequences from their associated text image sequence extracted by a text detection module. To this end, we propose a probabilistic algorithm based on Bayesian adaptive thresholding and Monte-Carlo sampling. The algorithm approximates the posterior distribution of segmentation thresholds of text pixels in an image by a set of weighted samples. The set of samples is initialized by applying a classical segmentation algorithm on the first video frame and further refined by random sampling under a temporal Bayesian framework. One important contribution of the paper is to show that, thanks to the proposed methodology, the likelihood of a segmentation parameter sample can be estimated not using a classification criterion or a visual quality criterion based on the produced segmentation map, but directly from the induced text recognition result, which is directly relevant to our task. Furthermore, as a second contribution of the paper, we propose to align text recognition results from high confidence samples gathered over time, to composite a final result using error voting technique (ROVER) at the character level. Experiments are conducted on a two hour video database. Character recognition rates higher than 93\%, and word error rates higher than 90\% are achieved, which are 4 and 3\% more than state-of-the-art methods applied to the same database.
REPORT
chen-rr0343/IDIAP
Video Text Segmentation Using Particle Filters
Chen, Datong
Odobez, Jean-Marc
EXTERNAL
https://publications.idiap.ch/attachments/reports/2003/rr03-43.pdf
PUBLIC
Idiap-RR-43-2003
2003
IDIAP
May 2003
published in Int. Journal of Pattern Recognition and Artificial Intelligence
This paper presents a probabilistic algorithm for segmenting and recognizing text embedded in video sequences based on adaptive thresholding using a Bayes filtering method. The algorithm approximates the posterior distribution of segmentation thresholds of video text by a set of weighted samples. The set of samples is initialized by applying a classical segmentation algorithm on the first video frame and further refined by random sampling under a temporal Bayesian framework. This framework allows us to evaluate an text image segmentor on the basis of recognition result instead of visual segmentation result, which is directly relevant to our character recognition task. Results on a database of 6944 images demonstrate the validity of the algorithm.