The Community for Technology Leaders
2011 International Conference on Document Analysis and Recognition (2011)
Beijing, China
Sept. 18, 2011 to Sept. 21, 2011
ISSN: 1520-5363
ISBN: 978-0-7695-4520-2
pp: 1120-1124
Content-based video indexing and retrieval has become an interesting research area with the tremendous growth in the amount of digital media. In addition to the audio-visual content, text appearing in videos can serve as a powerful tool for semantic indexing and retrieval of videos. This paper proposes a method based on edge-features for horizontally aligned artificial Urdu text detection from video images. The system exploits edge based segmentation to extract textual content from videos. We first find the vertical gradients in the input video image and average the gradient magnitude in a fixed neighborhood of each pixel. The resulting image is binarized and the horizontal run length smoothing algorithm (RLSA) is applied to merge possible text regions. An edge density filter is then applied to eliminate noisy non-text regions. Finally, the candidate regions satisfying certain geometrical constraints are accepted as text regions. The proposed approach evaluated on a data set of 150 video images exhibited promising results.
Urdu text detection, Gradient edge detection, Run length smoothing alogrithm

F. Arif, A. Jamil, I. Siddiqi and A. Raza, "Edge-Based Features for Localization of Artificial Urdu Text in Video Images," 2011 International Conference on Document Analysis and Recognition(ICDAR), Beijing, China, 2011, pp. 1120-1124.
91 ms
(Ver 3.3 (11022016))