Sept. 18, 2011 to Sept. 21, 2011
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDAR.2011.35
We present a new video character recognition method based on hierarchical classification. In the first step, we propose a method for character segmentation of the text line detected by the text detection method. The segmentation algorithm uses dynamic programming to find least-cost paths in the gray domain to identify the spaces between characters. For the segmented characters, we get a Canny edge image as input for the character recognition step. We introduce hierarchical classification based on voting criteria with structural features to classify 62 character classes into different smaller classes. We divide the perimeter of a character into 8 segments according to 8 directions at the centroid. Then the shape of each segment is studied to recognize the characters based on distances between the centroid and end points, and distances between the midpoint and end points. Our experiments on 1462 characters of upper case, lower case and numerals shows that 10% samples per class for training is enough to obtain 94.5% recognition accuracy. The dataset is chosen from TRECVID database of 2005 and 2006.
Structural features, Hierarchical classification, Invariant features, Confusion matrix, Video character recognition
Palaiahnakote Shivakumara, Trung Quy Phan, Shijian Lu, Chew Lim Tan, "Video Character Recognition through Hierarchical Classification", ICDAR, 2011, 2013 12th International Conference on Document Analysis and Recognition, 2013 12th International Conference on Document Analysis and Recognition 2011, pp. 131-135, doi:10.1109/ICDAR.2011.35