Eighth International Conference on Document Analysis and Recognition (ICDAR'05) Fast Convolutional OCR with the Scanning N-Tuple Grid Seoul, Korea August 31-September 01 ISBN: 0-7695-2420-6
This paper introduces a novel high speed convolutional character recognition system. Convolutional mode operation means that no prior localization or segmentation of characters is required, making this mode extremely robust. The method uses a 2-d n-tuple grid to sample the image, but decomposes the address calculations into two onedimensional scans. This simple innovation leads to a very fast system, and speeds in excess of 100,000 recognitions per second have been achieved for a 10-class character recognition problem, when operated in convolutional mode. Quantitative performance results show an error rate of 4.3% on the MNist dataset of isolated hand-written characters. Qualitative results are presented on museum archive card images, indicating that the method has great potential for the character recognition component in a document image analysis system for images of this type.
Citation:
Simon M. Lucas, Kyu Tae Cho, "Fast Convolutional OCR with the Scanning N-Tuple Grid," icdar, pp.799-805, Eighth International Conference on Document Analysis and Recognition (ICDAR'05), 2005 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||