|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2012 10th IAPR International Workshop on Document Analysis Systems
Toward Part-Based Document Image Decoding
Gold Coast, Queensland Australia
March 27-March 29
ISBN: 978-0-7695-4661-2
| ASCII Text | x | ||
| Wang Song, Seiichi Uchida, Marcus Liwicki, "Toward Part-Based Document Image Decoding," Document Analysis Systems, IAPR International Workshop on, pp. 266-270, 2012 10th IAPR International Workshop on Document Analysis Systems, 2012. | |||
| BibTex | x | ||
| @article{ 10.1109/DAS.2012.90, author = {Wang Song and Seiichi Uchida and Marcus Liwicki}, title = {Toward Part-Based Document Image Decoding}, journal ={Document Analysis Systems, IAPR International Workshop on}, volume = {0}, year = {2012}, isbn = {978-0-7695-4661-2}, pages = {266-270}, doi = {http://doi.ieeecomputersociety.org/10.1109/DAS.2012.90}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Document Analysis Systems, IAPR International Workshop on TI - Toward Part-Based Document Image Decoding SN - 978-0-7695-4661-2 SP266 EP270 A1 - Wang Song, A1 - Seiichi Uchida, A1 - Marcus Liwicki, PY - 2012 KW - Document image decoding KW - part-based VL - 0 JA - Document Analysis Systems, IAPR International Workshop on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/DAS.2012.90
Document image decoding (DID) is a trial to understand the contents of a whole document without any reference information about font, language, etc. Typically, DID approaches assume the correct segmentation of the document and some a priori knowledge about the language or the script. Unfortunately, this assumption will not hold if we deal with various documents, such as documents with various sized fonts, camera-captured documents, free-layout documents, or historical documents. In this paper, we propose a part-based character identification method where no segmentation into characters is necessary and no a priori information about the document is needed. The approach clusters similar key points and groups frequent neighboring key point clusters. Then a second iteration is performed, i.e., the groups are again clustered and optionally pairs frequent group clusters are detected. Our first experimental results on multi font-size documents look already very promising. We could find nearly perfect correspondences between characters and detected group clusters.
Index Terms:
Document image decoding, part-based
Citation:
Wang Song, Seiichi Uchida, Marcus Liwicki, "Toward Part-Based Document Image Decoding," das, pp.266-270, 2012 10th IAPR International Workshop on Document Analysis Systems, 2012
Usage of this product signifies your acceptance of the Terms of Use.
