2013 12th International Conference on Document Analysis and Recognition (2011)
Sept. 18, 2011 to Sept. 21, 2011
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDAR.2011.14
Document image binarization has been studied for decades, and many practical binarization techniques have been proposed for different kinds of document images. However, many state-of-the-art methods are particularly suitable for the document images that suffer from certain specific type of image degradation or have certain specific type of image characteristics. In this paper, we propose a classification framework to combine different thresholding methods and produce better performance for document image binarization. Given the binarization results of some reported methods, the proposed framework divides the document image pixels into three sets, namely, foreground pixels, background pixels and uncertain pixels. A classifier is then applied to iteratively classify those uncertain pixels into foreground and background, based on the pre-selected froeground and background sets. Extensive experiments over different datasets including the Document Image Binarization Contest(DIBCO)2009 and Handwritten Document Image Binarization Competition(H-DIBCO)2010 show that our proposed framework outperforms most state-of-the-art methods significantly.
document image binarization, pixel classification, thresholding technique combination
Shijian Lu, Chew Lim Tan, Bolan Su, "Combination of Document Image Binarization Techniques", 2013 12th International Conference on Document Analysis and Recognition, vol. 00, no. , pp. 22-26, 2011, doi:10.1109/ICDAR.2011.14