Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol 1
Document Images Retrieval Based on Multiple Features Combination
Curitiba, Parana, Brazil
September 23-September 26
ISBN: 0-7695-2822-8
G. Meng, Institute of Artificial Intelligence and Robotics, Xi?an Jiaotong University, China
N. Zheng, Institute of Artificial Intelligence and Robotics, Xi?an Jiaotong University, China
Y. Song, Institute of Artificial Intelligence and Robotics, Xi?an Jiaotong University, China
Y. Zhang, Institute of Artificial Intelligence and Robotics, Xi?an Jiaotong University, China
Retrieving the relevant document images from a great number of digitized pages with different kinds of artificial variations and documents quality deteriorations caused by scanning and printing is a meaningful and challenging problem. We attempt to deal with this problem by combining up multiple different kinds of document features in a hybrid way. Firstly, two new kinds of document image features based on the projection histograms and crossings number histograms of an image are proposed. Secondly, the proposed two features, together with density distribution feature and local binary pattern feature, are combined in a multistage structure to develop a novel document image retrieval system. Experimental results show that the proposed novel system is very efficient and robust for retrieving different kinds of document images, even if some of them are severely degraded.
Citation:
G. Meng, N. Zheng, Y. Song, Y. Zhang, "Document Images Retrieval Based on Multiple Features Combination," icdar, vol. 1, pp.143-147, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol 1, 2007