loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Third International Conference on Document Analysis and Recognition (ICDAR'95) - Volume 2
Zone classification in a document using the method of feature vector generation
Montr?al, Canada
August 14-August 15
ISBN: 0-8186-7128-9
R. Sivaramakrishnan, Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
I.T. Phillips, Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
J. Ha, Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
S. Subramanium, Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
R.M. Haralick, Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
A document can be divided into zones on the basis of its content. For example, a zone can be either text or non-text. This paper describes an algorithm to classify each given document zone into one of nine different classes. Features for each zone such as run length mean and variance, spatial mean and variance, fraction of the total number of black pixels in the zone, and the zone width ratio for each zone are extracted. Run length related features are computed along four different canonical directions. A decision tree classifier is used to assign a zone class on the basis of its feature vector. The performance on an independent test set was 97%.
Index Terms:
image classification; document image processing; decision theory; feature extraction; zone classification; document; feature vector generation; run length mean; spatial mean; black pixels; zone width ratio; decision tree classifier; feature vector; performance
Citation:
R. Sivaramakrishnan, I.T. Phillips, J. Ha, S. Subramanium, R.M. Haralick, "Zone classification in a document using the method of feature vector generation," icdar, vol. 2, pp.541, Third International Conference on Document Analysis and Recognition (ICDAR'95) - Volume 2, 1995
Usage of this product signifies your acceptance of the Terms of Use.