loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
15th International Conference on Pattern Recognition (ICPR'00) - Volume 4
Image-Based Document Vectors for Text Retrieval
Barcelona, Spain
September 03-September 08
ISBN: 0-7695-0750-6
Zhaohui Yu, National University of Singapore
Chew Lim Tan, National University of Singapore
We propose a method for constructing a vector for a document image to represent its content to facilitate text retrieval. The method is based on an N-Gram algorithm for text similarity measure based on the frequency of occurrence of n-character strings appearing in the electronic text. Instead of using ASCII values, the present study investigates the use of character images to obtain the document vector and has found promising results for use in our news article retrieval project.
Index Terms:
Document Image, Text Retrieval, Similarity Measure, N-Gram Algorithm
Citation:
Zhaohui Yu, Chew Lim Tan, "Image-Based Document Vectors for Text Retrieval," icpr, vol. 4, pp.4393, 15th International Conference on Pattern Recognition (ICPR'00) - Volume 4, 2000
Usage of this product signifies your acceptance of the Terms of Use.