loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2008 The Eighth IAPR International Workshop on Document Analysis Systems
A Comparison of Clustering Methods for Word Image Indexing
September 16-September 19
ISBN: 978-0-7695-3337-7
In this paper we compare three clustering methods used to perform word image indexing. The three methods are: the Self-Organizing Map (SOM), the Growing Hierarchical Self-Organizing Map (GHSOM), and the Spectral Clustering. We test these methods on a real data set composed of word images extracted from an encyclopedia of the XIX-th Century. The word images are grouped on the basis of the clustering methods and subsequently retrieved identifying the closest clusters to a query word. The accuracy of the methods is compared evaluating the performance of the word retrieval algorithm. From the experimental results we conclude that methods designed to automatically determine the number and the structure of clusters, such as GHSOM, are particularly suitable in the context represented by our data set.
Index Terms:
Self Organizing Map, clustering, Document Image Retrieval
Citation:
Simone Marinai, Emanuele Marino, Giovanni Soda, "A Comparison of Clustering Methods for Word Image Indexing," das, pp.671-676, 2008 The Eighth IAPR International Workshop on Document Analysis Systems, 2008
Usage of this product signifies your acceptance of the Terms of Use.