This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol 2
On Segmentation of Documents in Complex Scripts
Curitiba, Parana, Brazil
September 23-September 26
ISBN: 0-7695-2822-8
K.S. Kumar, International Institute of Information Technology, Hyderabad, India
S. Kumar, International Institute of Information Technology, Hyderabad, India
C. Jawahar, International Institute of Information Technology, Hyderabad, India
Document image segmentation algorithms primarily aim at separating text and graphics in presence of complex lay- outs. However, for many non-Latin scripts, segmentation becomes a challenge due to the characteristics of the script. In this paper, we empirically demonstrate that successful al- gorithms for Latin scripts may not be very effective for Indic and complex scripts. We explain this based on the differ- ences in the spatial distribution of symbols in the scripts. We argue that the visual information used for segmenta- tion needs to be enhanced with other information like script models for accurate results.
Citation:
K.S. Kumar, S. Kumar, C. Jawahar, "On Segmentation of Documents in Complex Scripts," icdar, vol. 2, pp.1243-1247, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol 2, 2007
Usage of this product signifies your acceptance of the Terms of Use.