Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol 1
Strategies for Large Handwritten Farsi/Arabic Lexicon Reduction
Curitiba, Parana, Brazil
September 23-September 26
ISBN: 0-7695-2822-8
S. Mozaffari, Amirkabir University of Technology, Tehran, Iran.
K. Faez, Amirkabir University of Technology, Tehran, Iran.
V. Märgner, Technical University of Braunschweig, Germany
H. El-Abed, Technical University of Braunschweig, Germany
Given a large number of words to be recognized, a lexicon reduction strategy that eliminates unlikely candidates before recognition can be a reasonable and powerful approach for increasing recognition speed. In this paper, we describe a holistic approach to large Arabic handwritten lexicon reduction that is based on inherent properties of Arabic writing. The principle of this technique is to extract dots, diacritics, and subwords from the cursive Arabic word image to describe its shape. In the first stage of lexicon reduction, the number of subwords in the input word is estimated. In the second stage, a word descriptor based on the dot and diacritic information is applied only to the candidates selected in the first stage. Experimental results on the IFN/ENIT database, consisting of 26,459 cursive Arabic word images, show a lexicon reduction of 92.5% with an accuracy of 74%.
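The two-stage filtering described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `LexiconEntry` fields, the string-valued `dot_code` descriptor (for example, "U" for a dot group above the baseline and "L" for one below), the tolerance parameters, and the use of edit distance for descriptor matching are all assumptions made for illustration. The abstract does not specify how the descriptor is encoded or compared, and the reduction rate below is assumed to be the fraction of lexicon entries removed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LexiconEntry:
    word: str           # lexicon label (hypothetical)
    subword_count: int  # number of subwords in the word
    dot_code: str       # hypothetical dot/diacritic descriptor, e.g. "UUL"


def edit_distance(a: str, b: str) -> int:
    """Standard Levenshtein distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


def reduce_lexicon(lexicon, est_subwords, est_dot_code,
                   subword_tol=0, max_dist=1):
    """Two-stage reduction: subword count first, then dot descriptor."""
    # Stage 1: keep entries whose subword count is close to the estimate.
    stage1 = [e for e in lexicon
              if abs(e.subword_count - est_subwords) <= subword_tol]
    # Stage 2: among stage-1 survivors only, keep entries whose
    # descriptor is close to the one extracted from the input image.
    return [e for e in stage1
            if edit_distance(e.dot_code, est_dot_code) <= max_dist]


def reduction_rate(original_size: int, reduced_size: int) -> float:
    """Fraction of lexicon entries eliminated (assumed definition)."""
    return 1.0 - reduced_size / original_size


# Toy lexicon with made-up entries, for demonstration only.
lexicon = [
    LexiconEntry("w1", 1, "UL"),
    LexiconEntry("w2", 2, "UUL"),
    LexiconEntry("w3", 2, "UU"),
    LexiconEntry("w4", 4, "UUL"),
    LexiconEntry("w5", 3, "LLLL"),
]
candidates = reduce_lexicon(lexicon, est_subwords=2, est_dot_code="UUL")
print([e.word for e in candidates], reduction_rate(len(lexicon), len(candidates)))
```

Running the cheap subword-count test first means the descriptor comparison is applied only to a smaller candidate set. The tolerances trade reduction against the risk of discarding the correct word, which is the kind of trade-off the reported reduction and accuracy figures reflect.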
Citation:
S. Mozaffari, K. Faez, V. Märgner, H. El-Abed, "Strategies for Large Handwritten Farsi/Arabic Lexicon Reduction," Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), vol. 1, pp. 98-102, 2007