This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Offline Arabic Handwriting Recognition: A Survey
May 2006 (vol. 28 no. 5)
pp. 712-724
Liana M. Lorigo, IEEE Computer Society
Venu Govindaraju, IEEE Computer Society
The automatic recognition of text on scanned images has enabled many applications such as searching for words in large volumes of documents, automatic sorting of postal mail, and convenient editing of previously printed documents. The domain of handwriting in the Arabic script presents unique technical challenges and has been addressed more recently than other domains. Many different methods have been proposed and applied to various types of images. This paper provides a comprehensive review of these methods. It is the first survey to focus on Arabic handwriting recognition and the first Arabic character recognition survey to provide recognition rates and descriptions of test data for the approaches discussed. It includes background on the field, discussion of the methods, and future research directions.

[1] Ethnologue: Languages of the World, 14th ed. SIL Int'l, 2000.
[2] Y.S. Chen and W.H. Hsu, “A New Parallel Thinning Algorithm for Binary Image,” Proc. Nat'l Computer Symp., pp. 295-299, 1985.
[3] H. Freeman, “On the Encoding of Arbitrary Geometric Configurations,” IRE Trans. Electronic Computing, vol. 10, pp. 260-268, 1961.
[4] S. Madhvanath, G. Kim, and V. Govindaraju, “Chain Code Processing for Handwritten Word Recognition,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 21, pp. 928-932, 1999.
[5] M. Pechwitz and V. Märgner, “Baseline Estimation for Arabic Handwritten Words,” Proc. Eighth Int'l Workshop Frontiers in Handwriting Recognition, pp. 479-484, 2002.
[6] F. Farooq, V. Govindaraju, and M. Perrone, “Preprocessing Methods for Handwritten Arabic Documents,” Proc. Int'l Conf. Document Analysis and Recognition, pp. 267-271, 2005.
[7] M. Soleymani and F. Razzazi, “An Efficient Front-End System for Isolated Persian/Arabic Character Recognition of Handwritten Data-Entry Forms,” Int'l J. Computational Intelligence, vol. 1, pp. 193-196, 2003.
[8] H.Y. Abdelazim, “Arabic Script Recognition Using Hopfield Networks,” Int'l J. Computers and Their Applications, vol. 2, pp. 43-49, 1995.
[9] M.S. El-Wakil and A. Shoukry, “On-Line Recognition of Handwritten Isolated Arabic Characters,” Pattern Recognition, vol. 22, pp. 97-105, 1989.
[10] S. Al-Emami and M. Usher, “On-Line Recognition of Handwritten Arabic Characters,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 12, pp. 704-710, 1990.
[11] I.S.I. Abuhaiba and P. Ahmed, “Restoration of Temporal Information in Off-Line Arabic Handwriting,” Pattern Recognition, vol. 26, pp. 1009-1017, 1993.
[12] R.O. Duda, P.E. Hart, and D.G. Stork, Pattern Classification, second ed. John Wiley and Sons, 2001.
[13] L.R. Rabiner and B.H. Juang, “An Introduction to Hidden Markov Models,” IEEE Acoustics, Speech, and Signal Processing Magazine, vol. 3, pp. 4-16, 1986.
[14] N. BenAmara, A. Belaïd, and N. Ellouze, “Utilisation des Modèles Markoviens en Reconnaissance de l'Écriture Arabe: Etat de L'art,” Proc. Colloque Int'l Francophone sur l'Ecrit et le Document, 2000.
[15] S. Madhvanath and V. Govindaraju, “The Role of Holistic Paradigms in Handwritten Word Recognition,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, pp. 149-164, 2001.
[16] B. Al-Badr and S.A. Mahmoud, “Survey and Bibliography of Arabic Optical Text Recognition,” Signal Processing, vol. 41, pp. 49-77, 1995.
[17] A. Amin, “Offline Arabic Character Recognition: The State of the Art,” Pattern Recognition, vol. 31, pp. 517-530, 1998.
[18] M.S. Khorsheed, “Off-Line Arabic Character Recognition— A Review,” Pattern Analysis and Applications, vol. 5, pp. 31-45, 2002.
[19] A.S. Eldin and A.S. Nouh, “Arabic Character Recognition: A Survey,” Proc. SPIE Conf. Optical Pattern Recognition, pp. 331-340, 1998.
[20] A. Nouh, A. Sultan, and R. Tolba, “An Approach for Arabic Characters Recognition,” J. Eng. Science, vol. 6, pp. 185-191, 1980.
[21] A. Nouh, A. Sultan, and R. Tolba, “On Feature Extraction and Selection for Arabic Character Recognition,” Arab Gulf J. Scientific Research, vol. 2, pp. 329-347, 1984.
[22] B. Parhami and M. Taraghi, “Automatic Recognition of Printed Farsi Texts,” Pattern Recognition, vol. 14, pp. 395-403, 1981.
[23] A. Amin and G. Masini, “Machine Recognition of Multi-Font Printed Arabic Texts,” Proc. Int'l Conf. Pattern Recognition, pp. 392-395, 1986.
[24] T.S. El-Sheikh and R.M. Guindi, “Computer Recognition of Arabic Cursive Scripts,” Pattern Recognition, vol. 21, pp. 293-302, 1988.
[25] S. Sami El-Dabi, R. Ramsis, and A. Kamel, “Arabic Character Recognition System: A Statistical Approach for Recognizing Cursive Typewritten Text,” Pattern Recognition, vol. 23, pp. 485-495, 1990.
[26] M.R. Hashemi, O. Fatemi, and R. Safavi, “Persian Cursive Script Recognition,” Proc. Int'l Conf. Document Analysis and Recognition, pp. 869-873, 1995.
[27] A. Ymin and Y. Aoki, “On the Segmentation of Multi-Font Printed Uygur Scripts,” Proc. 13th Int'l Conf. Pattern Recognition, vol. 3, pp. 215-219, 1996.
[28] B. Al-Badr and R. Haralick, “Segmentation-Free Word Recognition with Application to Arabic,” Proc. Int'l Conf. Document Analysis and Recognition, pp. 355-359, 1995.
[29] B. Al-Badr and R. Haralick, “A Segmentation-Free Approach to Text Recognition with Application to Arabic Text,” Int'l J. Document Analysis and Recognition, vol. 1, pp. 147-166, 1998.
[30] J. Alherbish, R.A. Ammar, and M. Abdalla, “Arabic Character Recognition in a Multiprocessing Environment,” Proc. IEEE Symp. Computers and Comm., pp. 286-292, 1997.
[31] M.S. Khorsheed and W.F. Clocksin, “Structural Features of Cursive Arabic Script,” Proc. British Machine Vision Conf., pp. 422-431, 1999.
[32] J. Makhoul, R. Schwartz, C. Lapre, and I. Bazzi, “A Script-Independent Methodology for Optical Character Recognition,” Pattern Recognition, vol. 31, pp. 1285-1294, 1998.
[33] P. Natarajan, M. Decerbo, T. Keller, R. Schwartz, and J. Makhoul, “Porting the BBN BYBLOS OCR System to New Languages,” Proc. Symp. Document Image Understanding Technology, pp. 47-52, 2003.
[34] M. Decerbo, P. Natarajan, R. Prasad, E. MacRostie, and A. Ravindran, “Performance Improvements to the BBN Byblos OCR System,” Proc. Int'l Conf. Document Analysis and Recognition, pp. 411-415, 2005.
[35] I. Bazzi, R. Schwartz, and J. Makhoul, “An Omnifont Open-Vocabulary OCR System for English and Arabic,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 21, pp. 495-504, 1999.
[36] J. Trenkle, A. Gillies, E. Erlandson, S. Schlosser, and S. Cavin, “Advances in Arabic Text Recognition,” Proc. Symp. Document Image Understanding Technology, 2001.
[37] J. Trenkle, A. Gillies, E. Erlandson, and S. Schlosser, “Arabic Character Recognition,” Proc. Symp. Document Image Understanding Technology, pp. 191-195, 1995.
[38] A. Gillies, E. Erlandson, J. Trenkle, and S. Schlosser, “Arabic Text Recognition System,” Proc. Symp. Document Image Understanding Technology, 1999.
[39] J. Trenkle, A. Gillies, and S. Schlosser, “An Off-Line Arabic Recognition System for Machine-Printed Arabic Documents,” Proc. Symp. Document Image Understanding Technology, pp. 155-161, 1997.
[40] L. Hamami and D. Berkani, “Recognition System for Printed Multifont and Multisize Arabic Characters,” The Arabian J. Science and Eng., vol. 27, pp. 57-72, 2002.
[41] S.A. Al-Qahtani and M.S. Khorsheed, “An Omni-Font HTK-Based Arabic Recognition System,” Proc. Eighth IASTED Int'l Conf. Artificial Intelligence and Soft Computing, 2004.
[42] S.A. Al-Qahtani and M.S. Khorsheed, “A HTK-Based System to Recognise Arabic Script,” Proc. Fourth IASTED Int'l Conf. Visualization, Imaging, and Image Processing, 2004.
[43] M. Pechwitz and V. Märgner, “HMM Based Approach for Handwritten Arabic Word Recognition Using the IFN/ENIT-Database,” Proc. Int'l Conf. Document Analysis and Recognition, pp. 890-894, 2003.
[44] A. Amin, “Recognition of Hand-Printed Characters Based on Structural Description and Inductive Logic Programming,” Pattern Recognition Letters, vol. 24, pp. 3187-3196, 2003.
[45] L. Souici-Meslati and M. Sellami, “A Hybrid Approach for Arabic Literal Amounts Recognition,” The Arabian J. Science and Eng., vol. 29, pp. 177-194, 2004.
[46] M.S. Khorsheed, “Recognising Handwritten Arabic Manuscripts Using a Single Hidden Markov Model,” Pattern Recognition Letters, vol. 24, pp. 2235-2242, 2003.
[47] R. Davidson and R. Hopely, “Arabic and Persian OCR Training and Test Data Sets,” Proc. Symp. Document Image Understanding Technology, pp. 303-307, 1997.
[48] T. Kanungo, G. Marton, and O. Bulbul, “OmniPage vs. Sakhr: Paired Model Evaluation of Two Arabic OCR Products,” Proc. SPIE Conf. Document Recognition and Retrieval (VI), pp. 109-121, 1999.
[49] I.S. Abuhaiba, “A Discrete Arabic Script for Better Automatic Document Understanding,” The Arabian J. Science and Eng., vol. 28, pp. 77-94, 2003.
[50] J.J. Hull, “A Database for Handwritten Text Recognition Research,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 16, pp. 550-554, 1994.
[51] S. Alma'adeed, D. Elliman, and C.A. Higgins, “A Data Base for Arabic Handwritten Text Recognition Research,” Proc. Eighth Int'l Workshop Frontiers in Handwriting Recognition, pp. 485-489, 2002.
[52] Y. Al-Ohali, M. Cheriet, and C. Suen, “Databases for Recognition of Handwritten Arabic Cheques,” Pattern Recognition, vol. 36, pp. 111-121, 2003.
[53] I.S.I. Abuhaiba, S.A. Mahmoud, and R.J. Green, “Recognition of Handwritten Cursive Arabic Characters,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 16, pp. 664-672, 1994.
[54] A. Amin, H. Al-Sadoun, and S. Fischer, “Hand-Printed Arabic Character Recognition System Using an Artificial Network,” Pattern Recognition, vol. 29, pp. 663-675, 1996.
[55] I.S.I. Abuhaiba, M.J.J. Holt, and S. Datta, “Recognition of Off-Line Cursive Handwriting,” Computer Vision and Image Understanding, vol. 71, pp. 19-38, 1998.
[56] A. Dehghani, F. Shabani, and P. Nava, “Off-Line Recognition of Isolated Persian Handwritten Characters Using Multiple Hidden Markov Models,” Proc. Int'l Conf. Information Technology: Coding and Computing, pp. 506-510, 2001.
[57] A.A. Al-Shaher and E.R. Hancock, “Learning Mixtures of Point Distribution Models with the EM Algorithm,” Pattern Recognition, vol. 36, pp. 2805-2818, 2003.
[58] S. Mozaffari, K. Faez, and M. Ziaratban, “Structural Decomposition and Statistical Description of Farsi/Arabic Handwritten Numeric Characters,” Proc. Int'l Conf. Document Analysis and Recognition, pp. 237-241, 2005.
[59] S. Alma'adeed, C. Higgens, and D. Elliman, “Recognition of Off-Line Handwritten Arabic Words Using Hidden Markov Model Approach,” Proc. 16th Int'l Conf. Pattern Recognition, vol. 3, pp. 481-484, 2002.
[60] S. Alma'adeed, C. Higgens, and D. Elliman, “Off-Line Recognition of Handwritten Arabic Words Using Multiple Hidden Markov Models,” Knowledge-Based Systems, vol. 17, pp. 75-79, 2004.
[61] R. Haraty and A. Hamid, “Segmenting Handwritten Arabic Text,” Proc. Int'l Conf. Computer Science, Software Eng., Information Technology, e-Business, and Applications, 2002.
[62] R. Haraty and H. El-Zabadani, “Abjad: An Off-Line Arabic Handwritten Recognition System,” Proc. Int'l Arab Conf. Information Technology, 2002.
[63] R. Haraty and C. Ghaddar, “Neuro-Classification for Handwritten Arabic Text,” Proc. ACS/IEEE Int'l Conf. Computer Systems and Applications, 2003.
[64] R. Haraty and C. Ghaddar, “Arabic Text Recognition,” Int'l Arab J. Information Technology, vol. 1, pp. 156-163, 2004.
[65] M.M.M. Fahmy and S. Al Ali, “Automatic Recognition of Handwritten Arabic Characters Using Their Geometrical Features,” Studies in Informatics and Control J., vol. 10, 2001.
[66] H. Goraine, M. Usher, and S. Al-Emami, “Off-Line Arabic Character Recognition,” Computer, vol. 25, pp. 71-74, 1992.
[67] H. Almuallim and S. Yamaguchi, “ A Method of Recognition of Arabic Cursive Handwriting,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 9, pp. 715-722, 1987.
[68] R. Safabakhsh and P. Adibi, “Nastaaligh Handwritten Word Recognition Using a Continuous-Density Variable-Duration HMM,” The Arabian J. Science and Eng., vol. 30, pp. 95-118, 2005.
[69] N. Farah, L. Souici, L. Farah, and M. Sellami, “Arabic Words Recognition with Classifiers Combination: An Application to Literal Amounts,” Proc. Artificial Intelligence: Methodology, Systems, and Applications, pp. 420-429, 2004.
[70] L. Souici, N. Farah, T. Sari, and M. Sellami, “Rule Based Neural Networks Construction for Handwritten Arabic City-Names Recognition,” Proc. Artificial Intelligence: Methodology, Systems, and Applications, pp. 331-340, 2004.
[71] T. Sari, L. Souici, and M. Sellami, “Off-Line Handwritten Arabic Character Segmentation Algorithm: ACSA,” Proc. Int'l Workshop Frontiers in Handwriting Recognition, pp. 452-457, 2002.
[72] S. Snoussi Maddouri, H. Amiri, A. Belaid, and C. Choisy, “Combination of Local and Global Vision Modeling for Arabic Handwritten Words Recognition,” Proc. Int'l Conf. Frontiers in Handwriting Recognition, pp. 128-135, 2002.
[73] M. Dehghan, K. Faez, M. Ahmadi, and M. Shridhar, “Handwritten Farsi (Arabic) Word Recognition: A Holistic Approach Using Discrete HMM,” Pattern Recognition, vol. 34, pp. 1057-1065, 2001.
[74] G. Olivier, H. Miled, K. Romeo, and Y. Lecourtier, “Segmentation and Coding of Arabic Handwritten Words,” Proc. 13th Int'l Conf. Pattern Recognition, vol. 3, pp. 264-268, 1996.
[75] H. Miled and N.E. Ben Amara, “Planar Markov Modeling for Arabic Writing Recognition: Advancement State,” Proc. Int'l Conf. Document Analysis and Recognition, pp. 69-73, 2001.
[76] K. Mostafa and A.M. Darwish, “Robust Base-Line Independent Algorithms for Segmentation and Reconstruction of Arabic Handwritten Cursive Script,” Proc. IS&T/SPIE Conf. Document Recognition and Retrieval VI, vol. 3651, pp. 73-83, 1999.
[77] K. Romeo-Pakker, H. Miled, and Y. Lecourtier, “A New Approach for Latin/Arabic Character Segmentation,” Proc. Int'l Conf. Document Analysis and Recognition, pp. 874-877, 1995.
[78] R. El-Hajj, L. Likforman-Sulem, and C. Mokbel, “Arabic Handwriting Recognition Using Baseline Dependant Features and Hidden Markov Modeling,” Proc. Int'l Conf. Document Analysis and Recognition, pp. 893-897, 2005.
[79] W.F. Clocksin and P.P.J. Fernando, “Towards Automatic Transcription of Syriac Handwriting,” Proc. Int'l Conf. Image Analysis and Processing, pp. 664-669, 2003.
[80] D. Motawa, A. Amin, and R. Sabourin, “Segmentation of Arabic Cursive Script,” Proc. Int'l Conf. Document Analysis and Recognition, vol. 2, pp. 625-628, 1997.
[81] H. Al-Yousefi and S.S. Udpa, “Recognition of Arabic Characters,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 14, pp. 853-857, 1992.
[82] H.S. Park and S.W. Lee, “Off-Line Recognition of Large-Set Handwritten Characters with Multiple Hidden Markov Models,” Pattern Recognition, vol. 29, pp. 231-244, 1996.
[83] A.A. Al-Shaher and E.R. Hancock, “Arabic Character Recognition Using Shape Mixtures,” Proc. British Machine Vision Conf., pp. 497-506, 2002.
[84] T.F. Cootes and C.J. Taylor, “A Mixture Model for Representing Shape Variation,” Image and Vision Computing, vol. 17, pp. 567-573, 1999.
[85] M. Sellami, L. Souici, T. Sari, and Z. Zemirli, “Contribution à la Reconnaissance de Mots Arabes Manuscrits,” Proc. Colloque Africain de Recherche en Informatique, pp. 122-124, 1998.
[86] L. Lorigo and V. Govindaraju, “Segmentation and Pre-Recognition of Arabic Handwriting,” Proc. Int'l Conf. Document Analysis and Recognition, pp. 605-609, 2005.
[87] J.R. Quinlan, “Learning Logical Definitions from Relations,” Machine Learning, vol. 5, pp. 239-266, 1990.
[88] M.-Y. Chen, A. Kundu, and S.N. Srihari, “Variable Duration Hidden Markov Model and Morphological Segmentation for Handwritten Word Recognition,” IEEE Trans. Image Processing, vol. 4, pp. 1675-1688, 1995.
[89] N. Farah, A. Ennaji, T. Khadir, and M. Sellami, “Benefits of Multi-Classifier Systems for Arabic Handwritten Words Recognition,” Proc. Int'l Conf. Document Analysis and Recognition, pp. 222-226, 2005.
[90] V. Märgner, M. Pechwitz, and H. ElAbed, “ICDAR 2005 Arabic Handwriting Recognition Competition,” Proc. Int'l Conf. Document Analysis and Recognition, pp. 70-74, 2005.
[91] J. Jin, H. Wang, X. Ding, and L. Peng, “Printed Arabic Document Recognition System,” Proc. SPIE-IS&T Electronic Imaging, vol. 5676, pp. 48-55, 2005.
[92] C. Mokbel, H. Abi Akl, and H. Greige, “Automatic Speech Recognition of Arabic Digits over Telefone Network,” Proc. Int'l Conf. Research Trends in Science and Technology, 2002.
[93] S.M. Touj and N. Ben Amara, “Arabic Handwritten Words Recognition Based on a Planar Hidden Markov Model,” Int'l Arab J. Information Technology, vol. 2, 2005.
[94] H. Bunke, S. Bengio, and A. Vinciarelli, “Off-Line Recognition of Unconstrained Handwritten Texts Using HMMS and Statistical Language Models,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 26, pp. 709-720, 2004.
[95] G. Kim, V. Govindaraju, and S. Srihari, “Architecture for Handwritten Text Recognition Systems,” Advances in Handwriting Recognition, Series in Machine Perception and Artificial Intelligence, pp. 163-172, 1999.
[96] T.A. El-Sadany and M.A. Hashish, “An Arabic Morphological System,” IBM Sytems J., vol. 28, pp. 600-612, 1989.
[97] I.A. Al-Sughaiyer and I.A. Al-Kharashi, “Arabic Morphological Analysis Techniques: A Comprehensive Survey,” J. Am. Soc. Information Science and Technology, vol. 55, pp. 189-213, 2004.
[98] S. Kanoun, A.M. Alimi, and Y. Lecourtier, “Affixal Approach for Arabic Decomposable Vocabulary Recognition: A Validation on Printed Word in Only One Font,” Proc. Int'l Conf. Document Analaysis and Recognition, pp. 1025-1029, 2005.
[99] M. Yaseen, B. Haddad, H. Papageorgiou, S. Piperidis, M. Hattab, N. Theophilopoulos, and S. Krauwer, “A Term Base Translator over the Web,” Proc. ACL/EACL 2001 Workshop— ARABIC Language Processing: Status and Prospects, pp. 58-65, 2001.
[100] I.A. Al-Kharashi, “A Web Search Engine for Indexing, Searching and Publishing Arabic Bibliographic Databases,” Proc. Internet Global Summit, 1999.
[101] I.A. Al-Kharashi and M.W. Evens, “Comparing Words, Stems, and Roots as Index Terms in an Arabic Information Retrieval System,” J. Am. Soc. Information Science, vol. 45, pp. 548-560, 1994.

Index Terms:
Computer vision, document analysis, handwriting analysis, optical character recognition.
Citation:
Liana M. Lorigo, Venu Govindaraju, "Offline Arabic Handwriting Recognition: A Survey," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 5, pp. 712-724, May 2006, doi:10.1109/TPAMI.2006.102
Usage of this product signifies your acceptance of the Terms of Use.