CSDL Home IEEE Transactions on Pattern Analysis & Machine Intelligence 2011 vol.33 Issue No.10 - October
Issue No.10 - October (2011 vol.33)
Anne-Laure Bianne-Bernard , A2iA SA, Artificial Intelligence and Image Analysis and Telecom ParisTech/TSI and CNRS LTCI, Paris
Farès Menasri , A2iA SA, Artificial Intelligence and Image Analysis, Paris
Rami Al-Hajj Mohamad , University of Balamand, Lebanon
Chafic Mokbel , University of Balamand, Lebanon
Christopher Kermorvant , A2iA SA, Artificial Intelligence and Image Analysis, Paris
Laurence Likforman-Sulem , Telecom ParisTech/TSI and CNRS LTCI, Paris
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPAMI.2011.22
This study aims at building an efficient word recognition system resulting from the combination of three handwriting recognizers. The main component of this combined system is an HMM-based recognizer which considers dynamic and contextual information for a better modeling of writing units. For modeling the contextual units, a state-tying process based on decision tree clustering is introduced. Decision trees are built according to a set of expert-based questions on how characters are written. Questions are divided into global questions, yielding larger clusters, and precise questions, yielding smaller ones. Such clustering enables us to reduce the total number of models and Gaussians densities by 10. We then apply this modeling to the recognition of handwritten words. Experiments are conducted on three publicly available databases based on Latin or Arabic languages: Rimes, IAM, and OpenHart. The results obtained show that contextual information embedded with dynamic modeling significantly improves recognition.
Latin and Arabic handwriting recognition, context-dependent HMMs, neural-network combination.
Anne-Laure Bianne-Bernard, Farès Menasri, Rami Al-Hajj Mohamad, Chafic Mokbel, Christopher Kermorvant, Laurence Likforman-Sulem, "Dynamic and Contextual Information in HMM Modeling for Handwritten Word Recognition", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.33, no. 10, pp. 2066-2080, October 2011, doi:10.1109/TPAMI.2011.22