Issue No. 05 - May (2005 vol. 27)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPAMI.2005.89
Yefeng Zheng , IEEE
Huiping Li , IEEE
David Doermann , IEEE
The detection of groups of parallel lines is important in applications such as form processing and text (handwriting) extraction from rule lined paper. These tasks can be very challenging in degraded documents where the lines are severely broken. In this paper, we propose a novel model-based method which incorporates high-level context to detect these lines. After preprocessing (such as skew correction and text filtering), we use trained Hidden Markov Models (HMM) to locate the optimal positions of all lines simultaneously on the horizontal or vertical projection profiles, based on the Viterbi decoding. The algorithm is trainable so it can be easily adapted to different application scenarios. The experiments conducted on known form processing and rule line detection show our method is robust, and achieves better results than other widely used line detection methods.
Line detection, form processing, form registration, form identification, hidden Markov model, document image analysis.
Yefeng Zheng, Huiping Li, David Doermann, "A Parallel-Line Detection Algorithm Based on HMM Decoding", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 27, no. , pp. 777-792, May 2005, doi:10.1109/TPAMI.2005.89