Computer Science and Information Engineering, World Congress on (2009)
Los Angeles, California USA
Mar. 31, 2009 to Apr. 2, 2009
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/CSIE.2009.562
Semi-structured Chinese document analysis is an especially difficult task because of the documents' complex structure and the semantics of Chinese. Drawing on the generic characteristics of semi-structured documents and the specific characteristics of resume documents, this paper studies resume document block analysis based on pattern matching, and also proposes multi-level information identification and feedback control algorithms. Based on this research, a Resume Parser system was implemented for ChinaHR, which is the biggest recruitment website. The system can read, analyze, retrieve, and store the information automatically. According to the results of a range of experiments, the accuracy and efficiency of the system generally satisfy practical requirements. As research on the processing of semi-structured documents, this work can serve not only as a guide for further research on resume analysis, but also as a reference for other forms of semi-structured documents.
document analysis, semi-structured, resume parsing, pattern matching
W. Ming, L. Zhi-qing, X. Bo, Z. Chuang and L. C. Guang, "Resume Parser: Semi-structured Chinese Document Analysis," 2009 WRI World Congress on Computer Science and Information Engineering (CSIE), Los Angeles, CA, 2009, pp. 12-16.