Intelligent Agent Technology, IEEE / WIC / ACM International Conference on (2003)
Oct. 13, 2003 to Oct. 17, 2003
Bing Shen , Binghamton University
Zhongfei (Mark) Zhang , Binghamton University
Chunfa Yuan , Tsinghua University
This research is about automatic identification and extraction of person names in Chinese text documents. Solutions to this problem have immediate and extensive applications in many areas especially in Web Intelligent Agents related applications such as Web search engines, Web data mining, and automatic Web information analysis. We have noted that while finite state automata (FSA) based techniques have been extensively used in NLP and IE in English, they have not yet been extensively used in processing Chinese text, and in particular, to our knowledge, no work has been reported in using FSA in person name identification and extraction. Motivated by this need, we have proposed a person name identification method based on FSA, called NICF. Evaluations show that NICF works very well in terms of identification recall and accuracy, as well as the processing speed, and thus holds a great promise for future applications.
C. Yuan, Z. (. Zhang and B. Shen, "Person Name Identification in Chinese Documents Using Finite State Automata," Intelligent Agent Technology, IEEE / WIC / ACM International Conference on(IAT), Halifax, Canada, 2003, pp. 478.