Autonomic Computing, International Conference on (2005)
June 13, 2005 to June 16, 2005
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICAC.2005.46
David Loewenstern , IBM T.J. Watson Research & AC
Sheng Ma , IBM T.J. Watson Research & AC
Abdi Salahshour , IBM T.J. Watson Research & AC
Motivated by the real-world application of categorizing system log messages into defined situation categories, this paper describes an interactive text categorization method, PICCIL1, that leverages supervised machine learning to reduce the burden of assigning categories to documents in large finite data sets but, by coupling human expertise to the machine learning, does so without sacrificing accuracy. PICCIL uses keywords and keyword rules both to preclassify documents and to assist in the manual process of grouping and reviewing documents. The reviewed documents, in turn, are used to refine the keyword rules iteratively to improve subsequent grouping and document review. We apply PICCIL to the problem of assigning semantic situation labels to the entries of a catalog of log events to support on-line labeling of log events.
D. Loewenstern, S. Ma and A. Salahshour, "PICCIL: Interactive Learning to Support Log File Categorization," Autonomic Computing, International Conference on(ICAC), Seattle, Washington, 2005, pp. 311-312.