This paper discusses the knowledge discovery in Text (KDT) system for the ?Request for Comments (RFC) Document Series?. The paper proposes versatile system architecture for the Text Mining in RFC that maintains structured and unstructured data components of the document. The documents are represented by keywords and knowledge discovery is performed by analysing the co-occurrence frequencies of the various keywords representing the document. The clustering of the documents is done by extracted knowledge, which can reduce the search space for searching. The relevant documents retrieved during the search process for a query are ranked based on the relevance of topic in it. This paper describes RFC Viewer, our tool for viewing the RFC document in rich text format rather than in text format, it also provides the knowledge extracted from the RFC document and supports various KDD Operations on the document.
Citation:
Siva Gurusamy, D. Manjula, T.V. Geetha, "Text Mining in ?Request for Comments Document Series?," lec, pp.147, Language Engineering Conference (LEC'02), 2002