Issue No. 06 - November/December (1999 vol. 11)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/69.824599
<p><b>Abstract</b>—We develop an automatic text categorization approach and investigate its application to text retrieval. The categorization approach is derived from a combination of a learning paradigm known as instance-based learning and an advanced document retrieval technique known as retrieval feedback. We demonstrate the effectiveness of our categorization approach using two real-world document collections from the MEDLINE database. Next, we investigate the application of automatic categorization to text retrieval. Our experiments clearly indicate that automatic categorization improves the retrieval performance compared with no categorization. We also demonstrate that the retrieval performance using automatic categorization achieves the same retrieval quality as the performance using manual categorization. Furthermore, detailed analysis of the retrieval performance on each individual test query is provided.</p>
Text categorization, automatic classification, text retrieval, instance-based learning, query processing.
M. Ruiz, P. Srinivasan and W. Lam, "Automatic Text Categorization and Its Application to Text Retrieval," in IEEE Transactions on Knowledge & Data Engineering, vol. 11, no. , pp. 865-879, 1999.