The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.05 - May (2003 vol.25)
pp: 628-633
Wai Lam , IEEE
ABSTRACT
<p><b>Abstract</b>—We propose a new approach to text categorization known as generalized instance set (GIS) algorithm under the framework of generalized instance patterns. Our GIS algorithm unifies the strengths of k-NN and linear classifiers and adapts to characteristics of text categorization problems. It focuses on refining the original instances and constructs a set of generalized instances. We also propose a metamodel framework based on category feature characteristics. It has a metalearning phase which discovers a relationship between category feature characteristics and each component algorithm. Extensive experiments have been conducted on two large-scale document corpora for both GIS and the metamodel. The results demonstrate that both approaches generally achieve promising text categorization performance.</p>
INDEX TERMS
Text classification, instance-based learning, metamodel learning.
CITATION
Wai Lam, Yiqiu Han, "Automatic Textual Document Categorization Based on Generalized Instance Sets and a Metamodel", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.25, no. 5, pp. 628-633, May 2003, doi:10.1109/TPAMI.2003.1195997
15 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool