Vol. 16, No. 2, February 2004
pp. 203-215
Mark Last, Oded Maimon
ABSTRACT
Abstract—We describe and evaluate an information-theoretic algorithm for data-driven induction of classification models based on a minimal subset of the available features. The relationship between the input (predictive) features and the target (classification) attribute is modeled by a tree-like structure termed an information network (IN). Unlike other decision-tree models, the information network uses the same input attribute across all nodes of a given layer (level). The input attributes are selected incrementally by the algorithm to maximize the global decrease in the conditional entropy of the target attribute. We use a prepruning approach: when no attribute causes a statistically significant decrease in the entropy, network construction stops. The algorithm is shown empirically to produce much more compact models than other methods of decision-tree learning while preserving nearly the same level of classification accuracy.
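The greedy, entropy-driven attribute selection described in the abstract can be illustrated with a short sketch. The Python code below is only a simplified illustration of the selection criterion, not the authors' IN implementation: the flat select_attributes routine, the min_gain threshold (standing in for the paper's statistical significance test), and all identifiers are assumptions made for this example.

    import math
    from collections import Counter, defaultdict

    def entropy_of_counts(counts):
        """Shannon entropy (in bits) of a frequency table."""
        n = sum(counts.values())
        return -sum((c / n) * math.log2(c / n) for c in counts.values() if c)

    def conditional_entropy(rows, given_idxs, target_idx):
        """H(target | given attributes): target entropy within each group,
        weighted by the group's relative frequency."""
        n = len(rows)
        groups = defaultdict(Counter)
        for row in rows:
            key = tuple(row[i] for i in given_idxs)
            groups[key][row[target_idx]] += 1
        return sum((sum(c.values()) / n) * entropy_of_counts(c)
                   for c in groups.values())

    def select_attributes(rows, candidates, target_idx, min_gain=0.01):
        """Greedy selection in the spirit of the information network:
        at each step, add the attribute yielding the largest drop in the
        conditional entropy of the target; stop (prepruning) when no
        candidate clears the threshold.  min_gain is a hypothetical stand-in
        for the paper's statistical significance test."""
        selected, remaining = [], list(candidates)
        current_h = conditional_entropy(rows, selected, target_idx)  # H(target)
        while remaining:
            best, best_h = None, current_h
            for a in remaining:
                h = conditional_entropy(rows, selected + [a], target_idx)
                if h < best_h:
                    best, best_h = a, h
            if best is None or current_h - best_h < min_gain:
                break
            selected.append(best)
            remaining.remove(best)
            current_h = best_h
        return selected

A call such as select_attributes(rows, candidates=[0, 1, 2], target_idx=3) would return the ordered list of chosen attribute indices; in the full algorithm each selected attribute defines one layer of the network rather than a flat feature subset.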
INDEX TERMS
Knowledge discovery in databases, data mining, classification, dimensionality reduction, feature selection, decision trees, information theory, information-theoretic network.
CITATION
Mark Last, Oded Maimon, "A Compact and Accurate Model for Classification," IEEE Transactions on Knowledge and Data Engineering, vol. 16, no. 2, pp. 203-215, February 2004, doi:10.1109/TKDE.2004.1269598