CSDL Home IEEE Transactions on Knowledge & Data Engineering 2003 vol.15 Issue No.06 - November/December
Issue No.06 - November/December (2003 vol.15)
Samuel H. Huang , IEEE
<p><b>Abstract</b>—Knowledge acquisition is the process of collecting domain knowledge, documenting the knowledge, and transforming it into a computerized representation. Due to the difficulties involved in eliciting knowledge from human experts, knowledge acquisition was identified as a bottleneck in the development of knowledge-based system. Over the past decades, a number of automatic knowledge acquisition techniques have been developed. However, the performance of these techniques suffers from the so called <it>curse of dimensionality</it>, i.e., difficulties arise when many irrelevant (or redundant) parameters exist. This paper presents a heuristic approach based on statistics and greedy search for dimensionality reduction to facilitate automatic knowledge acquisition. The approach deals with classification problems. Specifically, Chi-square statistics are used to rank the importance of individual parameters. Then, a backward search procedure is employed to eliminate parameters (less important parameters first) that do not contribute to class separability. The algorithm is very efficient and was found to be effective when applied to a variety of problems with different characteristics.</p>
Curse of dimensionality, feature selection, Chi-square test of independence, greedy search, classification.
Samuel H. Huang, "Dimensionality Reduction in Automatic Knowledge Acquisition: A Simple Greedy Search Approach", IEEE Transactions on Knowledge & Data Engineering, vol.15, no. 6, pp. 1364-1373, November/December 2003, doi:10.1109/TKDE.2003.1245278