Issue No. 05 - May (2006 vol. 18)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2006.70
Wai-Ho Au , IEEE
Andrew K.C. Wong , IEEE
Classification is an important topic in data mining research. To better handle continuous data, fuzzy sets are used to represent interval events in the domains of continuous attributes, allowing continuous data lying on the interval boundaries to partially belong to multiple intervals. Since the membership functions of fuzzy sets can profoundly affect the performance of the models or rules discovered, the determination of membership functions or fuzzy partitioning is crucial. In this paper, we present a new method to determine the membership functions of fuzzy sets directly from data to maximize the class-attribute interdependence and, hence, improve the classification results. In other words, it forms a fuzzy partition of the input space automatically, using an information-theoretic measure to evaluate the interdependence between the class membership and an attribute as the objective function for fuzzy partitioning. To find the optimum of the measure, it employs fractional programming. To evaluate the effectiveness of the proposed method, several real-world data sets are used in our experiments. The experimental results show that this method outperforms other well-known discretization and fuzzy partitioning approaches.
Fuzzy partition, discretization, fuzzy sets, membership functions, classification, data mining.
K. C. Chan, W. Au and A. K. Wong, "A Fuzzy Approach to Partitioning Continuous Attributes for Classification," in IEEE Transactions on Knowledge & Data Engineering, vol. 18, no. , pp. 715-719, 2006.