This Article 
 Bibliographic References 
 Add to: 
A Fuzzy Approach to Partitioning Continuous Attributes for Classification
May 2006 (vol. 18 no. 5)
pp. 715-719
Classification is an important topic in data mining research. To better handle continuous data, fuzzy sets are used to represent interval events in the domains of continuous attributes, allowing continuous data lying on the interval boundaries to partially belong to multiple intervals. Since the membership functions of fuzzy sets can profoundly affect the performance of the models or rules discovered, the determination of membership functions or fuzzy partitioning is crucial. In this paper, we present a new method to determine the membership functions of fuzzy sets directly from data to maximize the class-attribute interdependence and, hence, improve the classification results. In other words, it forms a fuzzy partition of the input space automatically, using an information-theoretic measure to evaluate the interdependence between the class membership and an attribute as the objective function for fuzzy partitioning. To find the optimum of the measure, it employs fractional programming. To evaluate the effectiveness of the proposed method, several real-world data sets are used in our experiments. The experimental results show that this method outperforms other well-known discretization and fuzzy partitioning approaches.

[1] J.C. Bezdek, Pattern Recognition with Fuzzy Objective Function Algorithms. New York: Plenum, 1981.
[2] J.Y. Ching, A.K.C. Wong, and K.C.C. Chan, “Class-Dependent Discretization for Inductive Learning from Continuous and Mixed-Mode Data,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 17, no. 7, pp. 641-651, July 1995.
[3] D.K.Y. Chiu, A.K.C. Wong, and B. Cheung, “Information Discovery through Hierarchical Maximum Entropy Discretization and Synthesis,” Knowledge Discovery in Databases, G. Piatetsky-Shapiro and W.J. Frawley, eds., Menlo Park, Calif. and Cambridge, Mass.: The AAAI/MIT Press, pp. 125-140, 1991.
[4] J. Dougherty, R. Kohavi, and M. Sahami, “Supervised and Unsupervised Discretization of Continuous Features,” Proc. 12th Int'l Conf. Machine Learning, pp. 194-202, 1995.
[5] M. Fajfer and C.Z. Janikow, “Bottom-Up Fuzzy Partitioning in Fuzzy Decision Trees,” Proc. 19th Int'l Conf. North Am. Fuzzy Information Processing Soc., pp. 326-330, 2000.
[6] U.M. Fayyad and K.B. Irani, “Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning,” Proc. 13th Int'l Joint Conf. Artificial Intelligence, pp. 1022-1029, 1993.
[7] D.E. Goldberg, Genetic Algorithms in Search, Optimization, and Machine Learning. Reading, Mass.: Addison-Wesley, 1989.
[8] J.-S. R. Jang, “ANFIS: Adaptive-Network-Based Fuzzy Inference System,” IEEE Trans. Systems, Man, and Cybernetics, vol. 23, no. 3, pp. 665-685, 1993.
[9] C.Z. Janikow and M. Fajfer, “Fuzzy Partitioning with FID3.1,” Proc. 18th Int'l Conf. North Am. Fuzzy Information Processing Soc., pp. 467-471, 1999.
[10] C.L. Karr, “Design of an Adaptive Fuzzy Logic Controller Using a Genetic Algorithm,” Proc. Fourth Int'l Conf. Genetic Algorithms, pp. 450-457, 1991.
[11] T. Kohonen, Self-Organizing Maps, third ed., Berlin: Springer-Verlag, 2001.
[12] L. Liu, A.K. C. Wong, and Y. Wang, “A Global Optimal Algorithm for Class-Dependent Discretization of Continuous Data,” Intelligent Data Analysis, vol. 8, no. 2, pp. 151-170, 2004.
[13] Machine Learning, Neural and Statistical Classification, D. Michie, D.J. Spiegelhalter, and C.C. Taylor, eds. New York: Ellis Horwood, 1994.
[14] A. Paterson and T.B. Niblett, ACLS Manual. Edinburgh: Intelligent Terminals Ltd., 1982.
[15] J.R. Quinlan, C4.5: Programs for Machine Learning. San Mateo, Calif.: Morgan Kaufmann, 1993.
[16] M. Sniedovich, Dynamic Programming. New York: Marcel Dekker, Inc., 1992.
[17] R.E. Walpole and R.H. Myers, Probability and Statistics for Engineers and Scientists, fifth ed. Upper Saddle River, N.J.: Prentice-Hall, 1993.
[18] C.C. Wang and A.K. C. Wong, “Classification of Discrete-Valued Data with Feature Space Transformation,” IEEE Trans. Automatic Control, vol. 24, no. 3, pp. 434-437, 1979.
[19] A.K.C. Wong and D.K.Y. Chiu, “Synthesizing Statistical Knowledge from Incomplete Mixed-Mode Data,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 9, no. 6, pp. 796-805, 1987.
[20] A.K.C. Wong and T.S. Liu, “Typicality, Diversity, and Feature Patterns of an Ensemble,” IEEE Trans. Computers, vol. 24, no. 2, pp. 158-181, 1975.

Index Terms:
Fuzzy partition, discretization, fuzzy sets, membership functions, classification, data mining.
Wai-Ho Au, Keith C.C. Chan, Andrew K.C. Wong, "A Fuzzy Approach to Partitioning Continuous Attributes for Classification," IEEE Transactions on Knowledge and Data Engineering, vol. 18, no. 5, pp. 715-719, May 2006, doi:10.1109/TKDE.2006.70
Usage of this product signifies your acceptance of the Terms of Use.