This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
An Efficient Inductive Learning Method for Object-Oriented Database Using Attribute Entropy
December 1996 (vol. 8 no. 6)
pp. 946-951

Abstract—The data-driven characteristic of Version Space works efficiently in memory even if the training set is enormous. However, the concept hierarchy of each attribute used to generalize/specialize the hypothesis of S/G-set is processed sequentially and instance-by-instance, which degrades its performance. As for ID3, the decision tree is generated from the order of attributes according to their entropies to reduce the number of attributes in some of the tree paths. Unlike Version Space, ID3 generates an extremely complex decision tree when the training set is enormous. Therefore, we propose a method, AGE, taking advantages of Version Space and ID3 to learn rules from object-oriented databases (OODB) with the least number of learning features according to the entropy. By simulations, we found the performance of our learning algorithm is better than both Version Space and ID3. Furthermore, AGE's time complexity and space complexity are both linear to the number of training instances.

[1] P.R. Cohen and E.A. Feigenbaum, "Learning and Inductive Inference," The Handbook of Artificial Intelligence, vol. III, Heuristic Press and William Kaufman, pp. 325-400, 1983.
[2] T.M. Mitchell, "An Analysis of Generalization as a Search Problem," Proc. Int'l Joint Conf. Artificial Intelligence, pp. 557-582, 1979.
[3] T.M. Mitchell, "Version Space: A Candidate Elimination Approach to Rule Learning," Proc. Int'l Joint Conf. Artificial Intelligence, pp. 305-310, 1977.
[4] Y. Cai, N. Cercone, and J. Han, "An Attribute-Oriented Approach for Learning Classification Rules from Relational Databases," Proc. IEEE Int'l Conf. Data Eng., pp. 281-288, 1990.
[5] R. Yasdi, "Learning Classification Rules from Database in the Context of Knowledge Acquisition and Representation," IEEE Trans. Knowledge and Data Eng., pp. 293-306, Sept. 1991.
[6] J. Han, Y. Cai, and N. Cercone, "Data-Driven Discovery of Quantitative Rules in Relational Databases," IEEE Trans. Knowledge and Data Eng., pp. 29-40, Feb. 1993.
[7] J. Migers, "An Empirical Comparison of Selection Measures for Decision-Tree Induction," Machine Learning, vol. 3, pp. 319-342, 1989.
[8] T.P. Hong and S.S. Tseng, "Models of Parallel Learning Systems," IEEE Int'l Conf. Distributed Computing Systems, pp. 125-132, 1991.
[9] J.R. Quinlan,"Induction of decision trees," Machine Learning, vol. 1, pp. 81-106, 1986.
[10] P. Brazdil and I. Bruha, "A Method of Processing Unknown Attribute Values by ID3," Proc. Fourth Int'l Conf. Computing and Information (ICCI), pp. 253-256, 1992.
[11] P.E. Utgoff, "Incremental Induction of Decision Trees," Machine Learning, vol. 4, no. 2, pp. 81-106, 1989.
[12] S.H. Lin and Y.M. Huang, "The Design of a Learning Algorithm for Object-Oriented Data Model," Nat'l Computer Symp.,Taiwan, pp. 832-839, 1993.
[13] P.W. Baim, "A Method for Attribute Selection in Inductive Learning Systems," IEEE Trans. Pattern Analysis and Machine Intelligence, pp. 888-896, Nov. 1988.
[14] G. J. Klir and T. A. Folger,Fuzzy sets, Uncertainty, and Information. Englewood Cliffs, NJ: Prentice-Hall, 1988.
[15] O. Dexu et al., "The Story of O2," IEEE Trans. Knowledge and Data Eng., pp. 91-108, Mar. 1990.
[16] F. Bancilhon, "Object-Oriented Database Systems," Proc. ACM SIGMOD Int'l Conf., pp. 152-162, 1988.
[17] J. Banerjee, W. Kim, and K.C. Kim, "Queries in Object-Oriented Databases," Proc. IEEE Int'l Conf. Data Eng., pp. 31-38, 1988.
[18] W. Kim, N. Ballou, H.T. Chou, J.F. Garza, and D. Woelk, "Features of the ORION Object-Oriented Database System," Object-Oriented Concepts, Databases, and Applications, Addison Wesley, pp. 251-282, 1989.
[19] R.E. Blahut, Principles and Practice of Information Theory. Addison Wesley, 1987.
[20] C.L. Forgy, "Rete: A Fast Algorithm for the Many Pattern/Many Object Pattern Match Problem," Artificial Intelligence, North-Holland, pp. 17-37, 1982.

Index Terms:
Inductive learning, Version Space, ID3, entropy, object-oriented database.
Citation:
Yueh-Min Huang, Shian-Hua Lin, "An Efficient Inductive Learning Method for Object-Oriented Database Using Attribute Entropy," IEEE Transactions on Knowledge and Data Engineering, vol. 8, no. 6, pp. 946-951, Dec. 1996, doi:10.1109/69.553161
Usage of this product signifies your acceptance of the Terms of Use.