This Article 
 Bibliographic References 
 Add to: 
Finding Interesting Patterns Using User Expectations
November/December 1999 (vol. 11 no. 6)
pp. 817-832

Abstract—One of the major problems in the field of knowledge discovery (or data mining) is the interestingness problem. Past research and applications have found that, in practice, it is all too easy to discover a huge number of patterns in a database. Most of these patterns are actually useless or uninteresting to the user. But due to the huge number of patterns, it is difficult for the user to comprehend them and to identify those interesting to him/her. To prevent the user from being overwhelmed by the large number of patterns, techniques are needed to rank them according to their interestingness. In this paper, we propose such a technique, called the user-expectation method. In this technique, the user is first asked to provide his/her expected patterns according to his/her past knowledge or intuitive feelings. Given these expectations, the system uses a fuzzy matching technique to match the discovered patterns against the user's expectations, and then rank the discovered patterns according to the matching results. A variety of rankings can be performed for different purposes, such as to confirm the user's knowledge and to identify unexpected patterns, which are by definition interesting. The proposed technique is general and interactive.

[1] R. Agrawal, T. Imielinski, and A. Swami, Database Mining: A Performance Perspective IEEE Trans. Knowledge and Data Eng., vol. 5, no. 6, Dec. 1993.
[2] Fuzzy Models for Pattern Recognition: Methods that Search for Patterns in Data, J.C. Bezdek and S.K. Pal, eds., IEEE Press, 1992.
[3] I. Bhandari, “Attribute Focusing: Machine-Assisted Knowledge Discovery Applied to Software Production Process Control,” Proc. AAAI-93 Workshop Knowledge Discovery in Databases, 1993.
[4] Readings in Knowledge Acquisition and Learning, B. Buchanan and D. Wilkins, eds., Morgan Kaufmann, 1993.
[5] V. Dhar and A. Tuzhilin, "Abstract-Driven Pattern Discovery in Databases," IEEE Trans. Knowledge and Data Engineering, vol. 5, no. 6, 1993.
[6] J. Durkin, Expert Systems: Design and Development. Prentice Hall, 1994.
[7] U.M. Fayyad, G. Piatesky-Shapiro, and P. Smith, “From Data Mining to Knowledge Discovery: An Overview,” Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, U.M. Fayyad et al., eds., pp. 1-34, 1996.
[8] W.J. Frawley, G. Piatetsky-Shapiro, and C.J. Matheus, “Knowledge Discovery in Databases: An Overview,” Knowledge Discovery in Databases, G. Piatetsky-Shapiro and W.J Frawley, eds., Cambridge, Mass.: AAAI/MIT Press, pp. 1-27, 1991.
[9] J. Giarratano and G. Riley, Expert Systems: Principles and Programming. PWS Publishing, 1994.
[10] J. Han, Y. Cai, and N. Cercone, "Data-Driven Discovery of Quantitative Rules in Relational Databases," IEEE Trans. Knowledge and Data Eng., pp. 29-40, Feb. 1993.
[11] J. Hong and C. Mao, “Incremental Discovery of Rules and Structure by Hierarchical and Parallel Clustering,” Knowledge Discovery in Databases, G. Piatetsky-Shapiro and W.J Frawley, eds., Cambridge, Mass.: AAAI/MIT Press, 1991.
[12] Fuzzy Logic and Control, M. Jamshidi, N. Vadiee, and T. Ross, eds. Prentice Hall, 1993.
[13] M. Kamber and R. Shinghal, “Evaluating the Interestingness of Characteristic Rules,” Proc. KDD '96, pp. 263-266, 1996.
[14] Fuzzy Expert Systems, A. Kandel, ed. Boca Raton, Fla.: CRC Press, 1991.
[15] A. Kandel, Fuzzy Techniques in Pattern Recognition. Wiley, 1982.
[16] M. Klemettinen, H. Mannila, P. Ronkainen, H. Toivonen, and A.I. Verkamo, “Finding Interesting Rules from Large Sets of Association Rules,” Proc. Third Int'l Conf. Information and Knowledge Management, N.R. Adam, K.B. Bhargava, and Y. Yesha, eds. pp. 401-407, 1994.
[17] G.J. Klir and B. Yuan, Fuzzy Sets and Fuzzy Logic, Theory and Applications, Prentice-Hall, Englewood Cliffs, N.J., 1995.
[18] G. J. Klir and T. A. Folger,Fuzzy sets, Uncertainty, and Information. Englewood Cliffs, NJ: Prentice-Hall, 1988.
[19] R. Kruse, E. Schwecke, and J. Heinsohn, Uncertainty and Vagueness in Knowledge Based Systems: Numerical Methods. Springer-Verlag, 1991.
[20] W. Klosgen, “Problems for Knowledge Discovery in Databases and Their Treatment in the Statistics Interpreter Explora,” Int'l J. Intelligent Systems, vol. 7, no. 7, pp. 649-673, 1992.
[21] D. Li and D. Liu, A Fuzzy Prolog Database System, Taunton, England: Research Studies Press, 1990.
[22] B. Liu and W. Hsu, “Post-Analysis of Learned Rules,” Proc. AAAI '96, 13th Nat'l Conf. Artificial Intelligence, Portland, Ore., pp. 828-834, 1996.
[23] B. Liu, W. Hsu, and S. Chen, “Using General Impressions to Analyze Discovered Classification Rules,” Proc. KDD '97, pp. 31-36, 1997.
[24] J.A. Major and J.J. Mangano, “Selecting Among Rules Induced from a Hurricane Database,” Proc. AAAI 93, Workshop Knowledge Discovery in Databases, pp. 28-41, 1993.
[25] C.J. Matheus, P.K. Chan, and G. Piatetsky-Shapiro, “Systems for Knowledge Discovery in Databases,” IEEE Trans. Knowledge and Data Eng., vol. 5, no. 6, 1993.
[26] C.J. Matheus, G. Piatetsky-Shapiro, and D. McNeill, “An Application of KEFIR to the Analysis of Healthcare Information,” Proc. AAAI '94 Workshop Knowledge Discovery in Databases, 1994.
[27] S. Miyamoto, Fuzzy Sets in Information Retrieval and Cluster Analysis. Kluwer, 1990.
[28] G. Piatetsky-Shapiro and C.J. Matheus, “The Interestingness of Deviations,” Proc. AAAI '94, Workshop Knowledge Discovery in Databases, pp. 25-36, 1994.
[29] G. Piatetsky-Shapiro, C. Matheus, P. Smyth, and R. Uthurusamy, “Proc. KDD '93: Progress and Challenges,” AI Magazine, pp. 77-87, Fall 1994.
[30] J.R. Quinlan, C4.5: Programs for Machine Learning,San Mateo, Calif.: Morgan Kaufman, 1992.
[31] S.J. Russell and P. Norvig, Artificial Intelligence: A Modern Approach, Prentice Hall, Upper Saddle River, N.J., 1994.
[32] A. Silberschatz and A. Tuzhilin, “On Subjective Measures of Interestingness in Knowledge Discovery,” Proc. First Int'l Conf. Knowledge Discovery and Data Mining, pp. 275-281, 1995.
[33] A. Silberschatz and A. Tuzhilin, "What Makes Patterns Interesting in Knowledge Discovery Systems," IEEE Trans. Knowledge and Data Engineering, Dec. 1996, pp. 970-974.
[34] Industrial Applications of Fuzzy Control, M. Sugeno, ed. North-Holland, 1985.
[35] A. Tuzhilin, “A Pattern Discovery Algebra,” Proc. SIGMOD Workshop Research Issues in Data Mining and Knowledge Discovery, pp. 71-76, 1997.
[36] R. Uthurusamy, U.M. Fayyad, and S. Spangler, “Learning Useful Rules from Inconclusive Data,” Knowledge Discovery in Databases, G. Piatetsky-Shapiro and W.J Frawley, eds. AAAI/MIT Press, 1991.
[37] H.-J. Zimmermann, Fuzzy Set Theory and Its Applications, Kluwer, Boston, 1991.
[38] R. Camposano and W. Rosensteil, "Synthesizing Circuits from Behavioral Descriptions," IEEE Trans. CAD, Vol. CAD-8, Feb. 1989, pp. 171-180.

Index Terms:
Knowledge discovery, interesting patterns, unexpectedness, post-analysis of patterns, pattern ranking.
Bing Liu, Wynne Hsu, Lai-Fun Mun, Hing-Yan Lee, "Finding Interesting Patterns Using User Expectations," IEEE Transactions on Knowledge and Data Engineering, vol. 11, no. 6, pp. 817-832, Nov.-Dec. 1999, doi:10.1109/69.824588
Usage of this product signifies your acceptance of the Terms of Use.