Issue No.06 - June (2010 vol.22)
pp: 755-769
Longbing Cao , University of Technology, Sydney
ABSTRACT
Traditional data mining research mainly focuses on developing, demonstrating, and pushing the use of specific algorithms and models. The process of data mining stops at pattern identification. Consequently, it is widely observed that 1) many algorithms have been designed, of which very few are repeatable and executable in the real world, 2) many patterns are often mined, but a major proportion of them are either commonsense or of no particular interest to business, and 3) end users generally cannot easily understand the patterns or take them over for business use. In summary, the findings are not actionable and lack soft power in solving real-world complex problems. Thorough efforts are essential for promoting the actionability of knowledge discovery in real-world smart decision making. To this end, domain-driven data mining (D^3M) has been proposed to tackle the above issues and to promote the paradigm shift from “data-centered knowledge discovery” to “domain-driven, actionable knowledge delivery.” In D^3M, ubiquitous intelligence is incorporated into the mining process and models, and a corresponding problem-solving system is formed as the space for knowledge discovery and delivery. Based on our related work, this paper presents an overview of the driving forces, theoretical frameworks, architectures, techniques, case studies, and open issues of D^3M. D^3M exposes many critical issues for which no thorough and mature solutions are available yet, which indicates both the challenges and the prospects of this new topic.
INDEX TERMS
Data mining, domain-driven data mining (D^3M), actionable knowledge discovery and delivery.
CITATION
Longbing Cao, "Domain-Driven Data Mining: Challenges and Prospects", IEEE Transactions on Knowledge & Data Engineering, vol.22, no. 6, pp. 755-769, June 2010, doi:10.1109/TKDE.2010.32