• Publication
  • 2010
  • Issue No. 6 - June
  • Abstract - Domain-Driven Classification Based on Multiple Criteria and Multiple Constraint-Level Programming for Intelligent Credit Scoring
 This Article 
 Bibliographic References 
 Add to: 
Domain-Driven Classification Based on Multiple Criteria and Multiple Constraint-Level Programming for Intelligent Credit Scoring
June 2010 (vol. 22 no. 6)
pp. 826-838
Jing He, Victoria University, Australia
Yanchun Zhang, Victoria University, Australia
Yong Shi, Chinese Academy of Sciences, China
Guangyan Huang, Victoria University, Australia
Extracting knowledge from the transaction records and the personal data of credit card holders has great profit potential for the banking industry. The challenge is to detect/predict bankrupts and to keep and recruit the profitable customers. However, grouping and targeting credit card customers by traditional data-driven mining often does not directly meet the needs of the banking industry, because data-driven mining automatically generates classification outputs that are imprecise, meaningless, and beyond users' control. In this paper, we provide a novel domain-driven classification method that takes advantage of multiple criteria and multiple constraint-level programming for intelligent credit scoring. The method involves credit scoring to produce a set of customers' scores that allows the classification results actionable and controllable by human interaction during the scoring process. Domain knowledge and experts' experience parameters are built into the criteria and constraint functions of mathematical programming and the human and machine conversation is employed to generate an efficient and precise solution. Experiments based on various data sets validated the effectiveness and efficiency of the proposed methods.

[1] L.C. Thomas, D.B. Edelman, and J.N. Crook, Credit Scoring and Its Applications. SIAM, 2002.
[2] L. Cao and C. Zhang, "Domain-Driven Actionable Knowledge Discovery in the Real World," Proc. PAKDD, pp. 821-830, 2006.
[3] L. Cao and C. Zhang, "Knowledge Actionability: Satisfying Technical and Business Interestingness," Int'l J. Business Intelligence and Data Mining, vol. 2, no. 4, pp. 496-514, 2007.
[4] J.R. Quilan, "Induction of Decision Trees," Machine Learning, vol. 1, pp. 81-106, 1986.
[5] R. Rymon, "An SE-Tree Based Characterization of the Induction Problem," Proc. Int'l Conf. Machine Learning, 1993.
[6] D. Martens, M. De Backer, R. Haesen, J. Vanthienen, M. Snoeck, and B. Baesens, "Classification with Ant Colony Optimization," IEEE Trans. Evolutionary Computation, vol. 11, no. 5, pp. 651-665, Oct. 2007.
[7] D. Martens, B.B. Baesens, and T. Van Gestel, "Decomposition Rule Extraction from Support Vector Machines by Active Learning," IEEE Trans. Knowledge and Data Eng., vol. 21, no. 2, pp. 178-191, Dec. 2008.
[8] D. Pedro and M. Pazzani, "On the Optimality of the Simple Bayesian Classifier under Zero-One Loss," Machine Learning, vol. 29, pp. 103-137, 1997.
[9] Nearest Neighbor (NN) Norms: NN Pattern Classification Techniques, B.V. Dasarathy, ed. IEEE Computer Society Press, 1991.
[10] G. Guo, "CR Dyer, Learning from Examples in the Sample Case: Face Expression Recognition," IEEE Trans. Systems, Man, and Cybernetics, Part B, vol. 35, no. 3, pp. 477-488, June 2005.
[11] W.A. Chaovalitwon Se, Y.-J. Fan, and R.C. Sachdeo, "Support Feature Machine for Classification of Abnormal Brain Activity," Proc. ACM SIGMOD, pp. 113-122, 2007.
[12] B.P. Rachel, T. Shlomo, R. Alex, L. Anna, and K. Patrick, "Multiplex Assessment of Serum Biomarker Concentrations in Well-Appearing Children with Inflicted Traumatic Brain Injury," Pediatric Research, vol. 65, no. 1, pp. 97-102, 2009.
[13] S. Olafsson, X. Li, and S. Wu, "Operations Research and Data Mining," European J. Operational Research, vol. 187, no. 3, pp. 1429-1448, 2008.
[14] A. Benos and G. Papanastasopoulos, "Extending the Merton Model: A Hybrid Approach to Assessing Credit Quality," Math. and Computer Modelling, vol. 48, pp. 47-68, 2007.
[15] H. Guo and S.B. Gelfand, "Classification Trees with Neural Network Feature Extraction," IEEE Trans. Neural Networks, vol. 3, pp. 923-933, 1992.
[16] L. Yu, S. Wang, and K. Lai, "Credit Risk Assessment with a Multistage Neural Network Ensemble Learning Approach," Expert Systems with Applications, vol. 34, no. 2, pp. 1434-1444, 2008.
[17] T. Joachims, "Making Large-Scale SVM Learning Practical," Advances in Kernel Methods: Support Vector Learning, B. Schol Lkopf, C. Burges, and A. Smola, eds., MIT-Press, 1999.
[18] Y. Wang, S. Wang, and K.K. Lai, "A New Fuzzy Support Vector Machine to Evaluate Credit Risk," IEEE Trans. Fuzzy Systems, vol. 13, no. 6, pp. 820-831, Dec. 2005.
[19] N. Freed and F. Glover, "Simple but Powerful Goal Programming Models for Discriminant Problems," European J. Operational Research, vol. 7, pp. 44-60, 1981.
[20] J. Pacheco, S. Casado, and L. Nunez, "Use of VNS and TS in Classification: Variable Selection and Determination of the Linear Discrimination Function Coefficients," IMA J. Management Math., vol. 18, no. 2, pp. 191-206, 2007.
[21] J. Zhang, Y. Shi, and P. Zhang, "Several Multi-Criteria Programming Methods for Classification," Computers and Operations Research, vol. 36, no. 3, pp. 823-836, 2009.
[22] Y. Shi, Multiple Criteria and Multiple Constraint Levels Linear Programming: Concepts, Techniques and Applications. World Scientific, 2001.
[23] http://www.fico.com/en/Pagesdefault.aspx , 2010.
[24] http:/www.firstdata.com/, 2010.
[25] J. Han and M. Kamber, Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, 2003.
[26] J. He, X. Liu, Y. Shi, W. Xu, and N. Yan, "Classifications of Credit Cardholder Behavior by Using Fuzzy Linear Programming," Int'l J. Information Technology and Decision Making, vol. 3, no. 4, pp. 633-650, 2004.
[27] J. Quinlan See5.0., http://www.rulequest.comsee5-info.html, 2004.
[28] Y. Peng, G. Kou, Y. Shi, and Z. Chen, "A Multi-Criteria Convex Quadratic Programming Model for Credit Data Analysis," Decision Support Systems, vol. 44, pp. 1016-1030, 2008.
[29] N. Freed and F. Glover, "Evaluating Alternative Linear Programming Models to Solve the Two-Group Discriminant Analysis Formulations," Decision Sciences, vol. 17, pp. 151-162, 1986.
[30] N. Freed and F. Glover, "Resolving Certain Difficulties and Improving Classification Power of LP Discriminant Analysis Formulations," Decision Sciences, vol. 17, pp. 589-595, 1986.
[31] S.I. Gass, Linear Programming. McGraw-Hill, 1985.
[32] H.J. Zimmermann, Fuzzy Set Theory and Its Application. Kluwer Academic Publisher, 1991.
[33] A. Sengupta, T. Kumar Pal, and D. Chakraborty, "Interpretation of Inequality Constraints Involving Interval Coefficients and a Solution to Interval Linear Programming," Fuzzy Sets and Systems, vol. 119, pp. 129-138, 2001.
[34] H.J. Zimmermann, "Fuzzy Programming and Linear Programming with Several Objective," Fuzzy Sets and Systems, vol. 13, no. 1, pp. 45-55, 1978.
[35] F. Gyetvan and Y. Shi, "Weak Duality Theorem and Complementary Slackness Theorem for Linear Matrix Programming Problems," Operations Research Letters, vol. 11, pp. 249-252, 1992.
[36] C.Y. Chianglin, T. Lai, and P. Yu, "Linear Programming Models with Changeable Parameters-Theoretical Analysis on, Taking Loss at the Ordering Time and Making Profit at the Delivery Time," Int'l J. Information Technology & Decision Making, vol. 6, no. 4, pp. 577-598, 2007.
[37] http:/www.lindo.com/, 2010.
[38] P.M. Murphy and D.W. Aha, UCI Repository of Machine Learning Databases, www.ics.uci.edu/melearnMLRepository.html , 1992.
[39] Y. Shi, J. He, L. Wang, and W. Fan, "Computer-Based Algorithms for Multiple Criteria and Multiple Constraint Level Integer Linear Programming," Computers and Math. with Applications, vol. 49, nos. 5-6, pp. 903-921, 2005.
[40] D.R. Musicant, NDC: Normally Distributed Clustered Data Sets, www.cs.wise.edu/musicant/datandc/, 1998.
[41] T. Joachims, SVM-Light: Support Vector Machine, http:/svmlight.joachims.org/, 2004.
[42] http:/www.mathworks.com/, 2010.
[43] SAS, http:/www.sas.com, 2010.
[44] H.A. Taha, Operations Research: An Introduction, eighth ed., Prentice Hall, 2007.
[45] L. Seiford and P.L. Yu, "Potential Solutions of Linear Systems: The Multi-Criteria Multiple Constriant Level Program," J. Math. Analysis and Application, vol. 69, pp. 283-303, 1979.
[46] J. Philip, "Algorithms for the Vector Maximization Problem," Math. Programming, vol. 2, pp. 207-229, 1972.
[47] P.L. Yu and M. Zeleny, "The Set of All Nondominated Solutions in the Linear Cases and a Muticriteria Simplex Method," J. Math. Analysis and Applications, vol. 49, pp. 430-458, 1975.
[48] D.R. Musicant, NDC Description, http://www.cs.wisc.edu/dmi/svm/ndccontent.html , 2010.

Index Terms:
Credit scoring, domain-driven classification, mathematical programming, multiple criteria and multiple constraint-level programming, fuzzy programming, satisfying solution.
Jing He, Yanchun Zhang, Yong Shi, Guangyan Huang, "Domain-Driven Classification Based on Multiple Criteria and Multiple Constraint-Level Programming for Intelligent Credit Scoring," IEEE Transactions on Knowledge and Data Engineering, vol. 22, no. 6, pp. 826-838, June 2010, doi:10.1109/TKDE.2010.43
Usage of this product signifies your acceptance of the Terms of Use.