11th International Symposium on Software Reliability Engineering (ISSRE'00)
Improving Tree-Based Models of Software Quality with Principal Components Analysis
San Jose, California
October 08-October 11
ISBN: 0-7695-0807-3
Software-quality classification models can predict which modules will be considered fault-prone, or not, based on software product metrics, process metrics, and execution metrics. Such predictions can be used to target improvement efforts to those modules that need it the most. Classification-tree modeling is a robust technique for building such software quality models. However, model structure maybe unstable and accuracy may suffer when predictors are highly correlated. This paper presents an empirical case study of four releases of a very large telecommunications system, which showed that the tree-based models could be improved by transforming the predictors with principal components analysis, so that transformed predictors are not correlated. The case study used the regression-tree algorithm in the S-Plus package and then applied our general decision rule to classify modules.
Index Terms:
software quality, software metrics, fault-prone modules, classification trees, S-Plus, principal components analysis
Citation:
Taghi M. Khoshgoftaar, Ruqun Shan, Edward B. Allen, "Improving Tree-Based Models of Software Quality with Principal Components Analysis," issre, pp.198, 11th International Symposium on Software Reliability Engineering (ISSRE'00), 2000