First International Symposium on Empirical Software Engineering and Measurement (ESEM 2007) Comparison of Outlier Detection Methods in Fault-proneness Models Madrid, Spain September 20-September 21 ISBN: 0-7695-2886-4
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ESEM.2007.83
In this paper, we experimentally evaluated the effect of outlier detection methods to improve the prediction performance of fault-proneness models. Detected outliers were removed from a fit dataset before building a model. In the experiment, we compared three outlier detection methods (Mahalanobis outlier analysis (MOA), local outlier factor method (LOFM) and rule based modeling (RBM)) each applied to three well-known fault-proneness models (linear discriminant analysis (LDA), logistic regression analysis (LRA) and classification tree (CT)). As a result, MOA and RBM improved F1-values of all models (0.04 at minimum, 0.17 at maximum and 0.10 at mean) while improvements by LOFM were relatively small (-0.01 at minimum, 0.04 at maximum and 0.01 at mean).
Citation:
Shinsuke Matsumoto, Yasutaka Kamei, Akito Monden, Ken-ichi Matsumoto, "Comparison of Outlier Detection Methods in Fault-proneness Models," esem, pp.461-463, First International Symposium on Empirical Software Engineering and Measurement (ESEM 2007), 2007 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||