2015 29th Brazilian Symposium on Software Engineering (SBES) (2015)
Belo Horizonte-MG, Brazil
Sept. 21, 2015 to Sept. 26, 2015
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/SBES.2015.21
Change propagation occurs when a change in an artifact leads to changes in other artifacts. Previous research has used frequency of past changes between artifacts and different types of artifacts coupling to build prediction models of change propagation. To improve the accuracy of the prediction, we explored the combination of different data from software development repository, such as change requests, communication data, and artifacts modifications. This information can capture different dimensions of software development, what can lead to improvements on the accuracy of the models. We conducted an empirical study in four open source projects, namely Cassandra, Camel, Hadoop, and Lucene. Classifiers were constructed for each pair of artifacts that change together to predict if the change propagation between two files occurs in a certain change request. The models obtained values of area under the curve (AUC) of 0.849 on average. Furthermore, the sensitivity (recall) obtained is almost 4 times higher (57.06% vs. 15.70%) when compared our models to a baseline model built using association rules. With a reduced number of false positives, the models could be used in practice to help developers during software evolution.
Software, Context modeling, Accuracy, Software engineering, Couplings, Predictive models, Sensitivity
I. S. Wiese, R. Re, I. Steinmacher, R. T. Kuroda, G. A. Oliva and M. A. Gerosa, "Predicting Change Propagation from Repository Information," 2015 29th Brazilian Symposium on Software Engineering (SBES), Belo Horizonte-MG, Brazil, 2015, pp. 100-109.