2008 Seventh International Conference on Machine Learning and Applications An Application of Latent Dirichlet Allocation to Analyzing Software Evolution December 11-December 13 ISBN: 978-0-7695-3495-4
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICMLA.2008.47
We develop and apply unsupervised statistical topic models, in particular Latent Dirichlet Allocation, to identify functional components of source code and study their evolution over multiple project versions. We present results for two large, open source Java projects, Eclipse and Argo UML, which are well-known and well-studied within the software mining community. Our results demonstrate the effectiveness of probabilistic topic models in automatically summarizing the temporal dynamics of software concerns, with direct application to project management and program understanding.
Index Terms:
topic models, software evolution, software mining, latent dirichlet allocation
Citation:
Erik Linstead, Cristina Lopes, Pierre Baldi, "An Application of Latent Dirichlet Allocation to Analyzing Software Evolution," icmla, pp.813-818, 2008 Seventh International Conference on Machine Learning and Applications, 2008 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||