loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2008 Seventh International Conference on Machine Learning and Applications
An Application of Latent Dirichlet Allocation to Analyzing Software Evolution
December 11-December 13
ISBN: 978-0-7695-3495-4
We develop and apply unsupervised statistical topic models, in particular Latent Dirichlet Allocation, to identify functional components of source code and study their evolution over multiple project versions. We present results for two large, open source Java projects, Eclipse and Argo UML, which are well-known and well-studied within the software mining community. Our results demonstrate the effectiveness of probabilistic topic models in automatically summarizing the temporal dynamics of software concerns, with direct application to project management and program understanding.
Index Terms:
topic models, software evolution, software mining, latent dirichlet allocation
Citation:
Erik Linstead, Cristina Lopes, Pierre Baldi, "An Application of Latent Dirichlet Allocation to Analyzing Software Evolution," icmla, pp.813-818, 2008 Seventh International Conference on Machine Learning and Applications, 2008
Usage of this product signifies your acceptance of the Terms of Use.