29th International Conference on Software Engineering (ICSE'07)
Detection of Duplicate Defect Reports Using Natural Language Processing
Minneapolis, Minnesota
May 20-26, 2007
ISBN: 0-7695-2828-7
Per Runeson, Lund University, Sweden
Magnus Alexandersson, Lund University, Sweden
Oskar Nyholm, Lund University, Sweden
Defect reports are generated from various testing and development activities in software engineering. Sometimes two reports are submitted that describe the same problem, leading to duplicate reports. These reports are mostly written in structured natural language, and as such, it is hard to compare two reports for similarity with formal methods. We investigate using Natural Language Processing (NLP) techniques to support the identification of duplicates. A prototype tool is developed and evaluated in a case study analyzing defect reports at Sony Ericsson Mobile Communications. The evaluation shows that about 2/3 of the duplicates can possibly be found using the NLP techniques. Different variants of the techniques provide only minor differences in results, indicating a robust technology. User testing shows that the overall attitude towards the technique is positive and that it has growth potential.
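The abstract does not say which NLP techniques the prototype uses. As a rough illustration of the general idea only, the sketch below assumes a simple bag-of-words representation with cosine similarity, which is one common way to rank existing reports against a new one. The function names, the stop-word list, and the report IDs and texts are all hypothetical and are not taken from the paper or from the Sony Ericsson data.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list, for illustration only.
STOP_WORDS = {"a", "an", "the", "is", "in", "on", "of", "to", "and", "when", "it"}


def tokenize(text):
    """Lowercase the text, split it into alphanumeric tokens, drop stop words."""
    return [t for t in re.findall(r"[a-z0-9]+", text.lower()) if t not in STOP_WORDS]


def cosine_similarity(a, b):
    """Cosine of the angle between two term-frequency vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty report is treated as similar to nothing
    return dot / (norm_a * norm_b)


def rank_candidates(new_report, existing_reports, top_n=3):
    """Return up to top_n (report_id, score) pairs, most similar first."""
    query = Counter(tokenize(new_report))
    scored = [
        (report_id, cosine_similarity(query, Counter(tokenize(text))))
        for report_id, text in existing_reports.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


# Invented example reports (assumption: not real defect data).
reports = {
    "DR-1": "Phone crashes when opening the camera application",
    "DR-2": "Battery drains quickly in standby mode",
    "DR-3": "Camera application crashes the phone on startup",
}
ranking = rank_candidates("Camera application crashes on startup", reports)
for report_id, score in ranking:
    print(report_id, round(score, 3))
```

A tool built along these lines would present the top-ranked reports to the person filing the defect, who then decides whether any of them is a true duplicate. The ranking only suggests candidates.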
Citation:
Per Runeson, Magnus Alexandersson, Oskar Nyholm, "Detection of Duplicate Defect Reports Using Natural Language Processing," 29th International Conference on Software Engineering (ICSE'07), pp. 499-510, 2007.