The Community for Technology Leaders
2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER) (2017)
Klagenfurt, Austria
Feb. 20, 2017 to Feb. 24, 2017
ISBN: 978-1-5090-5502-9
pp: 382-386
Preetha Chatterjee , University of Delaware, Newark, USA
Manziba Akanda Nishi , Virginia Commonwealth University, Richmond, USA
Kostadin Damevski , Virginia Commonwealth University, Richmond, USA
Vinay Augustine , ABB Corporate Research, Raleigh, NC, USA
Lori Pollock , University of Delaware, Newark, USA
Nicholas A. Kraft , ABB Corporate Research, Raleigh, NC, USA
ABSTRACT
A large corpora of software-related documents is available on the Web, and these documents offer the unique opportunity to learn from what developers are saying or asking about the code snippets that they are discussing. For example, the natural language in a bug report provides information about what is not functioning properly in a particular code snippet. Previous research has mined information about code snippets from bug reports, emails, and Q&A forums. This paper describes an exploratory study into the kinds of information that is embedded in different software-related documents. The goal of the study is to gain insight into the potential value and difficulty of mining the natural language text associated with the code snippets found in a variety of software-related documents, including blog posts, API documentation, code reviews, and public chats.
INDEX TERMS
Natural languages, Computer bugs, Blogs, Documentation, Data mining, Software, Context
CITATION
Preetha Chatterjee, Manziba Akanda Nishi, Kostadin Damevski, Vinay Augustine, Lori Pollock, Nicholas A. Kraft, "What information about code snippets is available in different software-related documents? An exploratory study", 2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER), vol. 00, no. , pp. 382-386, 2017, doi:10.1109/SANER.2017.7884638
91 ms
(Ver 3.3 (11022016))