2017 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE) (2017)
Urbana, IL, USA
Oct. 30, 2017 to Nov. 3, 2017
ISBN: 978-1-5386-3976-4
pp: 123-134
Zeqi Lin, Key Laboratory of High Confidence Software Technologies, Ministry of Education, Beijing, China, 100871
Yanzhen Zou, Key Laboratory of High Confidence Software Technologies, Ministry of Education, Beijing, China, 100871
Junfeng Zhao, Key Laboratory of High Confidence Software Technologies, Ministry of Education, Beijing, China, 100871
Bing Xie, Key Laboratory of High Confidence Software Technologies, Ministry of Education, Beijing, China, 100871
ABSTRACT
A large software project usually has many textual learning resources about its API, such as tutorials, mailing lists, and user forums. Text retrieval technology allows developers to search these API learning resources for related documents using free-text queries, but it suffers from the lexical gap between search queries and documents. In this paper, we propose a novel approach for improving the retrieval of API learning resources by leveraging software-specific conceptual knowledge in software source code. The basic idea behind this approach is that the semantic relatedness between queries and documents can be measured according to the software-specific concepts involved in them, and that software source code contains a large amount of software-specific conceptual knowledge. Specifically, we first extract an API graph from software source code and use it as software-specific conceptual knowledge. We then discover the API entities involved in queries and documents, and infer semantic document relatedness by analyzing the structural relationships between these API entities. We evaluate our approach on three popular open source software projects. Compared with state-of-the-art text retrieval approaches, our approach leads to an improvement of at least 13.77% in mean average precision (MAP).
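For readers unfamiliar with the evaluation metric named in the abstract, the following is a minimal sketch of how mean average precision (MAP) is conventionally computed for ranked retrieval results. It follows the standard textbook definition and is not taken from the paper; the function names and the document identifiers in the usage note are hypothetical, and the paper's exact evaluation setup may differ.

```python
from typing import Iterable, Sequence, Set, Tuple


def average_precision(ranked: Sequence[str], relevant: Set[str]) -> float:
    """Average precision for one query (standard definition).

    `ranked` is the retrieved document list in rank order; `relevant` is the
    set of documents judged relevant for the query. Both are illustrative
    inputs, not data from the paper.
    """
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            # Precision at this rank, counted only where a relevant doc appears.
            precision_sum += hits / rank
    # Normalize by the number of relevant documents, so relevant documents
    # that were never retrieved lower the score.
    return precision_sum / len(relevant)


def mean_average_precision(runs: Iterable[Tuple[Sequence[str], Set[str]]]) -> float:
    """Mean of per-query average precision over (ranking, relevant set) pairs."""
    scores = [average_precision(ranked, relevant) for ranked, relevant in runs]
    return sum(scores) / len(scores) if scores else 0.0
```

As a usage example, a query whose ranking is `["d1", "d2", "d3"]` with relevant set `{"d1", "d3"}` has average precision (1/1 + 2/3) / 2, and MAP is the mean of such values over all queries in the evaluation set.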
INDEX TERMS
Semantics, Knowledge based systems, Software engineering, Internet, Tutorials, Open source software
CITATION

Z. Lin, Y. Zou, J. Zhao and B. Xie, "Improving software text retrieval using conceptual knowledge in source code," 2017 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE), Urbana, IL, USA, 2017, pp. 123-134.
doi:10.1109/ASE.2017.8115625