Exemplar: A Source Code Search Engine for Finding Highly Relevant Applications
Sept.-Oct. 2012 (vol. 38 no. 5)
pp. 1069-1087
BibTeX:
@article{10.1109/TSE.2011.84,
  author = {Collin McMillan and Mark Grechanik and Denys Poshyvanyk and Chen Fu and Qing Xie},
  title = {Exemplar: A Source Code Search Engine for Finding Highly Relevant Applications},
  journal = {IEEE Transactions on Software Engineering},
  volume = {38},
  number = {5},
  issn = {0098-5589},
  year = {2012},
  pages = {1069-1087},
  doi = {http://doi.ieeecomputersociety.org/10.1109/TSE.2011.84},
  publisher = {IEEE Computer Society},
  address = {Los Alamitos, CA, USA},
}
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TSE.2011.84
A fundamental problem of finding software applications that are highly relevant to development tasks is the mismatch between the high-level intent reflected in the descriptions of these tasks and low-level implementation details of applications. To reduce this mismatch we created an approach called EXEcutable exaMPLes ARchive (Exemplar) for finding highly relevant software projects from large archives of applications. After a programmer enters a natural-language query that contains high-level concepts (e.g., MIME, datasets), Exemplar retrieves applications that implement these concepts. Exemplar ranks applications in three ways. First, we consider the descriptions of applications. Second, we examine the Application Programming Interface (API) calls used by applications. Third, we analyze the dataflow among those API calls. We performed two case studies (with professional and student developers) to evaluate how these three rankings contribute to the quality of the search results from Exemplar. The results of our studies show that the combined ranking of application descriptions and API documents yields the most-relevant search results. We released Exemplar and our case study data to the public.
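The idea of combining a description-based ranking with an API-documentation-based ranking can be illustrated with a minimal sketch. It is not Exemplar's implementation or scoring formula: the `App` record, the `overlap` term-matching score, the equal weights, and the sample data are all assumptions made for illustration, and the dataflow-based ranking is left out entirely.

```python
from dataclasses import dataclass


@dataclass
class App:
    """Hypothetical record for one application in the archive."""
    name: str
    description: str       # project description text
    api_docs: list[str]    # documentation text of the API calls the app uses


def overlap(query: str, text: str) -> float:
    """Toy relevance score: fraction of query terms that occur in the text."""
    terms = set(query.lower().split())
    words = set(text.lower().split())
    return len(terms & words) / len(terms) if terms else 0.0


def rank(query: str, apps: list[App],
         w_desc: float = 0.5, w_api: float = 0.5) -> list[tuple[str, float]]:
    """Rank apps by a weighted sum of description and API-doc scores.

    The weights are illustrative assumptions, not values from the paper.
    """
    scored = []
    for app in apps:
        desc_score = overlap(query, app.description)
        # Use the best-matching API document as the app's API score.
        api_score = max((overlap(query, d) for d in app.api_docs), default=0.0)
        scored.append((app.name, w_desc * desc_score + w_api * api_score))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Made-up sample data: the query terms never appear in either description,
# so only the API-document score can surface the relevant application.
apps = [
    App("mailer", "send email messages",
        ["encode a MIME message body", "open an SMTP connection"]),
    App("plotter", "draw charts from datasets", ["render a line chart"]),
]
results = rank("mime message", apps)
```

In this toy data the description of `mailer` never mentions MIME, so a description-only ranking would miss it; the documentation of the API calls it uses supplies the matching vocabulary, which is the kind of mismatch between high-level intent and low-level implementation that the abstract describes.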
Index Terms:
Search engines, Engines, Software, Java, Cryptography, Vocabulary, Data mining, software reuse, Source code search engines, information retrieval, concept location, open source software, mining software repositories
Citation:
Collin McMillan, Mark Grechanik, Denys Poshyvanyk, Chen Fu, Qing Xie, "Exemplar: A Source Code Search Engine for Finding Highly Relevant Applications," IEEE Transactions on Software Engineering, vol. 38, no. 5, pp. 1069-1087, Sept.-Oct. 2012, doi:10.1109/TSE.2011.84

