The Community for Technology Leaders
2015 IEEE 31st International Conference on Data Engineering (ICDE) (2015)
Seoul, South Korea
April 13, 2015 to April 17, 2015
ISBN: 978-1-4799-7964-6
pp: 1408-1411
Feiran Huang , DEKE and School of Information, Renmin University of China, China
Jia Li , DEKE and School of Information, Renmin University of China, China
Jiaheng Lu , DEKE and School of Information, Renmin University of China, China
Tok Wang Ling , School of Computing, National University of Singapore, Singapore
Zhaoan Dong , DEKE and School of Information, Renmin University of China, China
ABSTRACT
In the world of academia, research documents enable the sharing and dissemination of scientific discoveries. During these “big data” times, academic search engines are widely used to find the relevant research documents. Considering the domain of computer science, a researcher often inputs a query with a specific goal to find an algorithm or a theorem. However, to this date, the return result of most search engines is just as a list of related papers. Users have to browse the results, download the interesting papers and look for the desired information, which is obviously laborious and inefficient. In this paper, we present a novel academic search system, called PandaSearch, that returns the results with a fine-grained interface, where the results are well organized by different categories, such as definitions, theorems, lemmas, algorithms and figures. The key technical challenges in our system include the automatic identification and extraction of different parts in a research document, the discovery of the main topic phrases for a definition or a theorem, and the recommendation of related definitions or figures to elegantly satisfy the search intention of users. Based on this, we have built a user friendly search interface for users to conveniently explore the documents, and find the relevant information.
INDEX TERMS
Context, Search engines, Portable document format, Detectors, XML, Google
CITATION
Feiran Huang, Jia Li, Jiaheng Lu, Tok Wang Ling, Zhaoan Dong, "PandaSearch: A fine-grained academic search engine for research documents", 2015 IEEE 31st International Conference on Data Engineering (ICDE), vol. 00, no. , pp. 1408-1411, 2015, doi:10.1109/ICDE.2015.7113388
97 ms
(Ver 3.3 (11022016))