The Community for Technology Leaders
Green Image
Issue No. 01 - March (2017 vol. 3)
ISSN: 2332-7790
pp: 18-35
Feng Xia , School of Software, Dalian University of Technology, Dalian, China
Wei Wang , School of Software, Dalian University of Technology, Dalian, China
Teshome Megersa Bekele , School of Software, Dalian University of Technology, Dalian, China
Huan Liu , School of Computing, Informatics, and Decision Systems Engineering, Arizona State University, Tempe, AZ
ABSTRACT
With the rapid growth of digital publishing, harvesting, managing, and analyzing scholarly information have become increasingly challenging. The term Big Scholarly Data is coined for the rapidly growing scholarly data, which contains information including millions of authors, papers, citations, figures, tables, as well as scholarly networks and digital libraries. Nowadays, various scholarly data can be easily accessed and powerful data analysis technologies are being developed, which enable us to look into science itself with a new perspective. In this paper, we examine the background and state of the art of big scholarly data. We first introduce the background of scholarly data management and relevant technologies. Second, we review data analysis methods, such as statistical analysis, social network analysis, and content analysis for dealing with big scholarly data. Finally, we look into representative research issues in this area, including scientific impact evaluation, academic recommendation, and expert finding. For each issue, the background, main challenges, and latest research are covered. These discussions aim to provide a comprehensive review of this emerging area. This survey paper concludes with a discussion of open issues and promising future directions.
INDEX TERMS
Data mining, Big data, Metadata, Libraries, Social network services, Crawlers, Data analysis
CITATION

F. Xia, W. Wang, T. M. Bekele and H. Liu, "Big Scholarly Data: A Survey," in IEEE Transactions on Big Data, vol. 3, no. 1, pp. 18-35, 2017.
doi:10.1109/TBDATA.2016.2641460
349 ms
(Ver 3.3 (11022016))