This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Opening Doors to Sharing Social Media Data
January/February 2012 (vol. 27 no. 1)
pp. 47-51
Fred Morstatter, Arizona State University
Huan Liu, Arizona State University
Daniel Zeng, Chinese Academy of Sciences and University of Arizona

Research data sharing becomes increasingly difficult in the context of social media. Increasing restrictions from social media sites are creating an environment where data cannot be freely shared and as a result scientific claims cannot be verified. In this work, we present a novel approach to data sharing that does not require explicitly publishing a dataset. We create a framework where researchers systematically share the parameters they used to crawl the dataset along with the code used to collect the data, allowing the reader to re-assemble the dataset at a later time. While this approach is by no means a silver bullet, we seek to start a conversation for researchers to implement approaches to data sharing that can be embraced by the research community.

1. K. Coffman and A. Odlyzko, "The Size and Growth Rate of the Internet," First Monday, vol. 3, no. 10, 1998; http://firstmonday.org/htbin/cgiwrap/bin/ ojs/index.php/fm/article/view/620541.
2. J. Ginsberg et al., "Detecting Influenza Epidemics Using Search Engine Query Data," Nature, vol. 457, no. 7232, 2009, pp. 1012–1014.
3. F. Leisch, "Sweave: Dynamic Generation of Statistical Reports Using Literate Data Analysis," Proc. Computational Statistics (Compstat 2002), W. Härdle and B. Rönz, eds., Physica-Verlag, 2002.
4. R. Peng, "Reproducible Research in Computational Science," Science, vol. 334, no. 6060, 2011, pp. 1226–1227.
5. D. Zeng et al., "Social Media Analytics and Intelligence," IEEE Intelligent Systems, vol. 25, no. 6, 2010, pp. 13–16.
6. W.W. Zachary, "An Information Flow Model for Conflict and Fission in Small Groups," J. Anthropological Research, vol. 33, no. 4, 1977, pp. 452–473.
7. J. Leskovec, J Kleinberg, and C. Faloutsos, "Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations," (KDD 05), ACM, 2005, pp. 177–187.
8. D. Heath, "Pete Warden vs. Facebook: A Case of Too Much Data Access," ITWire,7 Apr. 2010; www.itwire.com/it-policy-news/regulation 38101-pete-warden-vs-facebook-a-case-of-too-much-data-access .

Index Terms:
Data Sharing, Social Media, Data Distribution Policy, Reproducibility, Social Computing
Citation:
Fred Morstatter, Huan Liu, Daniel Zeng, "Opening Doors to Sharing Social Media Data," IEEE Intelligent Systems, vol. 27, no. 1, pp. 47-51, Jan.-Feb. 2012, doi:10.1109/MIS.2012.19
Usage of this product signifies your acceptance of the Terms of Use.