2011 IEEE 27th International Conference on Data Engineering (2011)

Hannover, Germany

Apr. 11, 2011 to Apr. 16, 2011

ISBN: 978-1-4244-8959-6

pp: 51-62

James Cheng , School of Computer Engineering, Nanyang Technological University, Singapore

Yiping Ke , Shenzhen Institutes of Advanced Technology, Chinese Academy of Science, China

Shumo Chu , School of Computer Engineering, Nanyang Technological University, Singapore

M. Tamer Ozsu , David R. Cheriton School of Computer Science, University of Waterloo, Ontario, Canada

ABSTRACT

The k-core of a graph is the largest subgraph in which every vertex is connected to at least k other vertices within the subgraph. Core decomposition finds the k-core of the graph for every possible k. Past studies have shown important applications of core decomposition such as in the study of the properties of large networks (e.g., sustainability, connectivity, centrality, etc.), for solving NP-hard problems efficiently in real networks (e.g., maximum clique finding, densest subgraph approximation, etc.), and for large-scale network fingerprinting and visualization. The k-core is a well accepted concept partly because there exists a simple and efficient algorithm for core decomposition, by recursively removing the lowest degree vertices and their incident edges. However, this algorithm requires random access to the graph and hence assumes the entire graph can be kept in main memory. Nevertheless, real-world networks such as online social networks have become exceedingly large in recent years and still keep growing at a steady rate. In this paper, we propose the first external-memory algorithm for core decomposition in massive graphs. When the memory is large enough to hold the graph, our algorithm achieves comparable performance as the in-memory algorithm. When the graph is too large to be kept in the memory, our algorithm requires only O(k<inf>max</inf>) scans of the graph, where k<inf>max</inf> is the largest core number of the graph. We demonstrate the efficiency of our algorithm on real networks with up to 52.9 million vertices and 1.65 billion edges.

INDEX TERMS

CITATION

M. T. Ozsu, S. Chu, Y. Ke and J. Cheng, "Efficient core decomposition in massive networks,"

*2011 IEEE 27th International Conference on Data Engineering(ICDE)*, Hannover, Germany, 2011, pp. 51-62.

doi:10.1109/ICDE.2011.5767911

CITATIONS

SEARCH