Issue No. 04 - April (2013 vol. 24)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPDS.2012.173
D. Carra , Comput. Sci. Dept., Univ. of Verona, Verona, Italy
M. Steiner , Bell Labs., Alcatel-Lucent, Murray Hill, NJ, USA
P. Michiardi , EURECOM, Sophia Antipolis, France
E. W. Biersack , EURECOM, Sophia Antipolis, France
W. Effelsberg , Dept. of Comput. Sci. IV, Univ. of Mannheim, Mannheim, Germany
T. En-Najjary , Orange Labs., RESA/PEAK, Orange, France
The endeavor of this work is to study the impact of content popularity in a large-scale Peer-to-Peer network, namely KAD. Based on an extensive measurement campaign, we pinpoint several deficiencies of KAD in handling popular content and provide a series of improvements to address such shortcomings. Our work reveals that keywords, which are associated with content, may become popular for two distinct reasons. First, we show that some keywords are intrinsically popular because they are common to many disparate contents: in such case we ameliorate KAD by introducing a simple mechanism that identifies stopwords. Then, we focus on keyword popularity that directly relates to popular content. We design and evaluate an adaptive load balancing mechanism that is backward compatible with the original implementation of KAD. Our scheme features the following properties: 1) it drives the process that selects the location of peers responsible to store references to objects, based on object popularity; 2) it solves problems related to saturated peers that would otherwise inflict a significant drop in the diversity of references to objects, and 3) if coupled with a load-aware content search procedure, it allows for a more fair and efficient usage of peer resources.
resource allocation, content management, content-based retrieval, peer-to-peer computing, Kademlia-based P2P routing system, KAD, large-scale peer-to-peer network, popular content handling, keyword popularity, adaptive load balancing mechanism, saturated peer resources, load-aware content search procedure, stopword identification, content popularity, Peer to peer computing, Publishing, Load management, Routing, Instruments, Search problems, Electronic mail, DHT, Peer-to-Peer, measurements
W. Effelsberg, E. W. Biersack, P. Michiardi, M. Steiner, D. Carra and T. En-Najjary, "Characterization and Management of Popular Content in KAD," in IEEE Transactions on Parallel & Distributed Systems, vol. 24, no. , pp. 662-671, 2013.