Fifth IEEE International Conference on Peer-to-Peer Computing (P2P'05)
Finding Rare Data Objects in P2P File-Sharing Systems
Konstanz, Germany
August 31-September 02
ISBN: 0-7695-2376-5
Peer-to-peer file-sharing systems have hundreds of thousands of users sharing petabytes of data, however, their search functionality is limited. In general, query results contain many references to the same data object. These references are grouped, and the size of the group — the number of references it contains — is the typical ranking metric. Although group size is effective in finding popular data, it works poorly for rare, less popular data. Other ranking functions, such as precision and cosine similarity, are more appropriate in this case. We show the significant performance benefit in finding rare data using these ranking functions through extensive simulation.
Citation:
Wai Gen Yee, Dongmei Jia, Ophir Frieder, "Finding Rare Data Objects in P2P File-Sharing Systems," p2p, pp.181-190, Fifth IEEE International Conference on Peer-to-Peer Computing (P2P'05), 2005