Issue No. 06 - June (2007 vol. 18)
Demetrios Zeinalipour-Yazti , IEEE
Vana Kalogeraki , IEEE
Dimitrios Gunopulos , IEEE
<p><b>Abstract</b>—The emerging Peer-to-Peer (P2P) model has become a very powerful and attractive paradigm for developing Internet-scale systems for sharing resources, including files and documents. The distributed nature of these systems, where nodes are typically located across different networks and domains, inherently hinders the efficient retrieval of information. In this paper, we consider the effects of topologically aware overlay construction techniques on efficient P2P keyword search algorithms. We present the Peer Fusion (<it>pFusion</it>) architecture that aims to efficiently integrate heterogeneous information that is geographically scattered on peers of different networks. Our approach builds on work in unstructured P2P systems and uses only local knowledge. Our empirical results, using the pFusion middleware architecture and data sets from Akamai's Internet mapping infrastructure (AKAMAI), the <it>Active Measurement Project</it> (NLANR), and the Text REtrieval Conference (TREC) show that the architecture we propose is both efficient and practical.</p>
Information retrieval, peer-to-peer, overlay construction algorithms.
Demetrios Zeinalipour-Yazti, Vana Kalogeraki, Dimitrios Gunopulos, "pFusion: A P2P Architecture for Internet-Scale Content-Based Search and Retrieval", IEEE Transactions on Parallel & Distributed Systems, vol. 18, no. , pp. 804-817, June 2007, doi:10.1109/TPDS.2007.1060