Efficient Semantic-Based Content Search in P2P Network
July 2004 (vol. 16 no. 7)
pp. 813-826

Abstract: Most existing Peer-to-Peer (P2P) systems support only title-based searches and are limited in functionality when compared to today's search engines. In this paper, we present the design of a distributed P2P information sharing system that supports semantic-based content searches of relevant documents. First, we propose a general and extensible framework for searching for similar documents in a P2P network. The framework is based on the novel concept of a Hierarchical Summary Structure. Second, based on the framework, we develop an efficient document search system that summarizes and maintains all documents within the network at different granularities. Finally, an experimental study is conducted on a real P2P prototype, and a large-scale network is further simulated. The results show the effectiveness, efficiency, and scalability of the proposed system.

Index Terms:
Content-based, similarity search, peer-to-peer, hierarchical summary, indexing.
Heng Tao Shen, Yanfeng Shu, Bei Yu, "Efficient Semantic-Based Content Search in P2P Network," IEEE Transactions on Knowledge and Data Engineering, vol. 16, no. 7, pp. 813-826, July 2004, doi:10.1109/TKDE.2004.1318564
