This Article 
 Bibliographic References 
 Add to: 
A Scalable Peer-to-Peer Architecture for Distributed Information Monitoring Applications
June 2005 (vol. 54 no. 6)
pp. 767-782
Ling Liu, IEEE Computer Society
We present PeerCQ, a decentralized architecture for Internet scale information monitoring using a network of heterogeneous peer nodes. PeerCQ uses Continual Queries (CQs) as its primitives to express information-monitoring requests. The PeerCQ development has three unique characteristics. First, we develop a systematic and serverless approach to large scale information monitoring, aiming at providing a fully distributed, highly scalable, and self-configurable architecture for scalable and reliable processing of a large number of CQs over a network of loosely coupled, heterogeneous, and possibly unreliable nodes (peers). Second, we introduce an effective service partitioning scheme at the P2P protocol layer to distribute the processing of CQs over a peer-to-peer information monitoring overlay network while maintaining a good balance between system utilization and load balance in the presence of peer joins, departures, and failures. A unique feature of our service partitioning scheme is its ability to incorporate strategies for handling hot spot monitoring requests and peer heterogeneity into the load balancing scheme in PeerCQ. Third, but not least, we develop a dynamic passive replication scheme to enable reliable processing of long-running information monitoring requests in an environment of inherently unreliable peers, including an analytical model to discuss its fault tolerance properties. We report a set of experiments demonstrating the feasibility and the effectiveness of the PeerCQ approach to large-scale peer-to-peer information monitoring.

[1] I. Clarke, O. Sandberg, B. Wiley, and T.W. Hong, “Freenet: A Distributed Anonymous Information Storage and Retrieval System,” Proc. ICSI Workshop Design Issues in Anonymity and Unobservability, 2000.
[2] F. Dabek, M.F. Kaashoek, D. Karger, R. Morris, and I. Stoica, “Wide-Area Cooperative Storage with CFS,” Proc. ACM Symp. Operating Systems Principles (SOSP), 2001.
[3] B. Gedik and L. Liu, “Building Reliable Peer-to-Peer Information Monitoring Service through Replication,” Technical Report GIT-CC-02-66, Georgia Inst. of Tech nology, 2002.
[4] B. Gedik and L. Liu, “PeerCQ: A Scalable and Self-Configurable Peer-to-Peer Information Monitoring System,” Technical Report GIT-CC-02-32, Georgia Inst. of Tech nology, 2002.
[5] B. Gedik and L. Liu, “PeerCQ: A Decentralized and Self-Configuring Peer-to-Peer Information Monitoring System,” Proc. Int'l Conf. Distributed Computing Systems, 2003.
[6] Gnutella, “The Gnutella Home Page,” http:/, 2002.
[7] D. Karger, E. Lehman, T. Leighton, M. Levine, D. Lewin, and R. Panigrahy, “Consistent Hashing and Random Trees: Distributed Caching Protocols for Relieving Hot Spots on the World Wide Web,” Proc. ACM Symp. Theory of Computing Author Index, 1997.
[8] KaZaa, “The Kazaa Home Page,” http:/, 2003.
[9] L. Liu, C. Pu, and W. Tang, “Continual Queries for Internet Scale Event-Driven Information Delivery,” IEEE Trans. Knowledge and Data Eng., vol. 11, no. 4, pp. 610-628, July/Aug. 1999.
[10] L. Liu, C. Pu, and W. Tang, “Detecting and Delivering Information Changes on the Web,” Proc. Int'l Conf. Information and Knowledge Management, 2000.
[11] L. Liu, W. Tang, D. Buttler, and C. Pu, “Information Monitoring on the Web: A Scalable Solution,” World Wide Web J., 2003.
[12] C.G. Plaxton, R. Rajaraman, and A.W. Richa, “Accessing Nearby Copies of Replicated Objects in a Distributed Environment,” Proc. ACM Symp. Parallel Algorithms and Architectures, 1997.
[13] A. Rao, K. Lakshminarayanan, S. Surana, R. Karp, and I. Stoica, “Load Balancing in Structured P2P Systems,” Proc. Int'l Workshop Peer-to-Peer Systems, 2003.
[14] S. Ratnasamy, P. Francis, M. Handley, R. Karp, and S. Shenker, “A Scalable Content-Addressable Network,” Proc. ACM SIGCOMM, 2001.
[15] G. Ricart and A.K. Agrawala, “An Optimal Algorithm for Mutual Exclusion in Computer Networks,” Comm. ACM, pp. 9-17, 1981.
[16] A. Rowstron and P. Druschel, “Pastry: Scalable, Decentralized Object Location and Routing for Largescale Peer-to-Peer Systems,” Proc. IFIP/ACM Int'l Conf. Distributed Systems Platforms, 2001.
[17] A. Rowstron, A. Kermarrec, M. Castro, and P. Druschel, “Scribe: The Design of a Large-Scale Event Notification Infrastructure,” Proc. Int'l Workshop Networked Group Comm., 2001.
[18] S. Saroiu, P.K. Gummadi, and S.D. Gribble, “A Measurement Study of Peer-to-Peer File Sharing Systems,” Technical Report UW-CSE-01-06-02, Univ. of Washington, 2001.
[19] I. Stoica, R. Morris, D. Karger, M. Kaashoek, and H. Balakrishnan, “Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications,” Proc. ACM SIGCOMM, 2001.
[20] B.Y. Zhao, J.D. Kubiatowicz, and A.D. Joseph, “Tapestry: An Infrastructure for Fault-Tolerant Wide-Area Location and Routing,” Technical Report UCB/CSD-01-1141, Univ. of California Berkeley, 2001.

Index Terms:
Distributed information monitoring, peer-to-peer networks, continual query systems.
Bugra Gedik, Ling Liu, "A Scalable Peer-to-Peer Architecture for Distributed Information Monitoring Applications," IEEE Transactions on Computers, vol. 54, no. 6, pp. 767-782, June 2005, doi:10.1109/TC.2005.87
Usage of this product signifies your acceptance of the Terms of Use.