This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Query Merging: Improving Query Subscription Processing in a Multicast Environment
January/February 2003 (vol. 15 no. 1)
pp. 174-191

Abstract—This paper introduces techniques for reducing data dissemination costs of query subscriptions in a multicast environment. The reduction is achieved by merging queries with overlapping, but not necessarily equal, answers. The paper formalizes the query-merging problem and introduces a general framework and cost model for evaluating merging. We prove that the problem is NP-hard and propose exhaustive algorithms and three heuristic algorithms: The Pair Merging Algorithm, the Directed Search Algorithm, and the Clustering Algorithm. We develop a simulator, which uses geographical queries as a representative example for evaluating the different heuristics and show that the performance of our heuristics is close to optimal.

[1] S. Acharya, R. Alonso, M. Franklin, and S. Zdonik, “Broadcast Disks: Data Management for Asymmetric Communication Environments,” Proc. ACM SIGMOD, pp. 199-210, May 1995.
[2] Airmedia,http:/www.airmedia.com, 1999.
[3] BackWeb,http:/www.backweb.com, 1999.
[4] F. Bauer and M. Brown, “A Reliable Multicast Transport Protocol for a Global Broadcast Service-Based Network,” Proc. Military Comm. Conf. (MILCOM '97), vol. 2, pp. 988-992, 1997.
[5] A. Bestavros, “Speculative Data Dissemination and Service to Reduce Server Load, Network Traffic and Service Time for Distributed Information Systems,” Proc. Int'l Conf. Data Eng. (ICDE '96), Mar. 1996.
[6] T.F. Bowen, G. Gopal, G. Herman, T. Hickey, K.C. Lee, W.H. Mansfield, J. Raitz, and A. Weinrib, “The Datacycle Architecture,” Comm. ACM, vol. 35, no. 12, Dec. 1992.
[7] A. Chan, “Transactional Publish/Subscribe: The Proactive Multicast of Database Changes,” ACM SIGMOD, June 1998.
[8] J.C.-I. Chuang and M.A. Sirbu, “Pricing Multicast Communication: A Cost Based Approach,” Technical Report 98354, Carnegie Mellon Univ., 1998.
[9] M. Corson and J. Macker, “Global Quality of Service-Based and Reliable Data Dissemination Via Asymmetric Direct Broadcast Satellite Channels,” Proc. 19th Pacific Telecomm. Conf., Jan. 1997.
[10] J.C. Culberson and R.A. Reckhow, “Covering Polygons is Hard (preliminary abstract),” Proc. 29th Ann. Symp. Foundations of Computer Science, Oct. 1988.
[11] S. Dao and B. Perry, “Efficient Dissemination of Information on the Internet,” Bull. of the Technical Committee on Data Eng., vol. 19, no. 3, pp. 48-54, Sept. 1996.
[12] S. Dao and B. Perry, “Information Dissemination in Hybrid Satellite/Terrestrial Networks,” Bull. of the Technical Committee on Data Eng., vol. 19, no. 3, pp. 12-18, Sept. 1996.
[13] R. Douglass, J. Mork, and B. Suresh, “Battlefield Awareness and Data Dissemination (BADD) for the Warfighter,” Int'l Soc. for Optical Eng., vol. 3080, pp. 18-24, 1997.
[14] J. Dukes-Schlossberg, Y. Lee, and N. Lehrer, “Lids: Intelligent Information Dissemination Server,” Proc. Military Comm. Conf. (MILCOM '97), vol. 2, pp. 635-639, 1997.
[15] M. Franklin and S. Zdonik, “Dissemination-Based Information Systems,” Bull. of the Technical Committee on Data Eng., vol. 19, no. 3, pp. 20-30, Sept. 1996.
[16] M. Franklin and S. Zdonik, “A Framework for Scalable Dissemination-Based Systems,” Object Oriented Programming Systems Languages and Applications (OOPSLA '97), ACM Oct. 1997.
[17] D. Gifford, “Polychannel Systems for Mass Digital Comm.,” Comm. Assoc. for Computing Machinery, Feb. 1990.
[18] H. Gould, “Research Bibliography of Two Special Number Sequences,” Math. Monongaliae, vol. 12, 1971.
[19] J. Graham-Cumming, “Hits and Miss-es: A Year Watching the Web,” Sixth Int'l World Wide Web Conf., 1997.
[20] Q. Hu, D. Lee, and W.-C. Lee, “Optimal Channel Allocation for Data Dissemination in Mobile Computing Environments,” Proc. 18th IEEE Int'l Conf. Distributed Computing Systems, pp. 480-487, 1998.
[21] T. Imielinski and B.R. Badrinath, “Wireless Computing: Challenges in Data Management,” Comm. ACM, vol. 37, no. 10, Oct. 1994.
[22] A. Keller and J. Basu, "A Predicate-Based Caching Scheme for Client-Server Database Architecture," The VLDB J., Jan. 1996, pp. 35-47.
[23] J.J. King, “Quist: A System for Semantic Query Optimization in Relational Databases,” Proc. Seventh Int'l Conf. Very Large Data Bases, pp 510-517, Sept. 1981.
[24] M. Lazaroff and P. Sage, “Any Information, Anywhere, Anytime for the Warfighter,” Int'l Soc. for Optical Eng., vol. 3080, pp. 35-42, 1997.
[25] W.-C. Lee, Q. Hu, and D.L. Lee, “Channel Allocation Methods for Data Dissemination in Mobile Computing Environments,” Proc. Sixth IEEE Int'l Symp. High Performance Distributed Computing, Aug. 1997.
[26] R. Lindell, J. Bannister, C. DeMatteis, M. O'Brien, J. Stepanek, M. Campbell, and F. Bauer, “Deploying Internet Services Over a Direct Broadcast Satellite Network: Challenges and Opportunities in the Global Broadcast Service,” Proc. Military Comm. Conf. (MILCOM '97), 1997.
[27] N. Lu, “Network Interface to Tactical Communications,” Proc. Military Comm. Conf. (MILCOM '97), 1997.
[28] M.V. Mannino, P. Chu, and T. Sager, “Statiscal Profile Estimation in Database Systems,” ACM Computing Surveys, vol. 20, no. 3, Sept. 1988.
[29] MARIMBA,Castanethttp:/www.marimba.com, 1999.
[30] B. Nadel, “A Kinder, Gentler Pointcast,” PC Magazine, vol. 15, no. 18, 1996.
[31] S. Ramakrishnan and V. Dayal, “The Pointcast Network,” ACM SIGMOD, June 1998.
[32] H. Salkin and J. Saha, “Set Covering: Algorithms, Results and Codes,” Bull. of the Operations Research Soc. of Am., vol. 20, no. 2, Nov. 1972.
[33] T. Stephenson, B. DeCleene, G. Speckert, and H. Voorhees, “BADD Phase II. DDS Information Management Architecture,” Int'l Soc. for Optical Eng., vol. 3080, pp. 49-58, 1997.
[34] M. Tan, M.D. Theys, H.J. Siegel, N.B. Beck, and M. Jurczyk, “A Mathematical Model, Heuristic, and Simulation Study for a Basic Data Staging Problem in a Heterogeneous Networking Environment,” Proc. Seventh Int'l Computing Workshop (HCW '98), 1998.

Index Terms:
Query processing, data dissemination, query merging, query subscriptions, query result multicast, geographical queries.
Citation:
Arturo Crespo, Orkut Buyukkokten, Hector Garcia-Molina, "Query Merging: Improving Query Subscription Processing in a Multicast Environment," IEEE Transactions on Knowledge and Data Engineering, vol. 15, no. 1, pp. 174-191, Jan.-Feb. 2003, doi:10.1109/TKDE.2003.1161589
Usage of this product signifies your acceptance of the Terms of Use.