Boolean Query Mapping Across Heterogeneous Information Sources
August 1996 (vol. 8 no. 4)
pp. 515-521

Abstract—Searching over heterogeneous information sources is difficult because of the nonuniform query languages. Our approach is to allow a user to compose Boolean queries in one rich front-end language. For each user query and target source, we transform the user query into a subsuming query that can be supported by the source but that may return extra documents. The results are then processed by a filter query to yield the correct final result. In this paper we introduce the architecture and associated algorithms for generating the supported subsuming queries and filters. We show that generated subsuming queries return a minimal number of documents; we also discuss how minimal cost filters can be obtained. We have implemented prototype versions of these algorithms and demonstrated them on heterogeneous Boolean systems.
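
The subsume-then-filter idea described in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes a toy query language with only terms, AND, OR, and a proximity operator, and a hypothetical target source that does not support proximity. All class and function names (`Term`, `Near`, `subsume`, `search`) are invented for illustration. The filter used here is simply the whole original query, which gives the correct result but is not necessarily a minimal-cost filter.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass(frozen=True)
class Term:
    word: str


@dataclass(frozen=True)
class Near:
    # Hypothetical proximity predicate: both words within `dist` positions.
    left: str
    right: str
    dist: int


@dataclass(frozen=True)
class And:
    left: "Query"
    right: "Query"


@dataclass(frozen=True)
class Or:
    left: "Query"
    right: "Query"


Query = Union[Term, Near, And, Or]


def matches(q: Query, words: List[str]) -> bool:
    """Evaluate a query against one document given as a list of words."""
    if isinstance(q, Term):
        return q.word in words
    if isinstance(q, Near):
        lpos = [i for i, w in enumerate(words) if w == q.left]
        rpos = [i for i, w in enumerate(words) if w == q.right]
        return any(abs(i - j) <= q.dist for i in lpos for j in rpos)
    if isinstance(q, And):
        return matches(q.left, words) and matches(q.right, words)
    if isinstance(q, Or):
        return matches(q.left, words) or matches(q.right, words)
    raise TypeError(f"unknown query node: {q!r}")


def subsume(q: Query) -> Query:
    """Relax a query for an assumed source that lacks Near.

    Each Near is replaced by the weaker conjunction of its two terms.
    With no negation in this toy language, weakening a sub-query can only
    add documents, so the relaxed query subsumes the original.
    """
    if isinstance(q, Near):
        return And(Term(q.left), Term(q.right))
    if isinstance(q, (And, Or)):
        return type(q)(subsume(q.left), subsume(q.right))
    return q


def search(q: Query, corpus: List[str]) -> List[str]:
    """Send the subsuming query to the source, then filter locally."""
    relaxed = subsume(q)
    # Stand-in for the remote source: it evaluates only the relaxed query.
    candidates = [d for d in corpus if matches(relaxed, d.split())]
    # Filter step: re-check the original query to drop the extra documents.
    return [d for d in candidates if matches(q, d.split())]
```

For example, with a corpus of `"boolean query mapping"` and `"query about some other boolean"`, the query `Near("boolean", "query", 1)` is relaxed to `boolean AND query`, which retrieves both documents; the filter then keeps only the first.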

Index Terms:
Boolean queries, query translation, information retrieval, heterogeneity, digital libraries, query subsumption, filtering.
Citation:
Kevin Chen-Chuan Chang, Hector Garcia-Molina, Andreas Paepcke, "Boolean Query Mapping Across Heterogeneous Information Sources," IEEE Transactions on Knowledge and Data Engineering, vol. 8, no. 4, pp. 515-521, Aug. 1996, doi:10.1109/69.536244