This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
4th International Conference on Parallel and Distributed Information Systems (PDIS '96)
Querying the World Wide Web
December 18-December 20
ISBN: 0-8186-7475-X
A.O. Mendelzon, Dept. of Comput. Sci., Toronto Univ., Ont., Canada
G.A. Mihaila, Dept. of Comput. Sci., Toronto Univ., Ont., Canada
T. Milo, Dept. of Comput. Sci., Toronto Univ., Ont., Canada
Abstract: The World Wide Web is a large, heterogeneous, distributed collection of documents connected by hypertext links. The most common technology currently used for searching the Web depends on sending information retrieval requests to "index servers". One problem with this is that these queries cannot exploit the structure and topology of the document network. The authors propose a query language, WebSQL, that takes advantage of multiple index servers without requiring users to know about them, and that integrates textual retrieval with structure and topology-based queries. They give a formal semantics for WebSQL using a calculus based on a novel "virtual graph" model of a document network. They propose a new theory of query cost based on the idea of "query locality," that is, how much of the network must be visited to answer a particular query. Finally, they describe a prototype implementation of WebSQL written in Java.
Index Terms:
hypermedia; World Wide Web querying; large heterogeneous distributed document collection; hypertext links; Web searching; document network; WebSQL query language; multiple index servers; textual retrieval; topology-based queries; formal semantics; calculus; virtual graph model; query cost; query locality; Java; information retrieval requests
Citation:
A.O. Mendelzon, G.A. Mihaila, T. Milo, "Querying the World Wide Web," pdis, pp.0080, 4th International Conference on Parallel and Distributed Information Systems (PDIS '96), 1996
Usage of this product signifies your acceptance of the Terms of Use.