10th International Workshop on Database & Expert Systems Applications
Modeling and Querying Structure and Contents of the Web
Florence, Italy
September 01-September 03
ISBN: 0-7695-0281-4
For accessing and processing the information provided on the Web, there is a need for extarction, restructuring, and integration of semistructured data from autonomous, heterogeneous sources. In this paper, we regard the Web and its contents as a unit, represented in an object-oriented data model: the Web structure (inter-document level), given by its hyperlinks, the parse-trees of Web pages (intra-document level), and their contents. The model is complemented by a rule-based object-oriented language which is extended by Web access capabilities and allows for and navigation in the unified model. We show the practicability of our approach by using the FLORID system.