This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Database and Expert Systems Applications, 15th International Workshop on (DEXA'04)
Hierarchies in HTML Documents: Linking Text to Concepts
Zaragoza, Spain
August 30-September 03
ISBN: 0-7695-2195-9
Radek Burget, Brno University of Technology, Czech Republic
For the successful setting of the Semantic Web, it is necessary to provide tools for linking the large amounts of data that are currently available in HTML documents to the Semantic Web ontologies. Due to the enormous variability of the HTML code, it is very limiting to define direct bindings between patterns of the HTML code and the concepts. We propose an approach based on modeling the visual part of the rendered document and describing the key characteristics of the data presentation in a general way. As a next step, we propose the way for using this model for locating the instances of the concepts in the document using the approximate tree matching algorithms and regular expressions.
Citation:
Radek Burget, "Hierarchies in HTML Documents: Linking Text to Concepts," dexa, pp.186-190, Database and Expert Systems Applications, 15th International Workshop on (DEXA'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.