2004 IEEE International Conference on e-Technology, e-Commerce and e-Service (EEE'04)
Data Extraction and Annotation for Dynamic Web Pages
Taipei, Taiwan
March 28-March 31
ISBN: 0-7695-2073-1
Many Web sites contain large sets of pages generated dynamically using a common template. The structured data extracted from these pages with semantic annotation are valuable for information system. In this paper, we proposed a system, ADeaD, to automatically extract data values from these Web pages and annotate the data schema. Experimental evaluation on a lot of real Web page collections indicates our algorithm correctly extracted data and annotated the data schema.
Citation:
Hui Song, Suraj Giri, Fanyuan Ma, "Data Extraction and Annotation for Dynamic Web Pages," eee, pp.499-502, 2004 IEEE International Conference on e-Technology, e-Commerce and e-Service (EEE'04), 2004