29th Annual International Computer Software and Applications Conference (COMPSAC'05) Volume 1
A Web Data Extraction Description Language and Its Implementation
Edinburgh, Scotland
July 26-July 28
ISBN: 0-7695-2413-3
I-Chen Wu, National Chiao Tung University
Jui-Yuan Su, National Chiao Tung University
Loon-Been Chen, National Chiao Tung University

A data extraction model, named the browser-oriented data extraction (BODE) model, was proposed in [14] to extract web contents with script functions. In this model, the system built on top of browsers accesses pages by simulating users' operations on browsers.

Based on this model, this paper defines a scripting language, named the BODED (Browser-Oriented Data Extraction Description) language, which instructs the system how to perform data extraction. The paper also proposes a technique, called indirect browser replication, to implement a BODE system, and optimizes the performance of this technique.

Citation:
I-Chen Wu, Jui-Yuan Su, Loon-Been Chen, "A Web Data Extraction Description Language and Its Implementation," 29th Annual International Computer Software and Applications Conference (COMPSAC'05), vol. 1, pp. 293-298, 2005.