29th Annual International Computer Software and Applications Conference (COMPSAC'05) Volume 1 A Web Data Extraction Description Language and Its Implementation Edinburgh, Scotland July 26-July 28 ISBN: 0-7695-2413-3
A data extraction model, named the browser-oriented data extraction (BODE) model, was proposed in [14] to extract web contents with script functions. In this model, the system built on top of browsers accesses pages by simulating users? operations on browsers. Based on this model, this paper defines a scripting language, named the BODED (Browser-Oriented Data Extraction Description) language, which instructs the system how to do data extraction. This paper proposes a technique, called indirect browser replication to implement a BODE system, and also optimize the performance of this technique.
Citation:
I-Chen Wu, Jui-Yuan Su, Loon-Been Chen, "A Web Data Extraction Description Language and Its Implementation," compsac, vol. 1, pp.293-298, 29th Annual International Computer Software and Applications Conference (COMPSAC'05) Volume 1, 2005 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||