The Community for Technology Leaders
Green Image
Issue No. 05 - September/October (2003 vol. 18)
ISSN: 1541-1672
pp: 34-40
Chia-Hui Chang , National Central University, Taiwan
Harianto Siek , Institute of Information Science, Taiwan
Jiann-Jyh Lu , Institute of Information Science, Taiwan
Chun-Nan Hsu , Institute of Information Science, Taiwan
Jen-Jie Chiou , Deepspot Intelligent Systems, Taiwan
ABSTRACT
<p>The DeepSpot Agent Toolbox exploits online Web data sources using reconfigurable Web wrapper agents. These agents are rapidly generated and executed on the basis of the XML-based Web Navigation Description Language and extraction rule generator IEPAD (information extraction based on pattern discovery). A WNDL script describes how to locate, extract, and combine data. By executing different WNDL scripts, users can automate all types of Web browsing sessions. They also describe IEPAD, a data extractor based on pattern discovery techniques. IEPAD lets software agents automatically discover the extraction rules to extract the contents of a structurally formatted Web page without the need to label a Web page to train a wrapper. With this programming-by-example authoring tool, users can generate a complete Web wrapper agent by browsing the target Web sites. Various applications demonstrate this approach's feasibility.</p>
INDEX TERMS
intelligent agents, Web wrappers, information integration, WNDL
CITATION

C. Chang, C. Hsu, H. Siek, J. Lu and J. Chiou, "Reconfigurable Web Wrapper Agents," in IEEE Intelligent Systems, vol. 18, no. , pp. 34-40, 2003.
doi:10.1109/MIS.2003.1234767
86 ms
(Ver 3.3 (11022016))