The Community for Technology Leaders
E-Commerce Technology for Dynamic E-Business, IEEE International Conference on (2004)
Beijing, China
Sept. 13, 2004 to Sept. 15, 2004
ISBN: 0-7695-2206-8
pp: 158-161
Angel Vi? , University of A Coru?a, Spain
Manuel ?lvarez , University of A Coru?a, Spain
Juan Raposo , University of A Coru?a, Spain
Alberto Pan , University of A Coru?a, Spain
ABSTRACT
The problem of data extraction from the Deep Web can be divided into two tasks: crawling the client-side and the server-side deep web. The objective of this paper is to define an architecture and a set of related techniques to access the information placed in the client-side deep web. This involves dealing with aspects such as JavaScript technology, non-standard session maintenance mechanisms, client redirections, pop-up menus, etc. Our work uses current browser APIs as building blocks and leverages them to implement novel crawling models and algorithms.
INDEX TERMS
null
CITATION
Angel Vi?, Manuel ?lvarez, Juan Raposo, Alberto Pan, "Client-Side Deep Web Data Extraction", E-Commerce Technology for Dynamic E-Business, IEEE International Conference on, vol. 00, no. , pp. 158-161, 2004, doi:10.1109/CEC-EAST.2004.30
90 ms
(Ver 3.3 (11022016))