2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI'06) Protection Techniques from Information Extraction Hong Kong, China December 18-December 22 ISBN: 0-7695-2747-7
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/WI.2006.138
Information extraction technologies meet the market need for automatic tools for extracting semi-structured information from web pages. However, pages may change over time due to different reasons, ranging from restyling pages to on-purpose modifications brought about into pages in order to puzzle Web wrappers. In this paper we deal with this latter scenario, by studying the issue of on-purpose wrapper spoiling and its relationship to wrapping. We present an architecture and a tool implementing a wrapper spoiling system, and discuss some practical spoiling techniques which are also experimentally tested.
Citation:
Gianluigi Greco, Giovambattista Ianni, Vincenzino Lio, Luigi Palopoli, "Protection Techniques from Information Extraction," wi, pp.1029-1033, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI'06), 2006 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||