2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05)
ITPilot: A Toolkit for Industrial-Strength Web Data Extraction
Compiègne University of Technology, France
September 19-September 22
ISBN: 0-7695-2415-X
BibTeX:
@article{10.1109/WI.2005.85,
  author = {Alberto Pan and Juan Raposo and Manuel Álvarez and Paula Montoto and José Losada and Justo Hidalgo},
  title = {ITPilot: A Toolkit for Industrial-Strength Web Data Extraction},
  journal = {Web Intelligence, IEEE / WIC / ACM International Conference on},
  volume = {0},
  year = {2005},
  isbn = {0-7695-2415-X},
  pages = {798-801},
  doi = {http://doi.ieeecomputersociety.org/10.1109/WI.2005.85},
  publisher = {IEEE Computer Society},
  address = {Los Alamitos, CA, USA},
}
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/WI.2005.85
In recent years, many research systems have been proposed to perform data extraction and automation tasks on Web sources. Since most of today's Web sources are "human-readable" but not "machine-readable", these systems must address a number of difficult challenges, such as dealing with complex navigation sequences, extracting data from HTML pages and reacting to source changes. Denodo Corporation has developed ITPilot, an industrial-strength solution that allows complex "wrappers" for Web sources to be graphically generated and automatically maintained. This paper presents the architecture and the basic ideas "behind the scenes" in ITPilot.
Citation:
Alberto Pan, Juan Raposo, Manuel Álvarez, Paula Montoto, José Losada, Justo Hidalgo, "ITPilot: A Toolkit for Industrial-Strength Web Data Extraction," 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05), pp. 798-801, 2005.
