|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2009 WRI World Congress on Computer Science and Information Engineering
Web Data Extraction Based on Label Library
Los Angeles, California USA
March 31-April 02
ISBN: 978-0-7695-3507-4
| ASCII Text | x | ||
| Shoubiao Tan, Jin Fan, Yuan Jiang, "Web Data Extraction Based on Label Library," Computer Science and Information Engineering, World Congress on, vol. 5, pp. 134-138, 2009 WRI World Congress on Computer Science and Information Engineering, 2009. | |||
| BibTex | x | ||
| @article{ 10.1109/CSIE.2009.595, author = {Shoubiao Tan and Jin Fan and Yuan Jiang}, title = {Web Data Extraction Based on Label Library}, journal ={Computer Science and Information Engineering, World Congress on}, volume = {5}, year = {2009}, isbn = {978-0-7695-3507-4}, pages = {134-138}, doi = {http://doi.ieeecomputersociety.org/10.1109/CSIE.2009.595}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Computer Science and Information Engineering, World Congress on TI - Web Data Extraction Based on Label Library SN - 978-0-7695-3507-4 SP134 EP138 A1 - Shoubiao Tan, A1 - Jin Fan, A1 - Yuan Jiang, PY - 2009 KW - Web information extraction KW - label library KW - data intensive Web pages VL - 5 JA - Computer Science and Information Engineering, World Congress on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/CSIE.2009.595
A Web data Extraction technique based on label library is proposed for extracting information from data intensive Web pages in this paper. It eliminates conception ambiguity of the contents of Web pages with the label library, mines data regions by using MDR repeated patterns discovery algorithm, recognizes their structure and extracts data from them through a novel hierarchic pattern recognition and data extraction algorithm. Experiments showed it has perfect effect.
Index Terms:
Web information extraction, label library, data intensive Web pages
Citation:
Shoubiao Tan, Jin Fan, Yuan Jiang, "Web Data Extraction Based on Label Library," csie, vol. 5, pp.134-138, 2009 WRI World Congress on Computer Science and Information Engineering, 2009
Usage of this product signifies your acceptance of the Terms of Use.
