|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
| ASCII Text | x | ||
| Bing Liu, Robert Grossman, Yanhong Zhai, "Mining Web Pages for Data Records," IEEE Intelligent Systems, vol. 19, no. 6, pp. 49-55, November/December, 2004. | |||
| BibTex | x | ||
| @article{ 10.1109/MIS.2004.68, author = {Bing Liu and Robert Grossman and Yanhong Zhai}, title = {Mining Web Pages for Data Records}, journal ={IEEE Intelligent Systems}, volume = {19}, number = {6}, issn = {1541-1672}, year = {2004}, pages = {49-55}, doi = {http://doi.ieeecomputersociety.org/10.1109/MIS.2004.68}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - MGZN JO - IEEE Intelligent Systems TI - Mining Web Pages for Data Records IS - 6 SN - 1541-1672 SP49 EP55 EPD - 49-55 A1 - Bing Liu, A1 - Robert Grossman, A1 - Yanhong Zhai, PY - 2004 KW - data mining KW - Web mining KW - Web data extraction KW - Web data KW - databases VL - 19 JA - IEEE Intelligent Systems ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/MIS.2004.68
Much information on the Web is contained in regularly structured objects, or data records. Data records often present their host pages' essential information, such as lists of products and services. Mining data records to extract this information can help you provide value-added services. Existing approaches to data extraction on the Web include supervised learning and automatic techniques. Supervised learning requires substantial human effort, and current automatic techniques provide poor results. To solve this problem, the MDR (mining data records) system exploits two key observations about the layout of data records in Web pages and employs a string-matching algorithm. Experiments show that this new automatic technique significantly outperforms existing methods. In addition, it mines both contiguous and noncontiguous data records.
Index Terms:
data mining, Web mining, Web data extraction, Web data, databases
Citation:
Bing Liu, Robert Grossman, Yanhong Zhai, "Mining Web Pages for Data Records," IEEE Intelligent Systems, vol. 19, no. 6, pp. 49-55, Nov.-Dec. 2004, doi:10.1109/MIS.2004.68
Usage of this product signifies your acceptance of the Terms of Use.

