This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Annotating Search Results from Web Databases
March 2013 (vol. 25 no. 3)
pp. 514-527
Yiyao Lu, Binghamton University, Binghamton
Hai He, Morningstar Inc., Chicago
Hongkun Zhao, Bloomberg L.P., Princeton
Weiyi Meng, Binghamton University, Binghamton
Clement Yu, University of Illinois at Chicago, Chicago
An increasing number of databases have become web accessible through HTML form-based search interfaces. The data units returned from the underlying database are usually encoded into the result pages dynamically for human browsing. For the encoded data units to be machine processable, which is essential for many applications such as deep web data collection and Internet comparison shopping, they need to be extracted out and assigned meaningful labels. In this paper, we present an automatic annotation approach that first aligns the data units on a result page into different groups such that the data in the same group have the same semantic. Then, for each group we annotate it from different aspects and aggregate the different annotations to predict a final annotation label for it. An annotation wrapper for the search site is automatically constructed and can be used to annotate new result pages from the same web database. Our experiments indicate that the proposed approach is highly effective.
Index Terms:
HTML,Semantics,Database systems,Data mining,Ontologies,Clustering algorithms,Information retrieval,wrapper generation,Data alignment,data annotation,web database
Citation:
Yiyao Lu, Hai He, Hongkun Zhao, Weiyi Meng, Clement Yu, "Annotating Search Results from Web Databases," IEEE Transactions on Knowledge and Data Engineering, vol. 25, no. 3, pp. 514-527, March 2013, doi:10.1109/TKDE.2011.175
Usage of this product signifies your acceptance of the Terms of Use.