DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2011.168
Many queries issued to search engines aim to find values from structured data (e.g., movie showtimes at a specific location). In such scenarios, there is often a mismatch between the values in the structured data (how content creators describe entities) and the web queries (how different users try to retrieve them). Recognizing the alternative ways people refer to an entity is therefore crucial for structured web search. In this paper, we study the problem of automatically generating entity synonyms over structured data, with the goal of closing the gap between users and structured data. We propose an offline, data-driven approach that mines query logs for instances where content creators and web users apply a variety of strings to refer to the same webpages. In this way, given a set of strings that reference entities, we generate an expanded set of equivalent strings (entity synonyms) for each entity. Our framework consists of three modules: candidate generation, candidate selection, and noise cleaning. We further study the cause of the problem by identifying different classes of entity synonyms. Experiments on real-life data sets verify the proposed method, showing that it can significantly increase the coverage of structured web queries with good precision.
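To make the idea of mining query logs for strings that lead to the same webpages concrete, the following is a minimal Python sketch, not the authors' algorithm. It assumes a click log of (query, clicked URL) pairs. The function name `mine_synonym_candidates`, the `min_overlap` threshold, the overlap score, and the example URLs are all hypothetical choices made for illustration. Queries that share at least one clicked page with an entity string become candidates, and a simple overlap filter stands in for the selection and noise-cleaning stages.

```python
from collections import defaultdict


def mine_synonym_candidates(click_log, entity_strings, min_overlap=0.5):
    """Illustrative sketch only (hypothetical scoring, not the paper's method).

    click_log: iterable of (query, clicked_url) pairs.
    entity_strings: strings that reference entities in the structured data.
    Returns {entity_string: [(candidate_query, overlap_score), ...]}.
    """
    # Group the clicked URLs by lower-cased query string.
    urls_by_query = defaultdict(set)
    for query, url in click_log:
        urls_by_query[query.lower()].add(url)

    synonyms = {}
    for entity in entity_strings:
        key = entity.lower()
        entity_urls = urls_by_query.get(key, set())
        selected = []
        for query, urls in urls_by_query.items():
            if query == key or not entity_urls:
                continue
            # Fraction of the candidate's clicked pages shared with the entity.
            # Assumed stand-in for candidate selection and noise cleaning.
            overlap = len(urls & entity_urls) / len(urls)
            if overlap >= min_overlap:
                selected.append((query, overlap))
        # Highest overlap first, ties broken alphabetically.
        synonyms[entity] = sorted(selected, key=lambda pair: (-pair[1], pair[0]))
    return synonyms


# Made-up click log for demonstration; the URLs are placeholders.
TITLE = "Indiana Jones and the Kingdom of the Crystal Skull"
example_log = [
    (TITLE, "imdb.example/tt1"),
    (TITLE, "wiki.example/ij4"),
    ("indiana jones 4", "imdb.example/tt1"),
    ("indiana jones 4", "wiki.example/ij4"),
    ("indy 4", "imdb.example/tt1"),
    ("harrison ford", "imdb.example/tt1"),
    ("harrison ford", "imdb.example/nm1"),
    ("harrison ford", "wiki.example/hford"),
]
result = mine_synonym_candidates(example_log, [TITLE])
```

In this toy log, "indiana jones 4" and "indy 4" are kept because all of their clicks land on pages also reached by the full title, while "harrison ford" is dropped because most of its clicks go elsewhere. A related-but-not-equivalent query of this kind is the sort of noise a cleaning step would need to remove.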
Index Terms:
Motion pictures, Web search, Noise, Search engines, Earth Observing System, Digital cameras, Databases, query log, Entity synonym, fuzzy matching, structured data, web query
Citation:
Tao Cheng, Hady W. Lauw, Stelios Paparizos, "Entity Synonyms for Structured Web Search," IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 10, pp. 1862-1875, Oct. 2012, doi:10.1109/TKDE.2011.168

