Publication 2014 Issue No. 1 - Jan. Abstract - Spatially Aware Term Selection for Geotagging
 This Article Share Bibliographic References Add to: Digg Furl Spurl Blink Simpy Google Del.icio.us Y!MyWeb Search Similar Articles Articles by Olivier Van Laere Articles by Jonathan Quinn Articles by Steven Schockaert Articles by Bart Dhoedt
Spatially Aware Term Selection for Geotagging
Jan. 2014 (vol. 26 no. 1)
pp. 221-234
 ASCII Text x Olivier Van Laere, Jonathan Quinn, Steven Schockaert, Bart Dhoedt, "Spatially Aware Term Selection for Geotagging," IEEE Transactions on Knowledge and Data Engineering, vol. 26, no. 1, pp. 221-234, Jan., 2014.
 BibTex x @article{ 10.1109/TKDE.2013.42,author = {Olivier Van Laere and Jonathan Quinn and Steven Schockaert and Bart Dhoedt},title = {Spatially Aware Term Selection for Geotagging},journal ={IEEE Transactions on Knowledge and Data Engineering},volume = {26},number = {1},issn = {1041-4347},year = {2014},pages = {221-234},doi = {http://doi.ieeecomputersociety.org/10.1109/TKDE.2013.42},publisher = {IEEE Computer Society},address = {Los Alamitos, CA, USA},}
 RefWorks Procite/RefMan/Endnote x TY - JOURJO - IEEE Transactions on Knowledge and Data EngineeringTI - Spatially Aware Term Selection for GeotaggingIS - 1SN - 1041-4347SP221EP234EPD - 221-234A1 - Olivier Van Laere, A1 - Jonathan Quinn, A1 - Steven Schockaert, A1 - Bart Dhoedt, PY - 2014KW - StandardsKW - EncyclopediasKW - Electronic publishingKW - InternetKW - EstimationKW - ContextKW - feature extractionKW - Information search and retrievalKW - knowledge managementKW - artificial intelligenceKW - text miningKW - metadataKW - geographic information retrievalKW - classificationKW - semi-structured dataVL - 26JA - IEEE Transactions on Knowledge and Data EngineeringER -
Olivier Van Laere, Ghent University, Belgium
Jonathan Quinn, Cardiff University, UK
Steven Schockaert, Cardiff University, UK
Bart Dhoedt, Ghent University, Belgium
The task of assigning geographic coordinates to textual resources plays an increasingly central role in geographic information retrieval. The ability to select those terms from a given collection that are most indicative of geographic location is of key importance in successfully addressing this task. However, this process of selecting spatially relevant terms is at present not well understood, and the majority of current systems are based on standard term selection techniques, such as $(\chi^2)$ or information gain, and thus fail to exploit the spatial nature of the domain. In this paper, we propose two classes of term selection techniques based on standard geostatistical methods. First, to implement the idea of spatial smoothing of term occurrences, we investigate the use of kernel density estimation (KDE) to model each term as a two-dimensional probability distribution over the surface of the Earth. The second class of term selection methods we consider is based on Ripley's K statistic, which measures the deviation of a point set from spatial homogeneity. We provide experimental results which compare these classes of methods against existing baseline techniques on the tasks of assigning coordinates to Flickr photos and to Wikipedia articles, revealing marked improvements in cases where only a relatively small number of terms can be selected.
Index Terms:
Standards,Encyclopedias,Electronic publishing,Internet,Estimation,Context,feature extraction,Information search and retrieval,knowledge management,artificial intelligence,text mining,metadata,geographic information retrieval,classification,semi-structured data
Citation:
Olivier Van Laere, Jonathan Quinn, Steven Schockaert, Bart Dhoedt, "Spatially Aware Term Selection for Geotagging," IEEE Transactions on Knowledge and Data Engineering, vol. 26, no. 1, pp. 221-234, Jan. 2014, doi:10.1109/TKDE.2013.42