The Community for Technology Leaders
2015 3rd International Conference on Future Internet of Things and Cloud (FiCloud) (2015)
Rome, Italy
Aug. 24, 2015 to Aug. 26, 2015
ISBN: 978-1-4673-8102-4
pp: 465-472
ABSTRACT
This paper describes an approach to infer the location of a social media post at a hyper-local scale based on its content, conditional to the knowledge that the post originates from a larger area such as a city or even a state. The approach comprises three components: (i) a discriminative classifier, namely, Logistic Regression (LR) which selects from a set of most probable sub-regions from where a post might have originated, (ii) a clustering technique, namely, k-means, that adaptively partitions the larger geographic region into sub regions based on the density of the posts, and (iii) a range of techniques to extract a set of hyper-local words from the posts to be fed as features to the LR classifier. The approach is evaluated on a large corpus of tweets collected from Twitter over the NYC, Washington DC, and state of Connecticut regions. The results show that our approach can geo-locate tweets within 1:72 km for NYC, 12:5 km for DC and 37:00 km for CT. These results from three geographically and socially diverse regions suggest that our approach outperforms contemporary methods that estimate locations within ranges of hundreds of kilometers. It can thus support a wide array of services such as location-based advertising, and disaster and emergency response.
INDEX TERMS
Media, Training, Mathematical model, Feature extraction, Accuracy, Cities and towns, Logistics
CITATION

B. McClanahan and S. S. Gokhale, "Location Inference of Social Media Posts at Hyper-Local Scale," 2015 3rd International Conference on Future Internet of Things and Cloud (FiCloud)(FICLOUD), Rome, Italy, 2015, pp. 465-472.
doi:10.1109/FiCloud.2015.71
92 ms
(Ver 3.3 (11022016))