The Community for Technology Leaders
RSS Icon
Subscribe
Lyon
Aug. 22, 2011 to Aug. 27, 2011
ISBN: 978-1-4577-1373-6
pp: 124-129
ABSTRACT
In this paper, we present a Web spam detection algorithm that relies on link analysis. The method consists of three steps: (1) decomposition of web graphs in densely connected sub graphs and calculation of the features for each sub graph, (2) use of SVM classifiers to identify sub graphs composed of Web spam, and (3) propagation of predictions over web graphs by a biased Page Rank algorithm to expand the scope of identification. We performed experiments on a public benchmark. An empirical study of the core structure of web graphs suggests that highly ranked non-spam hosts can be identified by viewing the coreness of the web graph elements.
INDEX TERMS
Web spam, dense subgraphs, biased pagerank
CITATION
Yutaka I. Leon-Suematsu, Kentaro Inui, Sadao Kurohashi, Yutaka Kidawara, "Web Spam Detection by Exploring Densely Connected Subgraphs", WI-IAT, 2011, 2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies, 2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies 2011, pp. 124-129, doi:10.1109/WI-IAT.2011.152
6 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool