Lyon
Aug. 22, 2011 to Aug. 27, 2011
ISBN: 978-1-4577-1373-6
pp: 273-276
ABSTRACT
This paper presents a meaning-based method to distinguish text with little or no semantic content from text whose meaning can be processed. The basic method assumes that a semantic analyzer will produce less output from input text that is semantically less well-formed. The method was pilot-tested on a corpus of blog spam. Future improvements, including a method to distinguish semantically unified from semantically disparate text, are sketched. The tested method, and even more so the projected improvements, open the way to taking the spam-filtering arms race to a new level that is very costly to spam producers.
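The basic assumption can be illustrated with a minimal sketch. This is not the authors' analyzer: the toy lexicon, the function names (`analyze`, `semantic_yield`, `looks_like_spam`), and the threshold of 0.3 are all hypothetical stand-ins, chosen only to show the idea that text yielding little analyzer output per token is flagged as likely spam.

```python
import re

# Hypothetical toy lexicon mapping words to concepts.
# A real ontological-semantic analyzer would be far richer; this is an assumption.
TOY_LEXICON = {
    "dog": "ANIMAL",
    "cat": "ANIMAL",
    "chased": "PURSUE",
    "ate": "INGEST",
    "food": "FOOD",
    "garden": "PLACE",
}


def analyze(text):
    """Toy stand-in for a semantic analyzer: return tokens and the concepts found."""
    tokens = re.findall(r"[a-z]+", text.lower())
    concepts = [TOY_LEXICON[t] for t in tokens if t in TOY_LEXICON]
    return tokens, concepts


def semantic_yield(text):
    """Fraction of tokens for which the toy analyzer produced output."""
    tokens, concepts = analyze(text)
    if not tokens:
        return 0.0
    return len(concepts) / len(tokens)


def looks_like_spam(text, threshold=0.3):
    """Flag text whose semantic yield falls below an assumed threshold."""
    return semantic_yield(text) < threshold


if __name__ == "__main__":
    for sample in ["The dog chased the cat", "xkq buy zzz cheap qwe"]:
        print(sample, "->", semantic_yield(sample), looks_like_spam(sample))
```

In this sketch the yield is normalized by token count so that long and short texts are comparable; the paper's abstract does not say how output is measured, so that normalization is also an assumption.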
INDEX TERMS
spam filter, semantics, ontological semantics
CITATION
Christian F. Hempelmann, Vikas Mehra, "Baseline Semantic Spam Filtering", 2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (WI-IAT), 2011, pp. 273-276, doi:10.1109/WI-IAT.2011.133