loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
19th IEEE International Conference on Tools with Artificial Intelligence - Vol.2 (ICTAI 2007)
Combating Good Word Attacks on Statistical Spam Filters with Multiple Instance Learning
Paris, France
October 29-October 31
ISBN: 0-7695-3015-X
Statistical spam filters are known to be vulnerable to ad- versarial attacks. One such adversarial attack, known as the Good Word Attack, thwarts spam filters by appending to spam messages sets of "good" words, which are common in legitimate e-mail but rare in spam. We present a counter- attack strategy that first attempts to differentiate spam from legitimate e-mail in the input space, by transforming each e- mail into a bag of multiple segments, and subsequently ap- plies multiple instance logistic regression on the bags. We treat each segment in the bag as an instance. An e-mail is classified as spam if at least one instance in the corre- sponding bag is spam, and as legitimate if all the instances in it are legitimate. We show that a spam filter using our multiple instance counter-attack strategy stands up better to good word attacks than its single instance counterpart and the commonly practiced Bayesian filters.
Citation:
Yan Zhou, Zach Jorgensen, Meador Inge, "Combating Good Word Attacks on Statistical Spam Filters with Multiple Instance Learning," ictai, vol. 2, pp.298-305, 19th IEEE International Conference on Tools with Artificial Intelligence - Vol.2 (ICTAI 2007), 2007
Usage of this product signifies your acceptance of the Terms of Use.