2009 IEEE International Conference on Semantic Computing (2009)
Berkeley, CA, USA
Sept. 14, 2009 to Sept. 16, 2009
ISBN: 978-0-7695-3800-6
pp: 33-39
ABSTRACT
This paper presents the results of using statistical analysis and automatic text categorization to identify an author’s age group from the author’s online chat posts. Two models were used: a Naïve Bayesian Classifier and a Support Vector Machine (SVM). In the SVM experiments, the model achieved an F-score of 0.996 on test data when distinguishing teens from adults. We also introduce an alternative method for generating “stop words” that chooses n-grams based on their relative distribution across the classes.
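The abstract does not spell out the stop-word criterion, so the following is only an illustrative sketch of one way n-grams could be selected by their relative distribution across classes. It assumes that an n-gram is a stop-word candidate when its relative frequency is nearly the same in every class, so that it carries little information about age group. The function names and the `tolerance` and `min_count` parameters are hypothetical and are not taken from the paper.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def distribution_stop_words(posts_by_class, n=1, tolerance=0.2, min_count=2):
    """Pick n-grams whose relative frequency is similar in every class.

    posts_by_class maps a class label (e.g. "teen", "adult") to a list of
    tokenized posts. The selection rule is an assumption made for this
    sketch: an n-gram qualifies if it occurs at least `min_count` times in
    each class and its highest per-class relative frequency is within a
    factor of (1 + tolerance) of its lowest.
    """
    counts = {}
    totals = {}
    for label, posts in posts_by_class.items():
        counter = Counter()
        for tokens in posts:
            counter.update(ngrams(tokens, n))
        if not counter:
            raise ValueError(f"class {label!r} contains no {n}-grams")
        counts[label] = counter
        totals[label] = sum(counter.values())

    stop_words = set()
    vocabulary = set().union(*counts.values())
    for gram in vocabulary:
        # Require the n-gram to be reasonably frequent in every class.
        if any(counts[label][gram] < min_count for label in counts):
            continue
        rel = [counts[label][gram] / totals[label] for label in counts]
        # Near-uniform relative distribution across classes.
        if max(rel) <= (1 + tolerance) * min(rel):
            stop_words.add(gram)
    return stop_words
```

With this rule, words that teens and adults use at similar rates are filtered out, while class-specific tokens are kept as features. A real system would tune the threshold on held-out data rather than fix it in advance.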
INDEX TERMS
online chat, age classification, Support Vector Machine, Naïve Bayesian Classifier, stop words
CITATION

J. Tam and C. H. Martell, "Age Detection in Chat," 2009 IEEE International Conference on Semantic Computing (ICSC), Berkeley, CA, USA, 2009, pp. 33-39.
doi:10.1109/ICSC.2009.37