2009 IEEE International Conference on Semantic Computing (2009)
Berkeley, CA, USA
Sept. 14, 2009 to Sept. 16, 2009
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICSC.2009.37
This paper presents the results of using statistical analysis and automatic text categorization to identify an author’s age group based on the author's online chat posts. A Naive Bayesian Classifier and Support Vector Machine (SVM) model were used. The SVM model experiments generated an f-score measurement of 0.996 on test data distinguishing teens from adults. We also introduce an alternative method for generating “stop words” that chooses n-grams based on their relative distribution across the classes.
online chat, age classification, Support Vector Machine, Naïve Bayesian Classifier, stop words
J. Tam and C. H. Martell, "Age Detection in Chat," 2009 IEEE International Conference on Semantic Computing(ICSC), Berkeley, CA, USA, 2009, pp. 33-39.