22nd International Conference on Advanced Information Networking and Applications (aina 2008) Classification Model Learning for Bulletin Board Site Analysis Based on Unbalanced Textual Examples March 25-March 28 ISBN: 978-0-7695-3095-6
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/AINA.2008.57
This paper proposes a method that acquires a more appropriate classification model for label extraction. The model can extract specific labels from articles included in bulletin board sites. The labels represent the contents of the articles and are used to characterize the articles. The method selects two kinds of important examples not including a specific label by using expressions related to the label. The method inductively acquires the classification model from the selected examples and examples including the label. The paper applies the method to articles collected from three bulletin board sites and verifies its effect through comparative experiments
Index Terms:
text classification, imbalance problem, bulletin board site
Citation:
Shigeaki Sakurai, Ryohei Orihara, "Classification Model Learning for Bulletin Board Site Analysis Based on Unbalanced Textual Examples," aina, pp.494-501, 22nd International Conference on Advanced Information Networking and Applications (aina 2008), 2008 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||