|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2009 Ninth IEEE International Conference on Data Mining
Aspect Guided Text Categorization with Unobserved Labels
Miami, Florida
December 06-December 09
ISBN: 978-0-7695-3895-2
| ASCII Text | x | ||
| Dan Roth, Yuancheng Tu, "Aspect Guided Text Categorization with Unobserved Labels," Data Mining, IEEE International Conference on, pp. 962-967, 2009 Ninth IEEE International Conference on Data Mining, 2009. | |||
| BibTex | x | ||
| @article{ 10.1109/ICDM.2009.129, author = {Dan Roth and Yuancheng Tu}, title = {Aspect Guided Text Categorization with Unobserved Labels}, journal ={Data Mining, IEEE International Conference on}, volume = {0}, year = {2009}, issn = {1550-4786}, pages = {962-967}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICDM.2009.129}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Data Mining, IEEE International Conference on TI - Aspect Guided Text Categorization with Unobserved Labels SN - 1550-4786 SP962 EP967 A1 - Dan Roth, A1 - Yuancheng Tu, PY - 2009 KW - multiclass classsification KW - text categorization KW - structure learning KW - constrained optimization VL - 0 JA - Data Mining, IEEE International Conference on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDM.2009.129
This paper proposes a novel multiclass classification method and exhibits its advantage in the domain of text categorization with a large label space and, most importantly, when some of the labels were not observed in the training data. The key insight is the introduction of intermediate aspect variables that encode properties of the labels. Aspect variables serve as a joint representation for observed and unobserved labels. This way the classification problem can be viewed as a structure learning problem with natural constraints on assignments to the aspect variables. We solve the problem as a constrained optimization problem over multiple learners and show significant improvement in classifying short sentences into a large label space of categories, including previously unobserved categories.
Index Terms:
multiclass classsification, text categorization, structure learning, constrained optimization
Citation:
Dan Roth, Yuancheng Tu, "Aspect Guided Text Categorization with Unobserved Labels," icdm, pp.962-967, 2009 Ninth IEEE International Conference on Data Mining, 2009
Usage of this product signifies your acceptance of the Terms of Use.
