Issue No.11 - November (2011 vol.23)
Toshiyuki Tanaka, Kyoto University, Kyoto
Takeshi Yamada, NTT, Kyoto
Naonori Ueda, NTT Communication Science Laboratories, Kyoto
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2010.170
We propose a framework for improving classifier performance by making effective use of auxiliary samples. The auxiliary samples are labeled not according to the target taxonomy, into which we wish to classify samples, but according to different classification schemes or taxonomies. Our method finds a classifier by minimizing a weighted error over the target and auxiliary samples. The weights are defined so that the weighted error approximates the expected error when samples are classified into the target taxonomy. Experiments using synthetic and text data show that, in most cases, our method significantly improves classifier performance compared to conventional data augmentation methods.
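The idea of minimizing a weighted error over target and auxiliary samples can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not say how the weights are computed, so the per-sample weights below are hypothetical placeholders, and the one-dimensional logistic model, the function names `weighted_error` and `fit`, and the toy data are all assumptions made for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def weighted_error(w, b, samples):
    """Weighted logistic loss; each sample is (x, y, weight) with y in {0, 1}."""
    total = 0.0
    for x, y, weight in samples:
        p = min(max(sigmoid(w * x + b), 1e-12), 1.0 - 1e-12)
        total -= weight * (y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / sum(weight for _, _, weight in samples)


def fit(samples, lr=0.5, steps=500):
    """Minimize the weighted logistic loss by plain gradient descent."""
    w, b = 0.0, 0.0
    norm = sum(weight for _, _, weight in samples)
    for _ in range(steps):
        grad_w = grad_b = 0.0
        for x, y, weight in samples:
            residual = sigmoid(w * x + b) - y
            grad_w += weight * residual * x
            grad_b += weight * residual
        w -= lr * grad_w / norm
        b -= lr * grad_b / norm
    return w, b


# Target samples carry full weight. The auxiliary weights are hypothetical:
# a higher weight stands for an auxiliary label assumed to agree better
# with the target taxonomy.
target = [(-2.0, 0, 1.0), (-1.0, 0, 1.0), (1.0, 1, 1.0), (2.0, 1, 1.0)]
auxiliary = [(-1.5, 0, 0.6), (1.5, 1, 0.6), (0.5, 0, 0.1)]

data = target + auxiliary
w, b = fit(data)
final_error = weighted_error(w, b, data)
```

In this sketch an auxiliary sample with a small weight has little influence on the fitted classifier, which is the role the weights play in the description above; how to set them so that the weighted error approximates the expected target error is the subject of the paper itself.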
Transfer learning, semisupervised learning, text classification.
Toshiyuki Tanaka, Takeshi Yamada, Naonori Ueda, "Improving Classifier Performance Using Data with Different Taxonomies", IEEE Transactions on Knowledge & Data Engineering, vol. 23, no. 11, pp. 1668-1677, November 2011, doi:10.1109/TKDE.2010.170