Active Learning of Constraints for Semi-Supervised Clustering
Sicheng Xiong, Javad Azimi, Xiaoli Z. Fern
IEEE Transactions on Knowledge and Data Engineering (ISSN 1041-4347)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2013.22
Semi-supervised clustering aims to improve clustering performance by incorporating user supervision in the form of pairwise constraints. In this paper, we study the active learning problem of selecting pairwise must-link and cannot-link constraints for semi-supervised clustering. We consider active learning in an iterative setting: in each iteration, queries are selected based on the current clustering solution and the existing constraint set. We apply a general framework that builds on the concept of neighborhoods, where each neighborhood contains "labeled examples" of a different cluster according to the pairwise constraints. Our active learning method expands the neighborhoods by selecting informative points and querying their relationship with the neighborhoods. Under this framework, we build on the classic uncertainty-based principle and present a novel approach for computing the uncertainty associated with each data point. We further introduce a selection criterion that trades off the uncertainty of each data point against the expected number of queries (the cost) required to resolve that uncertainty. This allows us to select the queries with the highest information rate. We evaluate the proposed method on benchmark datasets, and the results demonstrate consistent and substantial improvements over the current state of the art.
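The selection criterion described above can be illustrated with a minimal sketch. The abstract does not state how uncertainty or cost is computed, so everything here is an assumption made for illustration: uncertainty is taken to be the Shannon entropy of a point's membership probabilities over the current neighborhoods, the neighborhoods are assumed to be queried in decreasing order of probability until a must-link answer is returned, and the function names (`entropy`, `expected_queries`, `information_rate`, `select_point`) are hypothetical.

```python
import math


def entropy(probs):
    """Shannon entropy (base 2) of a probability vector; zero entries are skipped."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def expected_queries(probs):
    """Expected query count under an ASSUMED policy: ask about neighborhoods
    in decreasing order of membership probability and stop at the first
    must-link answer, so the i-th neighborhood tried costs i queries."""
    ordered = sorted(probs, reverse=True)
    return sum(i * p for i, p in enumerate(ordered, start=1))


def information_rate(probs):
    """Uncertainty per expected query (assumes probs is non-empty and sums to 1)."""
    return entropy(probs) / expected_queries(probs)


def select_point(membership):
    """membership maps a point id to its neighborhood-membership probabilities.
    Returns the id with the highest uncertainty-to-cost ratio."""
    return max(membership, key=lambda pid: information_rate(membership[pid]))


# Hypothetical probabilities: "q" is more uncertain than "p" (3 bits vs. 2 bits),
# but resolving it is expected to take 4.5 queries instead of 2.5.
membership = {
    "p": [0.25] * 4,
    "q": [0.125] * 8,
}
best = select_point(membership)
print(best, information_rate(membership[best]))
```

In this toy input, ranking by uncertainty alone would pick "q", while the ratio picks "p", which is the kind of trade-off between uncertainty and query cost that the abstract describes.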
Index Terms:
Semi-supervised learning, Active learning, Clustering
Citation:
Sicheng Xiong, Javad Azimi, Xiaoli Z. Fern, "Active Learning of Constraints for Semi-Supervised Clustering," IEEE Transactions on Knowledge and Data Engineering, 25 Jan. 2013. IEEE Computer Society Digital Library. IEEE Computer Society, <http://doi.ieeecomputersociety.org/10.1109/TKDE.2013.22>

