|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2009 Ninth IEEE International Conference on Data Mining
Relevant Subspace Clustering: Mining the Most Interesting Non-redundant Concepts in High Dimensional Data
Miami, Florida
December 06-December 09
ISBN: 978-0-7695-3895-2
| ASCII Text | x | ||
| Emmanuel Müller, Ira Assent, Stephan Günnemann, Ralph Krieger, Thomas Seidl, "Relevant Subspace Clustering: Mining the Most Interesting Non-redundant Concepts in High Dimensional Data," Data Mining, IEEE International Conference on, pp. 377-386, 2009 Ninth IEEE International Conference on Data Mining, 2009. | |||
| BibTex | x | ||
| @article{ 10.1109/ICDM.2009.10, author = {Emmanuel Müller and Ira Assent and Stephan Günnemann and Ralph Krieger and Thomas Seidl}, title = {Relevant Subspace Clustering: Mining the Most Interesting Non-redundant Concepts in High Dimensional Data}, journal ={Data Mining, IEEE International Conference on}, volume = {0}, year = {2009}, issn = {1550-4786}, pages = {377-386}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICDM.2009.10}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Data Mining, IEEE International Conference on TI - Relevant Subspace Clustering: Mining the Most Interesting Non-redundant Concepts in High Dimensional Data SN - 1550-4786 SP377 EP386 A1 - Emmanuel Müller, A1 - Ira Assent, A1 - Stephan Günnemann, A1 - Ralph Krieger, A1 - Thomas Seidl, PY - 2009 KW - data mining KW - high dimensional data KW - subspace clustering KW - redundancy removal KW - global optimization VL - 0 JA - Data Mining, IEEE International Conference on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDM.2009.10
Subspace clustering aims at detecting clusters in any subspace projection of a high dimensional space. As the number of possible subspace projections is exponential in the number of dimensions, the result is often tremendously large. Recent approaches fail to reduce results to relevant subspace clusters. Their results are typically highly redundant, i.e. many clusters are detected multiple times in several projections. In this work, we propose a novel model for relevant subspace clustering (RESCU). We present a global optimization which detects the most interesting non-redundant subspace clusters. We prove that computation of this model is NP-hard. For RESCU, we propose an approximative solution that shows high accuracy with respect to our relevance model. Thorough experiments on synthetic and real world data show that RESCU successfully reduces the result to manageable sizes. It reliably achieves top clustering quality while competing approaches show greatly varying performance.
Index Terms:
data mining, high dimensional data, subspace clustering, redundancy removal, global optimization
Citation:
Emmanuel Müller, Ira Assent, Stephan Günnemann, Ralph Krieger, Thomas Seidl, "Relevant Subspace Clustering: Mining the Most Interesting Non-redundant Concepts in High Dimensional Data," icdm, pp.377-386, 2009 Ninth IEEE International Conference on Data Mining, 2009
Usage of this product signifies your acceptance of the Terms of Use.
