Third IEEE International Conference on Data Mining (ICDM'03)
Regression Clustering
Melbourne, Florida
November 19-November 22
ISBN: 0-7695-1978-4
ASCII Text:
Bin Zhang, "Regression Clustering," Data Mining, IEEE International Conference on, pp. 451, Third IEEE International Conference on Data Mining (ICDM'03), 2003.

BibTex:
@article{10.1109/ICDM.2003.1250952,
  author = {Bin Zhang},
  title = {Regression Clustering},
  journal = {Data Mining, IEEE International Conference on},
  volume = {0},
  year = {2003},
  isbn = {0-7695-1978-4},
  pages = {451},
  doi = {http://doi.ieeecomputersociety.org/10.1109/ICDM.2003.1250952},
  publisher = {IEEE Computer Society},
  address = {Los Alamitos, CA, USA},
}
Complex distributions in real-world data are often modeled as a mixture of simpler distributions, and clustering is one of the tools for revealing the structure of such a mixture. The same holds for datasets with chosen response variables on which regression is run. Unless clusters with very different response properties are separated, the residual error of the regression is large, and the mixture can also misguide input variable selection toward a more complex model. In Regression Clustering (RC), K (>1) regression functions are applied to the dataset simultaneously; they guide the clustering of the dataset into K subsets, each with a simpler distribution that matches its guiding function. Each function is regressed on its own subset of the data, with a much smaller residual error. Both the regressions and the clustering optimize a common objective function. We present an RC algorithm based on the K-Harmonic Means clustering algorithm and compare it with existing RC algorithms based on K-Means and EM.
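
The alternation described in the abstract (fit K regression functions, then reassign each point to the function that fits it best) can be illustrated with a minimal sketch. This is the simpler K-Means-style variant that the abstract names as a comparison baseline, not the K-Harmonic Means algorithm the paper presents. The function name `regression_clustering_kmeans`, the use of linear regressors with an intercept, the random initial partition, and all parameter names are assumptions made for illustration only.

```python
import numpy as np


def regression_clustering_kmeans(X, y, K=2, n_iter=50, seed=0):
    """Illustrative K-Means-style regression clustering with linear regressors.

    Hypothetical helper, not taken from the paper: alternates between
    fitting one least-squares line per subset and reassigning every point
    to the regressor with the smallest squared residual.
    """
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    A = np.column_stack([X, np.ones(n)])      # design matrix with intercept column
    labels = rng.integers(0, K, size=n)       # random initial partition (assumption)
    coefs = np.zeros((K, A.shape[1]))

    for _ in range(n_iter):
        # Regression step: refit each function on its own subset only.
        for k in range(K):
            mask = labels == k
            if mask.sum() >= A.shape[1]:      # skip subsets too small to fit
                coefs[k], *_ = np.linalg.lstsq(A[mask], y[mask], rcond=None)

        # Clustering step: each point moves to its best-fitting function.
        sq_resid = (A @ coefs.T - y[:, None]) ** 2   # shape (n, K)
        new_labels = sq_resid.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break                              # partition is stable
        labels = new_labels

    return coefs, labels
```

Both steps can only lower the same quantity, the sum over points of the squared residual to the assigned function, which is the sense in which the regressions and the clustering share one objective. Like K-Means, this variant depends on its initial partition and may stop at a local optimum; on a dataset that mixes two linear trends, the total squared error it returns is never larger than that of a single regression fitted to all the data.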
