|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
Third IEEE International Conference on Data Mining (ICDM'03)
MaPle: A Fast Algorithm for Maximal Pattern-based Clustering
Melbourne, Florida
November 19-November 22
ISBN: 0-7695-1978-4
| ASCII Text | x | ||
| Jian Pei, Xiaoling Zhang, Moonjung Cho, Haixun Wang, Philip S. Yu, "MaPle: A Fast Algorithm for Maximal Pattern-based Clustering," Data Mining, IEEE International Conference on, pp. 259, Third IEEE International Conference on Data Mining (ICDM'03), 2003. | |||
| BibTex | x | ||
| @article{ 10.1109/ICDM.2003.1250928, author = {Jian Pei and Xiaoling Zhang and Moonjung Cho and Haixun Wang and Philip S. Yu}, title = {MaPle: A Fast Algorithm for Maximal Pattern-based Clustering}, journal ={Data Mining, IEEE International Conference on}, volume = {0}, year = {2003}, isbn = {0-7695-1978-4}, pages = {259}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICDM.2003.1250928}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Data Mining, IEEE International Conference on TI - MaPle: A Fast Algorithm for Maximal Pattern-based Clustering SN - 0-7695-1978-4 SP EP A1 - Jian Pei, A1 - Xiaoling Zhang, A1 - Moonjung Cho, A1 - Haixun Wang, A1 - Philip S. Yu, PY - 2003 KW - null VL - 0 JA - Data Mining, IEEE International Conference on ER - | |||
Pattern-based clustering is important in many applications, such as DNA micro-array data analysis, automatic recommendation systems and target marketing systems. However, pattern-based clustering in large databases is challenging. On the one hand, there can be a huge number of clusters and many of them can be redundant and thus make the pattern-based clustering ineffective. On the other hand, the previous proposed methods may not be efficient or scalable in mining large databases. In this paper, we study the problem of maximal pattern-based clustering. Redundant clusters are avoided completely by mining only the maximal pattern-based clusters. MaPle, an efficient and scalable mining algorithm is developed. It conducts a depth-first, divide-and-conquer search and prunes unnecessary branches smartly. Our extensive performance study on both synthetic data sets and real data sets shows that maximal pattern-based clustering is effective. It reduces the number of clusters substantially. Moreover, MaPle is more efficient and scalable than the previously proposed pattern-based clustering methods in mining large databases.
Citation:
Jian Pei, Xiaoling Zhang, Moonjung Cho, Haixun Wang, Philip S. Yu, "MaPle: A Fast Algorithm for Maximal Pattern-based Clustering," icdm, pp.259, Third IEEE International Conference on Data Mining (ICDM'03), 2003
Usage of this product signifies your acceptance of the Terms of Use.
