
This Article  
 
Share  
Bibliographic References  
Add to:  
Digg Furl Spurl Blink Simpy Del.icio.us Y!MyWeb  
Search  
 
ASCII Text  x  
Huidong Jin, ManLeung Wong, K.S. Leung, "Scalable ModelBased Clustering for Large Databases Based on Data Summarization," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 11, pp. 17101719, November, 2005.  
BibTex  x  
@article{ 10.1109/TPAMI.2005.226, author = {Huidong Jin and ManLeung Wong and K.S. Leung}, title = {Scalable ModelBased Clustering for Large Databases Based on Data Summarization}, journal ={IEEE Transactions on Pattern Analysis and Machine Intelligence}, volume = {27}, number = {11}, issn = {01628828}, year = {2005}, pages = {17101719}, doi = {http://doi.ieeecomputersociety.org/10.1109/TPAMI.2005.226}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, }  
RefWorks Procite/RefMan/Endnote  x  
TY  JOUR JO  IEEE Transactions on Pattern Analysis and Machine Intelligence TI  Scalable ModelBased Clustering for Large Databases Based on Data Summarization IS  11 SN  01628828 SP1710 EP1719 EPD  17101719 A1  Huidong Jin, A1  ManLeung Wong, A1  K.S. Leung, PY  2005 KW  Index Terms Scalable clustering KW  Gaussian mixture model KW  expectationmaximization KW  data summary KW  maximum penalized likelihood estimate. VL  27 JA  IEEE Transactions on Pattern Analysis and Machine Intelligence ER   
[1] J. Han and M. Kamber, Data Mining: Concepts and Techniques. San Francisco: Morgan Kaufmann, 2001.
[2] P. Bradley, U. Fayyad, and C. Reina, “Clustering Very Large Databases Using EM Mixture Models,” Proc. 15th Int'l Conf. Pattern Recognition, vol. 2, pp. 7680, 2000.
[3] V. Ganti, J. Gehrke, and R. Ramakrishnan, “Mining Very Large Databases,” Computer, vol. 32, no. 8, pp. 3845, Aug. 1999.
[4] T. Zhang, R. Ramakrishnan, and M. Livny, “BIRCH: A New Data Clustering Algorithm and Its Applications,” Data Mining and Knowledge Discovery, vol. 1, no. 2, pp. 141182, 1997.
[5] H.D. Jin, M.L. Wong, and K.S. Leung, “Scalable ModelBased Clustering by Working on Data Summaries,” Proc. Third IEEE Int'l Conf. Data Mining, pp. 9198, Nov. 2003.
[6] B. Thiesson, C. Meek, and D. Heckerman, “Accelerating EM for Large Databases,” Machine Learning, vol. 45, pp. 279299, 2001.
[7] A. Moore, “Very Fast EMBased Mixture Model Clustering Using Multiresolution KDTrees,” Advances in Neural Information Processing Systems 11, pp. 543549, 1999.
[8] C. Palmer and C. Faloutsos, “Density Biased Sampling: An Improved Method for Data Mining and Clustering,” Proc. 2000 ACM SIGMOD, pp. 8292, 2000.
[9] M. Meila and D. Heckerman, “An Experimental Comparison of ModelBased Clustering Methods,” Machine Learning, vol. 42, no. 1/2, pp. 929, 2001.
[10] H.D. Jin, “Scalable ModelBased Clustering Algorithms for Large Databases and Their Applications,” PhD thesis, The Chinese Univ. of Hong Kong, Hong Kong, Aug. 2002, see errata, codes, and data at http://www.cmis.csiro.au/Warren.JinPhDthesisWork.htm .
[11] P.A. Pantel, “Clustering by Committee,” PhD dissertation, Univ. of Alberta, Canada, 2003.
[12] M. Figueiredo and A.K. Jain, “Unsupervised Learning of Finite Mixture Models,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, no. 3, pp. 381396, Mar. 2002.
[13] S. Wang, D. Schuurmans, F. Peng, and Y. Zhao, “Learning Mixture Models with the Latent Maximum Entropy Principle,” Proc. 20th Int'l Conf. Machine Learning, pp. 784791, 2003.
[14] A. Dempster, N. Laird, and D. Rubin, “MaximumLikelihood from Incomplete Data via the EM Algorithm,” J. Royal Statistical Soc. Series B, vol. 39, pp. 138, 1977.
[15] H.D. Jin, K.S. Leung, M.L. Wong, and Z.B. Xu, “Scalable ModelBased Cluster Analysis Using Clustering Features,” Pattern Recognition, vol. 38, no. 5, pp. 637649, May 2005.
[16] G. McLachlan and T. Krishnan, The EM Algorithm and Extensions. New York: John Wiley & Sons, Inc., 1997.
[17] P. Cheeseman and J. Stutz, “Bayesian Classification (AutoClass): Theory and Results,” Advances in Knowledge Discovery and Data Mining, U. Fayyad et al., eds., pp. 153180, 1996.
[18] B.J. Frey and N. Jojic, “TransformationInvariant Clustering Using the EM Algorithm,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 25, no. 1, pp. 117, Jan. 2003.
[19] C. Fraley, “Algorithms for ModelBased Gaussian Hierarchical Clustering,” SIAM J. Scientific Computing, vol. 20, no. 1, pp. 270281, Jan. 1999.
[20] J. Shanmugasundaram, U. Fayyad, and P. Bradley, “Compressed Data Cubes for OLAP Aggregate Query Approximation on Continuous Dimensions,” Proc. Fifth ACM SIGKDD, pp. 223232, 1999.