$PEEDR$ and the $CPGS$ clustering algorithm. For each of the proposed algorithms, we develop several pruning techniques to further improve their efficiency. We evaluate the effectiveness and efficiency of our algorithms and pruning methods through comprehensive experiments." /> $PEEDR$ and the $CPGS$ clustering algorithm. For each of the proposed algorithms, we develop several pruning techniques to further improve their efficiency. We evaluate the effectiveness and efficiency of our algorithms and pruning methods through comprehensive experiments." /> $PEEDR$ and the $CPGS$ clustering algorithm. For each of the proposed algorithms, we develop several pruning techniques to further improve their efficiency. We evaluate the effectiveness and efficiency of our algorithms and pruning methods through comprehensive experiments." /> Effective and Efficient Clustering Methods for Correlated Probabilistic Graphs
The Community for Technology Leaders
Subscribe
Issue No.05 - May (2014 vol.26)
pp: 1117-1130
Yu Gu , Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
Chunpeng Gao , Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
Gao Cong , Nanyang Technol. Univ., Singapore, Singapore
Ge Yu , Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
ABSTRACT
Recently, probabilistic graphs have attracted significant interests of the data mining community. It is observed that correlations may exist among adjacent edges in various probabilistic graphs. As one of the basic mining techniques, graph clustering is widely used in exploratory data analysis, such as data compression, information retrieval, image segmentation, etc. Graph clustering aims to divide data into clusters according to their similarities, and a number of algorithms have been proposed for clustering graphs, such as the pKwikCluster algorithm, spectral clustering, k-path clustering, etc. However, little research has been performed to develop efficient clustering algorithms for probabilistic graphs. Particularly, it becomes more challenging to efficiently cluster probabilistic graphs when correlations are considered. In this paper, we define the problem of clustering correlated probabilistic graphs. To solve the challenging problem, we propose two algorithms, namely the PEEDR and the CPGS clustering algorithm. For each of the proposed algorithms, we develop several pruning techniques to further improve their efficiency. We evaluate the effectiveness and efficiency of our algorithms and pruning methods through comprehensive experiments.
INDEX TERMS
pattern clustering, correlation theory, data analysis, data mining, graph theory,pruning techniques, efficient clustering method, effective clustering method, correlated probabilistic graph clustering, data mining, adjacent edges, exploratory data analysis, CPGS clustering algorithm, PEEDR clustering algorithm,Probabilistic logic, Clustering algorithms, Correlation, Algorithm design and analysis, Probability, Joints, Data mining,Data mining, Clustering, classification, and association rules,algorithm, Clustering, correlated, probabilistic graphs
CITATION
Yu Gu, Chunpeng Gao, Gao Cong, Ge Yu, "Effective and Efficient Clustering Methods for Correlated Probabilistic Graphs", IEEE Transactions on Knowledge & Data Engineering, vol.26, no. 5, pp. 1117-1130, May 2014, doi:10.1109/TKDE.2013.123