Fifth IEEE Symposium on Bioinformatics and Bioengineering (BIBE'05)
On Clustering Biological Data Using Unsupervised and Semi-Supervised Message Passing
Minneapolis, Minnesota
October 19-October 21
ISBN: 0-7695-2476-1
Noticing that unsupervised clustering may produce clusters that are irrelevant to the research hypotheses and interests, we generalize traditional unsupervised clustering into semi-supervised clustering based on our previously proposed Message Passing Clustering (MPC). In the semi-supervised MPC, prior knowledge such as instance-level and attribute-level constraints are used to guide the clustering process towards better and interpretable partitions. We applied the unsupervised MPC (null background) to phylogenetic analysis of Mycobacterium and the semi-supervised MPC to colon cancer microarray data analysis. The results show that MPC is superior to the widely accepted neighbor-joining and hierarchical clustering methods, and the semi-supervised MPC is even more powerful in biological data analysis such as gene selection and cancer diagnosis using microarray.
Citation:
Huimin Geng, Xutao Deng, Dhundy Bastola, Hesham Ali, "On Clustering Biological Data Using Unsupervised and Semi-Supervised Message Passing," bibe, pp.294-298, Fifth IEEE Symposium on Bioinformatics and Bioengineering (BIBE'05), 2005