Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06)
Global Biclustering of Microarray Data
Hong Kong, China
December 18-December 22
ISBN: 0-7695-2702-7
We consider the problem of simultaneously clustering genes and conditions of a gene expression data matrix. A bicluster is defined as a subset of genes that show similar behavior within a subset of conditions. Finding biclusters can be useful for revealing groups of genes involved in the same molecular process as well as groups of conditions where this process takes place. Previous work either deals with local, bicluster-based criteria or assumes a very specific structure of the data matrix (e.g. checkerboard or block-diagonal) [11]. In contrast, our goal is to find a set of flexibly arranged biclusters which is optimal in regard to a global objective function. As this is a NP-hard combinatorial problem, we describe several techniques to obtain approximate solutions. We benchmarked our approach successfully on the Alizadeh B-cell lymphoma data set [1].
Citation:
Thomas Wolf, Benedikt Brors, Thomas Hofmann, Elisabeth Georgii, "Global Biclustering of Microarray Data," icdmw, pp.125-129, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06), 2006