Issue No. 01 - January/February (2012 vol. 9)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2011.58
Yang Xiang , Dept. of Biomed. Inf., Ohio State Univ., Columbus, OH, USA
P. R. O. Payne , Dept. of Biomed. Inf., Ohio State Univ., Columbus, OH, USA
Kun Huang , Dept. of Biomed. Inf., Ohio State Univ., Columbus, OH, USA
Binary (0,1) matrices, commonly known as transactional databases, can represent many application data, including gene-phenotype data where "1” represents a confirmed gene-phenotype relation and "0” represents an unknown relation. It is natural to ask what information is hidden behind these "0”s and "1”s. Unfortunately, recent matrix completion methods, though very effective in many cases, are less likely to infer something interesting from these (0,1)-matrices. To answer this challenge, we propose Ind Evi, a very succinct and effective algorithm to perform independent-evidence-based transactional database transformation. Each entry of a (0,1)-matrix is evaluated by "independent evidence” (maximal supporting patterns) extracted from the whole matrix for this entry. The value of an entry, regardless of its value as 0 or 1, has completely no effect for its independent evidence. The experiment on a gene-phenotype database shows that our method is highly promising in ranking candidate genes and predicting unknown disease genes.
Diseases, Bipartite graph, Itemsets, Bioinformatics, Feature extraction, Data mining
Yang Xiang, P. R. Payne and Kun Huang, "Transactional Database Transformation and Its Application in Prioritizing Human Disease Genes," in IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 9, no. 1, pp. 294-304, 2011.