CSDL Home IEEE/ACM Transactions on Computational Biology and Bioinformatics 2012 vol.9 Issue No.03 - May-June

Subscribe

Issue No.03 - May-June (2012 vol.9)

pp: 924-933

Debarka Sengupta , Indian Statistical Institute, Kolkata

Ujjwal Maulik , Jadavpur University, Kolkata

Sanghamitra Bandyopadhyay , Indian Statistical Institute, Kolkata

DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2012.28

ABSTRACT

The scope and effectiveness of Rank Aggregation (RA) have already been established in contemporary bioinformatics research. Rank aggregation helps in meta-analysis of putative results collected from different analytic or experimental sources. For example, we often receive considerably differing ranked lists of genes or microRNAs from various target prediction algorithms or microarray studies. Sometimes combining them all, in some sense, yields more effective ordering of the set of objects. Also, assigning a certain level of confidence to each source of ranking is a natural demand of aggregation. Assignment of weights to the sources of orderings can be performed by experts. Several rank aggregation approaches like those based on Markov Chains (MCs), evolutionary algorithms, etc., exist in the literature. Markov chains, in general, are faster than the evolutionary approaches. Unlike the evolutionary computing approaches Markov chains have not been used for weighted aggregation scenarios. This is because of the absence of a formal framework of Weighted Markov Chain (WMC). In this paper, we propose the use of a modified version of MC4 (one of the Markov chains proposed by Dwork et al., 2001), followed by the weighted analog of local Kemenization for performing rank aggregation, where the sources of rankings can be prioritized by an expert. Effectiveness of the weighted Markov chain approach over the very recently proposed Genetic Algorithm (GA) and Cross-Entropy Monte Carlo (MC) algorithm-based techniques, has been established for gene orderings from microarray analysis and orderings of predicted microRNA targets.

INDEX TERMS

Rank aggregation, Markov chain, Kendall's tau, microRNA, genes, ordering.

CITATION

Debarka Sengupta, Ujjwal Maulik, Sanghamitra Bandyopadhyay, "Weighted Markov Chain Based Aggregation of Biomolecule Orderings",

*IEEE/ACM Transactions on Computational Biology and Bioinformatics*, vol.9, no. 3, pp. 924-933, May-June 2012, doi:10.1109/TCBB.2012.28REFERENCES

- [1] S.S. Gupta, "On Some Multiple Decision (Selection and Ranking) Rules,"
Technometrics, vol. 7, pp. 225-245, 1965.- [2] A.D. Grazia, "Mathematical Derivation of an Election System,"
Isis, vol. 44, pp. 42-51, 1953.- [3] C. Dwork, R. Kumar, and D. Sivakumar, "Rank Aggregation Methods for the Web,"
Proc. 10th Int'l World Wide Web Conf., pp. 613-620, 2001.- [4] R. DeConde, S. Hawley, S. Falcon, N. Clegg, B. Knudsen, and R. Etzioni, "Combining Results of Microarray Experiments: A Rank Aggregation Approach,"
Statistical Applications in Genetics Molecular Biology, vol. 5, no. 1, pp. 1-17, 2006.- [5] V. Pihur, S. Datta, and S. Datta, "Finding Cancer Genes Through Meta-Analysis of Microarray Experiments: Rank Aggregation via the Cross Entropy Algorithm,"
Genomics, vol. 92, pp. 400-403, 2008.- [6] S. Lin and J. Ding, "Integration of Ranked Lists via Cross Entropy Monte Carlo with Applications to mRNA and microRNA Studies,"
Biometrics, vol. 65, no. 1, pp. 9-18, 2009.- [7] S. Bandyopadhyay, R. Mitra, U. Maulik, and M.Q. Zhang, "Development of the Human Cancer Microrna Network,"
Silence, vol. 1, no. 6, 2010, doi:10.1186/1758-907X-1-6. - [8] V. Pihur, S. Datta, and S. Datta, "Weighted Rank Aggregation of Cluster Validation Measures: A Monte Carlo Cross-Entropy Approach,"
Bioinformatics, vol. 23, no. 13, pp. 1607-1615, 2007.- [9] D. Sengupta, B. Sanghamitra, and U. Maulik, "A Novel Measure for Evaluating an Ordered List: Application in microRNA Target Prediction,"
Proc. Int'l Symp. Biocomputing, 2010.- [10] J. Zhou, S. Lin, V. Melfi, and J. Verducci, "Composite MicroRNA Target Predictions and Comparisons of Several Prediction Algorithms," http://chagall.med.cornell.edu/BioinfoCourse/ PDFs/Lec-ture1Zhou2006.pdf, 2012.
- [11] V. Pihur, S. Datta, and S. Datta, "RankAggreg, an R Package for Weighted Rank Aggregation,"
BMC Bioinformatics, vol. 10, article 62, 2009, doi:10.1186/1471-2105-10-62. - [12] R. Fagin, R. Kumar, and D. Sivakumar, "Comparing Top k Lists,"
SIAM J. Discrete Math., vol. 17, no. 1, pp. 134-160, 2003.- [13] H.P. Young, "Condorcet's Theory of Voting,"
Am. Political Science Review, vol. 82, no. 4, pp. 1231-1244, 1988.- [14] H.P. Young and A. Levenglick, "A Consistent Extension of Condorcet's Election Principle,"
SIAM J. Applied Math., vol. 35, no. 2, pp. 285-300, 1978.- [15] M. Truchon, "An Extension of the Condorcet Criterion and Kemeny Orders," http://www.ecn.ulaval.ca/w3/recherche/cahiers/ 19989813.pdf, 1998.
- [16] "Rank Aggregation Methods for the Web," www.cs.uc.edu/annexste/Courses/cs728-2008 Lecture11.ppt, 2012.
- [17] S.M. Dhanasekaran, T.R. Barrette, D. Ghosh, R. Shah, S. Varambally, K. Kurachi, K.J. Pienta, M.S. Rubin, and A.M. Chinnaiyan, "Delineation of Prognostic Biomarkers in Prostate Cancer,"
Nature, vol. 412, pp. 822-826, 2001.- [18] J. Luo, D.J. Duggan, Y. Chen, J. Sauvageot, M. Ewing, M.L. Bittner, J.M. Trent, and W.B. Isaacs, "Human Prostate Cancer and Benign Prostatic Hyperplasia: Molecular Dissection by Gene Expression Profiling,"
Cancer Research, vol. 61, pp. 4683-4688, 2001.- [19] L. True, I. Coleman, S. Hawley, A. Huang, D. Gifford, R. Coleman, T. Beer, E. Gelman, M. Datta, E. Mostaghel, B. Knudsen, P. Lange, R. Vessella, D. Lin, L. Hood, and P. Nelson, "A Molecular Correlate to the Gleason Grading System for Prostate Adenocarcinoma,"
Proc. Nat'l Academy of Sciences USA (PNAS), vol. 103, no. 29, pp. 10991-10996, 2006.- [20] J.B. Welsh, l.M. Sapinoso, A.I. Su, S.G. Kern, J. Wang-Rodriguez, C.A. Moskaluk, H.F. Frierson, and G.M. Hampton, "Analysis of Gene Expression Identifies Candidate Markers and Pharmacological Targets in Prostate Cancer,"
Cancer Research, vol. 61, pp. 5974-5978, 2001.- [21] D. Singh, P.G. Febbo, K. Ross, D.G. Jackson, J. Manola, C. Ladd, P. Tamayo, A.A. Renshaw, A.V. Damico, J.P. Richie, E.S. Lander, M. Loda, P.W. Kantoff, T.E. Golub, and W.R. Sellers, "Gene Expression Correlates of Clinical Prostate Cancer Behavior,"
Cancer Cell, vol. 1, no. 2, pp. 203-209, 2002.- [22] A. Krek, D. Grun, M.N. Poy, R. Wolf, L. Rosenberg, E.J. Epstein, P. MacMenamin, I. daPiedade, K. C, M. Stoffel, and N. Rajewsky, "Combinatorial MicroRNA Target Predictions,"
Nature Genetics, vol. 37, pp. 495-500, 2005.- [23] B.P. Lewis, I.H. Shih, M.W. Jones-Rhoades, D.P. Bartel, and C.B. Burge, "Prediction of Mammalian MicroRNA Targets,"
Cell, vol. 115, no. 7, pp. 787-798, 2003.- [24] B. John, A.J. Enright, A. Aravin, T. Tuschl, C. Sander, and D.S. Marks, "Human MicroRNA Targets,"
PLOS Biology, vol. 2, no. 11, p. e363, 2004.- [25] M. Kertesz, N. Iovino, U. Unnerstall, U. Gaul, and E. Segal, "The Role of Site Accessibility in MicroRNA Target Recognition,"
Nature Genetics, vol. 39, no. 10, pp. 1278-1284, 2007.- [26] M. Rehmsmeier, P. Steffen, M. Hchsmann, and R. Giegerich, "Fast and Effective Prediction of MicroRNA/Target Duplexes,"
RNA, vol. 10, pp. 1507-1517, 2004.- [27] F. Xiao, Z. Zuo, G. Cai, S. Kang, X. Gao, and T. Li, "MiRecords: An Integrated Resource for MicroRNA-target Interactions,"
Nucleic Acids Research, vol. 37, pp. D105-D110, 2009.- [28] P. Sethupathy, M. Megraw, and A.G. Hatzigeorgiou, "A Guide Through Present Computational Approaches for the Identification of Mammalian MicroRNA Targets,"
Nature Methods, vol. 3, pp. 881-886, 2006.- [29] D.P. Bartel, "microRNAs: Genomics, Biogenesis, Mechanism, and Function,"
Cell, vol. 116, no. 2, pp. 281-297, 2004.- [30] J.A. Chan, A.M. Krichevsky, and K.S. Kosik, "MicroRNA-21 is an Antiapoptotic Factor in Human Glioblastoma Cells,"
Cancer Research, vol. 65, no. 14, pp. 6029-6033, 2005.- [31] C.Z. Chen, L. Li, H.F. Lodish, and D.P. Bartel, "MicroRNAs Modulate Hematopoietic Lineage Differentiation,"
Science, vol. 303, no. 5954, pp. 83-86, 2003.- [32] A. Esquela-Kerscher and F.J. Slack, "Oncomirs—MicroRNAs with a Role in Cancer,"
Nature Rev. Cancer, vol. 6, no. 4, pp. 259-269, 2006.- [33] S. Bandyopadhyay and R. Mitra, "Targetminer: MicroRNA Target Prediction with Systematic Identification of Tissue Specific Negative Examples,"
Bioinformatics, vol. 25, no. 20, pp. 2625-2631, 2009.- [34] M. Maragkakis, M. Reczko, V.A. Simossis, P. Alexiou, G.L. Papadopoulos, T. Dalamagas, G. Giannopoulos, G. Goumas, E. Koukis, K. Kourtis, T. Vergoulis, N. Koziris, T. Sellis, P. Tsanakas, and A.G. Hatzigeorgiou, "Diana-microT Web Server: Elucidating microRNA Functions Through Target Prediction,"
Nucleic Acids Research, vol. 37, pp. W273-W276, 2009.- [35] X. Wang and I.M.E. Naqa, "Prediction of Both Conserved and Nonconserved MicroRNA Targets in Animals,"
Bioinfromatics, vol. 24, no. 3, pp. 325-332, 2008.- [36] D. Long, R. Lee, P. Williams, C.Y. Chan, V. Ambros, and Y. Ding, "Potent Effect of Target Structure on MicroRNA Function,"
Nature Structural and Molecular Biology, vol. 14, no. 4, pp. 287-294, 2007.- [37] A. Stark, J. Brennecke, R.B. Russell, and S.M. Cohen, "Identification of Drosophila MicroRNA Targets,"
Bioinformatics, vol. 1, no. 3,E60, 2003, doi:10.1371/journal.pbio.0000060. - [38] H. Robins, Y. Li, and R.W. Padgett, "Incorporating Structure to Predict MicroRNA Targets,"
Proc. Nat'l Academy of Sciences USA, vol. 102, no. 11, pp. 4006-4009, 2005. |