Issue No. 06 - Nov.-Dec. (2013 vol. 10)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2013.28
Chanchala D. Kaddi , Dept. of Biomed. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
R. Mitchell Parry , Dept. of Comput. Sci., Appalachian State Univ., Boone, NC, USA
May D. Wang , Dept. of Biomed. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
We propose a similarity measure based on the multivariate hypergeometric distribution for the pairwise comparison of images and data vectors. The formulation and performance of the proposed measure are compared with other similarity measures using synthetic data. A method of piecewise approximation is also implemented to facilitate application of the proposed measure to large samples. Example applications of the proposed similarity measure are presented using mass spectrometry imaging data and gene expression microarray data. Results from synthetic and biological data indicate that the proposed measure is capable of providing meaningful discrimination between samples, and that it can be a useful tool for identifying potentially related samples in large-scale biological data sets.
Biomedical measurement, Approximation methods, Gene expression, Bioinformatics, Diseases,chemistry, Similarity measures, contingency tables, multivariate statistics, biology and genetics
Chanchala D. Kaddi, R. Mitchell Parry, May D. Wang, "Multivariate Hypergeometric Similarity Measure", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 10, no. , pp. 1505-1516, Nov.-Dec. 2013, doi:10.1109/TCBB.2013.28