Issue No. 11 - November (2009 vol. 31)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPAMI.2009.67
Kevin M. Carter , Information Systems Technology Group, Lexington
Raviv Raich , Oregon State University, Corvallis
William G. Finn , University of Michigan, Ann Arbor
Alfred O. Hero III , University of Michigan, Ann Arbor
We consider the problems of clustering, classification, and visualization of high-dimensional data when no straightforward euclidean representation exists. In this paper, we propose using the properties of information geometry and statistical manifolds in order to define similarities between data sets using the Fisher information distance. We will show that this metric can be approximated using entirely nonparametric methods, as the parameterization and geometry of the manifold is generally unknown. Furthermore, by using multidimensional scaling methods, we are able to reconstruct the statistical manifold in a low-dimensional euclidean space; enabling effective learning on the data. As a whole, we refer to our framework as Fisher Information Nonparametric Embedding (FINE) and illustrate its uses on practical problems, including a biomedical application and document classification.
Information geometry, statistical manifold, dimensionality reduction, multidimensional scaling.
A. O. Hero III, R. Raich, W. G. Finn and K. M. Carter, "FINE: Fisher Information Nonparametric Embedding," in IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 31, no. , pp. 2093-2098, 2009.