loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
A Sparse Learning Machine for High-Dimensional Data with Application to Microarray Gene Analysis
PrePrint
ISSN: 1545-5963
Qiang Cheng, Southern Illinois University, Carbondale
Extracting features from high-dimensional data is a critically important task for pattern recognition and machine learning applications. High-dimensional data typically have much more variables than observations, and contain significant noise, missing components, or outliers. Features extracted from high-dimensional data need to be discriminative, sparse, and can capture essential characteristics of the data. In this paper, we present a way to constructing multivariate features and then classify the data into proper classes. The resulting small subset of features is nearly the best in the sense of Greenshtein's persistence, however, the estimated feature weights may be biased. We take a systematic step to correct the biases. We use conjugate gradient based primal-dual interior-point techniques for large-scale problems. We apply our procedure to microarray gene analysis. The effectiveness of our method is confirmed by experimental results.
Index Terms:
high-dimensional data, feature selection, convex optimization, cancer classification, gene analysis, diagnosis
Citation:
Qiang Cheng, "A Sparse Learning Machine for High-Dimensional Data with Application to Microarray Gene Analysis," IEEE/ACM Transactions on Computational Biology and Bioinformatics, 22 Jan. 2009. IEEE computer Society Digital Library. IEEE Computer Society, <http://doi.ieeecomputersociety.org/10.1109/TCBB.2009.8>
Usage of this product signifies your acceptance of the Terms of Use.