This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2012 IEEE 12th International Conference on Data Mining Workshops
Improved Feature Selection by Incorporating Gene Similarity Into the LASSO
Brussels, Belgium Belgium
December 10-December 10
ISBN: 978-1-4673-5164-5
Personalized medicine is customizing treatments to a patientâs genetic profile, and it has the potential to revolutionize medical practice. An important process used in personalized medicine is gene expression profiling. Analyzing gene expression profiles is difficult, because there are usually few patients and thousands of genes. This leads to the curse of dimensionality. In order to combat this problem, some researchers suggest using prior knowledge to enhance feature selection for supervised learning algorithms. We propose an enhancement to the LASSO, a shrinkage and selection technique that induces parameter sparsity by penalizing a modelâs objective function. Our enhancement gives preference to the selection of genes that are involved in similar biological processes. We expect this to be the case because co-expressed genes are likely to be involved in related pathways. Our modified LASSO selects similar genes by penalizing interaction terms between genes. We devised a coordinate descent algorithm to minimize the corresponding objective function. To evaluate our method, we created simulation data where we compared our model to the standard LASSO model and an interaction LASSO model. Our model outperformed both the standard LASSO and the interaction model in terms of detecting important genes and gene interactions for a reasonable number of training samples. This preliminary study leads us to believe that our method has the potential compete with state of the art methods in gene expression analysis.
Index Terms:
Semantics,Gene expression,Biological system modeling,Mathematical model,Linear programming,Ontologies,Linear regression,Semantic Similarity,Gene Expression,Gene Ontology,LASSO,Regression
Citation:
C.E. Gillies, X. Gao, N.V. Patel, M.R. Siadat, G.D. Wilson, "Improved Feature Selection by Incorporating Gene Similarity Into the LASSO," icdmw, pp.41-48, 2012 IEEE 12th International Conference on Data Mining Workshops, 2012
Usage of this product signifies your acceptance of the Terms of Use.