This Article 
 Bibliographic References 
 Add to: 
Combined Feature Selection and Cancer Prognosis Using Support Vector Machine Regression
November/December 2011 (vol. 8 no. 6)
pp. 1671-1677
Bing-Yu Sun, Chinese Academy of Sciences, Hefei
Zhi-Hua Zhu, Sun Yat-sen University, Guangzhou
Jiuyong Li, University of South Australia, Adelaide
Bin Linghu, Chinese Academy of Sciences, Hefei
Prognostic prediction is important in medical domain, because it can be used to select an appropriate treatment for a patient by predicting the patient's clinical outcomes. For high-dimensional data, a normal prognostic method undergoes two steps: feature selection and prognosis analysis. Recently, the L_1\hbox{-}L_2-norm Support Vector Machine (L_1\hbox{-}L_2 SVM) has been developed as an effective classification technique and shown good classification performance with automatic feature selection. In this paper, we extend L_1\hbox{-}L_2 SVM for regression analysis with automatic feature selection. We further improve the L_1\hbox{-}L_2 SVM for prognostic prediction by utilizing the information of censored data as constraints. We design an efficient solution to the new optimization problem. The proposed method is compared with other seven prognostic prediction methods on three real-world data sets. The experimental results show that the proposed method performs consistently better than the medium performance. It is more efficient than other algorithms with the similar performance.

[1] R. Xu, X. Cai, and D.C. Wunsch II, “Gene Expression Data for DLBCL Cancer Survival Prediction with a Combination of Machine Learning Technologies,” Proc. 27th Ann. Int'l Conf. IEEE Eng. in Medicine and Biology, pp. 894-897, 2005.
[2] D.K. Tasoulis, P. Spyridonos, N.G. Pavlidis, V.P. Plagianakos, P. Ravazoula, G. Nikiforidis, and M.N. Vrahatis, “Cell-Nuclear Data Reduction and Prognostic Model Selection in Bladder Tumor Recurrence,” Artificial Intelligence in Medicine, vol. 38, no. 3, pp. 291-303, 2006.
[3] J.M. Jerez-Aragones, J.A. Gomez-Ruiza, G. Ramos-Jimenez, J. Munoz-Perez, and E. Alba-Conejob, “A Combined Neural Network and Decision Trees Model for Prognosis of Breast Cancer Relapse,” Artificial Intelligence in Medicine, vol. 27, pp. 45-63, 2003.
[4] O.L. Mangasarian, W.N. Street, and W.H. Wolberg, “Breast Cancer Diagnosis and Prognosis via Linear Programming,” Operations Research, vol. 43, no. 4, pp. 570-577, July 1995.
[5] T.H. Falk, H. Shatkay, and W.-Y. Chan, “Breast Cancer Prognosis via Gaussian Mixture Regression,” Proc. Canadian Conf. Electrical and Computer Eng. (CCECE '06), pp. 987-990, 2006.
[6] N. Bagotskaya, I. Lossev, N. Lossea, and M. Parakhin, “Prediction of Time to Event for Censord Data: Ridge Regression with Linear Constraints in Kernel Space,” Proc. IEEE Int'l Joint Conf. Neural Netwroks (IJCNN '05), pp. 1033-1038, 2005.
[7] P.K. Shivaswamy, W. Chu, and M. Jansche, “A Support Vector Approach to Censored Targets,” Proc. Seventh IEEE Int'l Conf. Data Mining (ICDM '07), pp. 655-660, 2007.
[8] J. Fan and R. Li, “Variable Selection via Nonconcave Penalized Likelihood and Its Oracle Properties,” J. Am. Statistical Assoc., vol. 96, no. 456, pp. 1348-1360, 2001.
[9] B.A. Johnson, “On Lasso for Censored Data,” Electronic J. Statistics, vol. 3, pp. 485-506, 2009.
[10] B.A. Johnson, D.Y. Lin, and D. Zeng, “Penalized Estimating Functions and Variable Selection in Semiparametric Regression Models,” J. Am. Statistical Assoc., vol. 103, pp. 672-680, 2008.
[11] R.L. Strawderman, “The Accelerated Gap Times Model,” Biometrika, vol. 92, pp. 647-666, 2005.
[12] C. Heuchenne and I.V. Keilegom, “Polynomial Regression with Censored Data Based on Preliminary Nonparametric Estimation,” Annals of the Inst. of Statistical Math., vol. 58, no. 3, pp. 273-297, 2007.
[13] V. Vapnik, The Nature of Statistical Learning Theory. Wiley, 1998.
[14] I. Guyon, J. Weston, S. Barnhill, and V. Vapnik, “Gene Selection for Cancer Classification Using Support Vector Machines,” Machine Learning, vol. 46, nos. 1-3, pp. 389-422, 2002.
[15] J. Neumann, C. Schnorr, and G. Steidl, “Combined SVM-Based Feature Selection and Classification,” Machine Learning, vol. 61, nos. 1-3, pp. 129-150, 2005.
[16] L. Wang, J. Zhu, and H. Zou, “Hybrid Huberized Support Vector Machines for Microarray Classification and Gene Selection,” Bioinformatics, vol. 24, no. 3, pp. 412-419, 2008.
[17] P. Bradley and O. Mangasarian, “Feature Selection via Concave Minimization and Support Vector Machines,” Proc. 15th Int'l Conf. Machine Learning (ICML '98), pp. 82-90, 1998.
[18] J. Zhu, S. Rosset, T. Hastie, and R. Tibshirani, “1-Norm Support Vector Machines,” Proc. 17th Ann. Conf. Neural Information Processing Systems (NIPS '03), pp. 49-57, 2003.
[19] H. Zou and T. Hastie, “Regularization and Variable Selection via the Elastic Net,” J. Royal Statistical Soc. B, vol. 67, pp. 301-20, 2005.
[20] W.N. Street, “Cancer Diagnosis and Prognosis via Linearprogramming Based Machine Learning,” PhD dissertation, Univ. of Wisconsin, 1994.
[21] A. Rosenwald, G. Wright, W.C. Chan, J.M. Connors, E. Campo, R.I. Fisher, R.D. Gascoyne, H.K. Muller-Hermelink, E.B. Smeland, and L.M. Staudt, “The Use of Molecular Profiling to Predict Survival after Chemotherapy for Diffuse Large-B-Cell Lymphoma,” New England J. Medicine, vol. 346, no. 25, pp. 1937-1947, 2002.
[22] J. Kononen, L. Bubendorf, A. Kallionimeni, M. Barlund, P. Schraml, S. Leighton, J. Torhorst, M.J. Mihatsch, G. Sauter, and O.-P. Kallionimeni1, “Tissue Microarrays for High-Throughput Molecular Profiling of Tumor Specimens,” Nature Medicine, vol. 4, no. 7, pp. 844-847, 2005.
[23] H.H. Zhang and W. Lu, “Adaptive Lasso for Cox's Proportional Hazards Model,” Biometrika, vol. 94, no. 3, pp. 691-703, 2007.

Index Terms:
Prognostic prediction, support vector machine, censored data, feature selection.
Bing-Yu Sun, Zhi-Hua Zhu, Jiuyong Li, Bin Linghu, "Combined Feature Selection and Cancer Prognosis Using Support Vector Machine Regression," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 8, no. 6, pp. 1671-1677, Nov.-Dec. 2011, doi:10.1109/TCBB.2010.119
Usage of this product signifies your acceptance of the Terms of Use.