This Article 
 Bibliographic References 
 Add to: 
Multicategory Classification Using An Extreme Learning Machine for Microarray Gene Expression Cancer Diagnosis
July-September 2007 (vol. 4 no. 3)
pp. 485-495
In this paper, the recently developed Extreme Learning Machine (ELM) is used for direct multicategory classification problems in the cancer diagnosis area. ELM avoids problems like local minima, improper learning rate and overfitting commonly faced by iterative learning methods and completes the training very fast. We have evaluated the multi-category classification performance of ELM on three benchmark microarray datasets for cancer diagnosis, namely, the GCM dataset, the Lung dataset and the Lymphoma dataset. The results indicate that ELM produces comparable or better classification accuracies with reduced training time and implementation complexity compared to artificial neural networks methods like conventional back-propagation ANN, Linder's SANN, and Support Vector Machine methods like SVM-OVO and Ramaswamy's SVM-OVA. ELM also achieves better accuracies for classification of individual categories.

[1] T. Li, C. Zhang, and M. Ogihara, “A Comparative Study of Feature Selection and Multiclass Classification Methods for Tissue Classification Based on Gene Expression,” Bioinformatics, vol. 20, no. 15, pp. 2429-2437, 2004.
[2] S. Ramaswamy, P. Tamayo, R. Rifkin, S. Mukherjee, C.-H. Yeang, M. Angelo, C. Ladd, M. Reich, E. Latulippe, J.P. Mesirov, T. Poggio, W. Gerald, M. Loda, E.S. Lander, and T.R. Golub, “Multiclass Cancer Diagnosis Using Tumor Gene Expression Signatures,” Proc. Nat'l Academy Sciences, USA, vol. 98, no. 26, pp.15149-15154, 2002.
[3] A. Statnikov, C.F. Aliferis, I. Tsamardinos, D. Hardin, and S. Levy, “A Comprehensive Evaluation of Multicategory Classification Methods for Microarray Gene Expression Cancer Diagnosis,” Bioinformatics, vol. 21, no. 5, pp. 631-643, 2005.
[4] C.-H. Yeang, S. Ramaswamy, P. Tamayo, S. Mukherjee, R.M. Rifkin, M. Angelo, M. Reich, E. Lander, J. Mesirov, and T. Golub, “Molecular Classification of Multiple Tumor Types,” Bioinformatics, vol. 17, pp. S316-S322, 2001.
[5] R. Linder, D. Dew, H. Sudhoff, D. Theegarten, K. Remberger, S.J. Poppl, and M. Wagner, “The 'Subsequent Artificial Neural Network' (SANN) Approach Might Bring More Classificatory Power to ANN-Based DNA Microarray Analyses,” Bioinformatics, vol. 20, no. 18, pp. 3544-3552, 2004.
[6] M. Ringner, C. Peterson, and J. Khan, “Analyzing Array Data Using Supervised Methods,” Pharmacogenomics, vol. 3, no. 3, pp.403-415, 2002.
[7] J. Khan, J.S. Wei, M. Ringner, L.H. Saal, M. Ladanyi, F. Westermann, F. Berthold, M. Schwab, C.R. Antonescu, C. Peterson, and S. Meltzer, “Classification and Diagnostic Prediction of Cancers Using Gene Expression Profiling and Artificial Neural Networks,” Nature Medicine, vol. 7, no. 6, pp. 673-679, 2001.
[8] J.W. Lee, J.B. Lee, M. Park, and S.H. Song, “An Extensive Comparison of Recent Classification Tools Applied to Microarray Data,” Computational Statistics and Data Analysis, vol. 48, pp. 869-885, 2005.
[9] G.-B. Huang, Q.-Y. Zhu, and C.-K. Siew, “Extreme Learning Machine: A New Learning Scheme of Feedforward Neural Networks,” Proc. Int'l Joint Conf. Neural Networks (IJCNN '04), July 2004.
[10] G.-B. Huang and C.-K. Siew, “Extreme Learning Machine: RBF Network Case,” Proc. Eighth Int'l Conf. Control, Automation, Robotics, and Vision (ICARCV '04), Dec. 2004.
[11] G.-B. Huang and C.-K. Siew, “Extreme Learning Machine with Randomly Assigned RBF Kernels,” Int'l J. Information Technology, vol. 11, no. 1, 2005.
[12] G.-B. Huang, Q.-Y. Zhu, K.Z. Mao, C.-K. Siew, P. Saratchandran, and N. Sundararajan, “Can Threshold Networks Be Trained Directly?” IEEE Trans. Circuits and Systems II, vol. 53, no. 3, pp.187-191, 2006.
[13] M.-B. Li, G.-B. Huang, P. Saratchandran, and N. Sundararajan, “Fully Complex Extreme Learning Machine,” Neurocomputing, vol. 68, pp. 306-314, 2005.
[14] G.-B. Huang, “Learning Capability and Storage Capacity of Two-Hidden-Layer Feedforward Networks,” IEEE Trans. Neural Networks, vol. 14, no. 2, pp. 274-281, 2003.
[15] D. Serre, Matrices: Theory and Applications. Springer-Verlag, 2002.
[16] G.-B. Huang, L. Chen, and C.-K. Siew, “Universal Approximation Using Incremental Constructive Feedforward Networks with Random Hidden Nodes,” IEEE Trans. Neural Networks, vol. 17, no. 4, pp. 879-892, 2006.
[17] S.L. Salzberg, “On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach,” Data Mining and Knowledge Discovery, vol. 1, pp. 317-328, 1997.
[18] S. Dudoit, J. Fridlyand, and T.P. Speed, “Comparison of Discrimination Methods for Classification of Tumors Using Gene Expression Data,” J. Am. Statistical Assoc., vol. 97, no. 457, pp. 77-87, 2002.

Index Terms:
Extreme learning machine, gene expression, microarray, multi-category classification, SVM
Runxuan Zhang, Guang-Bin Huang, N. Sundararajan, P. Saratchandran, "Multicategory Classification Using An Extreme Learning Machine for Microarray Gene Expression Cancer Diagnosis," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 4, no. 3, pp. 485-495, July-Sept. 2007, doi:10.1109/tcbb.2007.1012
Usage of this product signifies your acceptance of the Terms of Use.