Design and Analysis of Ensemble Classifier for Gene Expression Data of Cancer
|Nianfeng Song, Kun Wang, Menglu Xu, Xiaolu Xie, Gan Chen and Ying Wang*|
|Department of Automation, Xiamen University, Xiamen, Fujian, 361005, China|
|Corresponding Author :||Ying Wang, Ph.D.
Department of Automation, Xiamen University
Xiamen, Fujian, 361005, China
E-mail: [email protected]
|Received December 13, 2015; Accepted December 24, 2015; Published December 31, 2015|
|Citation: Song N, Wang K, Xu M, Xie X, Chen G, et al. (2015) Design and Analysis of Ensemble Classifier for Gene Expression Data of Cancer. J Clin Med Genomics 3:134. doi:10.4172/2472-128X.1000134|
|Copyright: © 2015 Song N, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.|
|Related article at Pubmed, Scholar Google|
Gene expression levels are important for disease, such as, Cancer diagnosis. This paper proposed a SVM-based ensemble classifier to classify the control and cancer groups based on gene expression levels from microarray data. A combinational Recursive Feature Elimination in conjunction with the Adaboost algorithm was developed to select significant features and design the proper classifier. The method is applied to microarray data of cancer patients, and the results show improvements on the success rate. By AUC calculation, the SVM-based ensemble classifier shows predominate performance. Furthermore, the characteristics and different effect issues to classification performance is discussed. If a single SVM can obtain satisfactory classification performance, an ensemble SVM is hardly capable to improve it. Otherwise, an ensemble of SVM is superior to the best single SVM. We also investigated the effect of kernel functions, feature selections and type of classifiers on the classification.