Improved support vector machine using multiple SVM-RFE for cancer classification

Bibliographic Details
Main Authors: Mohd Hasri, Nurul Nadzirah; Nies, Hui Wen; Chan, Weng Howe; Mohamad, Mohd Saberi; Deris, Safaai; Kasim, Shahreen
Format: Article
Language: English
Published: Insight - Indonesian Society for Knowledge and Human Development 2017
Online Access: http://eprints.uthm.edu.my/3338/1/AJ%202017%20%28479%29.pdf
http://eprints.uthm.edu.my/3338/
Description
Summary: Support Vector Machine (SVM) is a machine learning method widely used in cancer studies, especially with microarray data. A common problem with microarray data is that the number of genes is much larger than the number of samples. Although SVM can handle a large number of genes, better classification accuracy can be obtained with a small subset of genes. This research proposes Multiple Support Vector Machine-Recursive Feature Elimination (MSVM-RFE) as a gene selection method to identify a small number of informative genes. The method is implemented to improve the performance of SVM during classification. Its effectiveness was tested on two gene expression datasets, leukemia and lung cancer, and compared against other methods such as Random Forest and C4.5 Decision Tree. The results show that MSVM-RFE is effective in reducing the number of genes in both datasets, thus providing better accuracy for SVM in cancer classification.
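
The summary does not spell out the selection procedure, so the following is only a minimal sketch of one plausible reading of multiple SVM-RFE: at each round, several linear SVMs are trained on random subsamples, their normalized squared weights are averaged per gene, and the lowest-scoring gene is dropped. Everything here is an assumption made for illustration, not the authors' implementation: the function names `train_linear_svm` and `msvm_rfe`, all parameters, the averaging rule, the one-gene-per-round elimination, and the Pegasos-style subgradient solver that stands in for a full SVM library. The data is a synthetic toy set, not the leukemia or lung cancer data.

```python
import random


def train_linear_svm(X, y, epochs=20, lam=0.01, seed=0):
    """Pegasos-style stochastic subgradient descent for a linear SVM (no bias).

    Stand-in solver (assumption): a real study would use an SVM library.
    """
    rng = random.Random(seed)
    w = [0.0] * len(X[0])
    t = 0
    for _ in range(epochs):
        for i in rng.sample(range(len(X)), len(X)):  # one shuffled pass
            t += 1
            eta = 1.0 / (lam * t)
            margin = y[i] * sum(wj * xj for wj, xj in zip(w, X[i]))
            w = [(1.0 - eta * lam) * wj for wj in w]  # regularization shrink
            if margin < 1.0:  # hinge-loss violation: step toward the sample
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, X[i])]
    return w


def msvm_rfe(X, y, n_keep, n_models=5, subsample=0.8, seed=0):
    """Return the column indices of the n_keep genes that survive elimination."""
    rng = random.Random(seed)
    active = list(range(len(X[0])))  # genes still in play, in sorted order
    n_sub = max(2, int(subsample * len(X)))
    while len(active) > n_keep:
        scores = [0.0] * len(active)
        for _ in range(n_models):
            rows = rng.sample(range(len(X)), n_sub)  # random subsample
            Xs = [[X[i][j] for j in active] for i in rows]
            ys = [y[i] for i in rows]
            w = train_linear_svm(Xs, ys, seed=rng.randrange(10**6))
            norm = sum(v * v for v in w) or 1.0
            for k, v in enumerate(w):
                scores[k] += v * v / norm  # normalized squared weight
        worst = min(range(len(active)), key=lambda k: scores[k])
        del active[worst]  # drop the least informative gene this round
    return active


# Synthetic toy data: 20 samples, 6 "genes"; genes 0 and 3 carry the label.
data_rng = random.Random(42)
y = [1 if i % 2 == 0 else -1 for i in range(20)]
X = []
for label in y:
    row = [data_rng.uniform(-1.0, 1.0) for _ in range(6)]
    row[0] = 3.0 * label + data_rng.uniform(-0.1, 0.1)
    row[3] = -3.0 * label + data_rng.uniform(-0.1, 0.1)
    X.append(row)

selected = msvm_rfe(X, y, n_keep=2)
print(selected)  # [0, 3]
```

On this toy set the two label-carrying genes survive because their weights dominate those of the noise genes in every sub-model. On real microarray data, with thousands of genes, a sketch like this would usually remove genes in batches and choose the final subset size by cross-validation.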