Pathway-based analysis with support vector machine (SVM-LASSO) for gene selection and classification

Genomic knowledge has become a popular research field in bioinformatics biological process that providing further biological process information. Many methods have been done to address the issues of high data throughput due to increased use of microarray technology. However, it is still not able to...

Full description

Saved in:
Bibliographic Details
Main Authors: Nasrudin, Nurul Athirah, Weng, Howe Chan, Mohamad, Mohd. Saberi, Deris, Safaai, Napis, Suhaimi, Kasim, Shahreen
Format: Article
Published: Insight Society 2017
Subjects:
Online Access:http://eprints.utm.my/id/eprint/81261/
http://dx.doi.org/10.18517/ijaseit.7.4-2.3397
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.81261
record_format eprints
spelling my.utm.812612019-07-23T08:55:42Z http://eprints.utm.my/id/eprint/81261/ Pathway-based analysis with support vector machine (SVM-LASSO) for gene selection and classification Nasrudin, Nurul Athirah Weng, Howe Chan Mohamad, Mohd. Saberi Deris, Safaai Napis, Suhaimi Kasim, Shahreen TJ Mechanical engineering and machinery Genomic knowledge has become a popular research field in bioinformatics biological process that providing further biological process information. Many methods have been done to address the issues of high data throughput due to increased use of microarray technology. However, it is still not able to determine the appropriate diseases accurately. This is because of existing non-informative genes that could be included in the analysis of context specific data like cancer gene expression data, which affect the classification performance. This study proposed a pathway-based analysis for gene classification. Pathway-based analysis enable handling microarray data in order to improved biological interpretation of the analysis outcome. Secondly, Support Vector Machine with Least Absolute Shrinkage and Selection Operator algorithm (SVM-LASSO) is proposed, which to find informative genes for each pathway to ensure efficient gene selection and classification in every pathway. Experiments are done using lung cancer dataset and breast cancer dataset that widely used in cancer classification area. A stratified 10-fold cross validation is implement to evaluate the performance of the proposed method in terms of accuracy, specificity and sensitivity. Moreover, biological validation have been done on the selected genes based on biological literatures and biological databases. Next, the results from the proposed methods are compared with the previous study throughout all the data sets in terms of performance. As conclusion, this research finding can contribute in biology area especially in cancer classification area. Insight Society 2017 Article PeerReviewed Nasrudin, Nurul Athirah and Weng, Howe Chan and Mohamad, Mohd. Saberi and Deris, Safaai and Napis, Suhaimi and Kasim, Shahreen (2017) Pathway-based analysis with support vector machine (SVM-LASSO) for gene selection and classification. International Journal on Advanced Science, Engineering and Information Technology, 7 (4-2). pp. 1609-1614. ISSN 2088-5334 http://dx.doi.org/10.18517/ijaseit.7.4-2.3397 DOI:10.18517/ijaseit.7.4-2.3397
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic TJ Mechanical engineering and machinery
spellingShingle TJ Mechanical engineering and machinery
Nasrudin, Nurul Athirah
Weng, Howe Chan
Mohamad, Mohd. Saberi
Deris, Safaai
Napis, Suhaimi
Kasim, Shahreen
Pathway-based analysis with support vector machine (SVM-LASSO) for gene selection and classification
description Genomic knowledge has become a popular research field in bioinformatics biological process that providing further biological process information. Many methods have been done to address the issues of high data throughput due to increased use of microarray technology. However, it is still not able to determine the appropriate diseases accurately. This is because of existing non-informative genes that could be included in the analysis of context specific data like cancer gene expression data, which affect the classification performance. This study proposed a pathway-based analysis for gene classification. Pathway-based analysis enable handling microarray data in order to improved biological interpretation of the analysis outcome. Secondly, Support Vector Machine with Least Absolute Shrinkage and Selection Operator algorithm (SVM-LASSO) is proposed, which to find informative genes for each pathway to ensure efficient gene selection and classification in every pathway. Experiments are done using lung cancer dataset and breast cancer dataset that widely used in cancer classification area. A stratified 10-fold cross validation is implement to evaluate the performance of the proposed method in terms of accuracy, specificity and sensitivity. Moreover, biological validation have been done on the selected genes based on biological literatures and biological databases. Next, the results from the proposed methods are compared with the previous study throughout all the data sets in terms of performance. As conclusion, this research finding can contribute in biology area especially in cancer classification area.
format Article
author Nasrudin, Nurul Athirah
Weng, Howe Chan
Mohamad, Mohd. Saberi
Deris, Safaai
Napis, Suhaimi
Kasim, Shahreen
author_facet Nasrudin, Nurul Athirah
Weng, Howe Chan
Mohamad, Mohd. Saberi
Deris, Safaai
Napis, Suhaimi
Kasim, Shahreen
author_sort Nasrudin, Nurul Athirah
title Pathway-based analysis with support vector machine (SVM-LASSO) for gene selection and classification
title_short Pathway-based analysis with support vector machine (SVM-LASSO) for gene selection and classification
title_full Pathway-based analysis with support vector machine (SVM-LASSO) for gene selection and classification
title_fullStr Pathway-based analysis with support vector machine (SVM-LASSO) for gene selection and classification
title_full_unstemmed Pathway-based analysis with support vector machine (SVM-LASSO) for gene selection and classification
title_sort pathway-based analysis with support vector machine (svm-lasso) for gene selection and classification
publisher Insight Society
publishDate 2017
url http://eprints.utm.my/id/eprint/81261/
http://dx.doi.org/10.18517/ijaseit.7.4-2.3397
_version_ 1643658658445262848
score 13.18916