Pathway-based analysis with support vector machine (SVM-LASSO) for gene selection and classification

Genomic knowledge has become a popular research field in bioinformatics biological process that providing further biological process information. Many methods have been done to address the issues of high data throughput due to increased use of microarray technology. However, it is still not able to...

詳細記述

保存先:
書誌詳細
主要な著者: Nurul Athirah, Nasrudin, Chan, Weng Howe, Mohd Saberi, Mohamad, Safaai, Deris, Suhaimi, Napis, Shahreen, Kasim
フォーマット: 論文
言語:English
出版事項: Indonesian Society for Knowledge and Human Development 2017
主題:
オンライン・アクセス:http://umpir.ump.edu.my/id/eprint/29818/1/Pathway-based%20analysis%20with%20support%20vector%20machine.pdf
http://umpir.ump.edu.my/id/eprint/29818/
https://doi.org/10.18517/ijaseit.7.4-2.3397
https://doi.org/10.18517/ijaseit.7.4-2.3397
タグ: タグ追加
タグなし, このレコードへの初めてのタグを付けませんか!
その他の書誌記述
要約:Genomic knowledge has become a popular research field in bioinformatics biological process that providing further biological process information. Many methods have been done to address the issues of high data throughput due to increased use of microarray technology. However, it is still not able to determine the appropriate diseases accurately. This is because of existing non-informative genes that could be included in the analysis of context specific data like cancer gene expression data, which affect the classification performance. This study proposed a pathway-based analysis for gene classification. Pathway-based analysis enable handling microarray data in order to improved biological interpretation of the analysis outcome. Secondly, Support Vector Machine with Least Absolute Shrinkage and Selection Operator algorithm (SVM-LASSO) is proposed, which to find informative genes for each pathway to ensure efficient gene selection and classification in every pathway. Experiments are done using lung cancer dataset and breast cancer dataset that widely used in cancer classification area. A stratified 10-fold cross validation is implement to evaluate the performance of the proposed method in terms of accuracy, specificity and sensitivity. Moreover, biological validation have been done on the selected genes based on biological literatures and biological databases. Next, the results from the proposed methods are compared with the previous study throughout all the data sets in terms of performance. As conclusion, this research finding can contribute in biology area especially in cancer classification area.