Significant directed walk framework to increase the accuracy of cancer classification using gene expression data

Early diagnosis methods in cancer diagnosis studies are making great challenge as they require the involvement of different fields. Deoxyribonucleic acid (DNA) microarray analysis is one of the modern cancer diagnosis techniques used by scientists to measure the gene expression level changes in gene...

Full description

Saved in:
Bibliographic Details
Main Authors: Seah, C. S., Kasim, S., Md. Fudzee, M. F., Hassan, R.
Format: Article
Published: Springer Science and Business Media Deutschland GmbH 2021
Subjects:
Online Access:http://eprints.utm.my/id/eprint/93387/
http://dx.doi.org/10.1007/s12652-020-02404-1
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.93387
record_format eprints
spelling my.utm.933872021-11-30T08:20:57Z http://eprints.utm.my/id/eprint/93387/ Significant directed walk framework to increase the accuracy of cancer classification using gene expression data Seah, C. S. Kasim, S. Md. Fudzee, M. F. Hassan, R. QA75 Electronic computers. Computer science Early diagnosis methods in cancer diagnosis studies are making great challenge as they require the involvement of different fields. Deoxyribonucleic acid (DNA) microarray analysis is one of the modern cancer diagnosis techniques used by scientists to measure the gene expression level changes in gene expression data. From the perspective of computing, an algorithm can be developed to identify more difficult cases. Numerous cancer studies have combined different machine learning techniques for the cancer diagnosis. This study is conducted to improve the cancer diagnosis technique, directed random walk (DRW) from the direction of framework. Improved directed random walk framework is proposed with the new introduced sub-algorithms, a larger directed graph and a different classifier. It is named as significant directed walk (SDW). In this study, six gene expression datasets are applied to study the effectiveness of the sub-algorithm, directed graph and classifier in SDW in terms of cancer prediction and cancer classification. Sub-algorithms of SDW can be further divided into data pre-processing phase, specific tuning parameter selection, weight as additional variable, and exclusion of unwanted adjacency matrix. Besides that, SDW also incorporated four directed graphs to study the usability of the directed graph. The best directed graph among the four is chosen to be part of the structure in SDW. The experimental results showed that the combination of SDW with walker network and linear regression is the best among all. SDW is achieves accuracy of 95.03% in average which is higher by 8.97% compare to conventional DRW for all cancer datasets. This study provides a foundation for further studies and research on early diagnosis of cancer with machine learning technique. It is found that these findings would improve the early diagnosis methods of cancer classification. Springer Science and Business Media Deutschland GmbH 2021-07 Article PeerReviewed Seah, C. S. and Kasim, S. and Md. Fudzee, M. F. and Hassan, R. (2021) Significant directed walk framework to increase the accuracy of cancer classification using gene expression data. Journal of Ambient Intelligence and Humanized Computing, 12 (7). pp. 7281-7298. ISSN 1868-5137 http://dx.doi.org/10.1007/s12652-020-02404-1 DOI: 10.1007/s12652-020-02404-1
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Seah, C. S.
Kasim, S.
Md. Fudzee, M. F.
Hassan, R.
Significant directed walk framework to increase the accuracy of cancer classification using gene expression data
description Early diagnosis methods in cancer diagnosis studies are making great challenge as they require the involvement of different fields. Deoxyribonucleic acid (DNA) microarray analysis is one of the modern cancer diagnosis techniques used by scientists to measure the gene expression level changes in gene expression data. From the perspective of computing, an algorithm can be developed to identify more difficult cases. Numerous cancer studies have combined different machine learning techniques for the cancer diagnosis. This study is conducted to improve the cancer diagnosis technique, directed random walk (DRW) from the direction of framework. Improved directed random walk framework is proposed with the new introduced sub-algorithms, a larger directed graph and a different classifier. It is named as significant directed walk (SDW). In this study, six gene expression datasets are applied to study the effectiveness of the sub-algorithm, directed graph and classifier in SDW in terms of cancer prediction and cancer classification. Sub-algorithms of SDW can be further divided into data pre-processing phase, specific tuning parameter selection, weight as additional variable, and exclusion of unwanted adjacency matrix. Besides that, SDW also incorporated four directed graphs to study the usability of the directed graph. The best directed graph among the four is chosen to be part of the structure in SDW. The experimental results showed that the combination of SDW with walker network and linear regression is the best among all. SDW is achieves accuracy of 95.03% in average which is higher by 8.97% compare to conventional DRW for all cancer datasets. This study provides a foundation for further studies and research on early diagnosis of cancer with machine learning technique. It is found that these findings would improve the early diagnosis methods of cancer classification.
format Article
author Seah, C. S.
Kasim, S.
Md. Fudzee, M. F.
Hassan, R.
author_facet Seah, C. S.
Kasim, S.
Md. Fudzee, M. F.
Hassan, R.
author_sort Seah, C. S.
title Significant directed walk framework to increase the accuracy of cancer classification using gene expression data
title_short Significant directed walk framework to increase the accuracy of cancer classification using gene expression data
title_full Significant directed walk framework to increase the accuracy of cancer classification using gene expression data
title_fullStr Significant directed walk framework to increase the accuracy of cancer classification using gene expression data
title_full_unstemmed Significant directed walk framework to increase the accuracy of cancer classification using gene expression data
title_sort significant directed walk framework to increase the accuracy of cancer classification using gene expression data
publisher Springer Science and Business Media Deutschland GmbH
publishDate 2021
url http://eprints.utm.my/id/eprint/93387/
http://dx.doi.org/10.1007/s12652-020-02404-1
_version_ 1718926060696895488
score 13.160551