Identification of potential biomarkers using improved ranked guided iterative feature elimination

In healthcare, biomarkers serve an important role in disease classification. Many existing works are focusing in identifying potential biomarkers from gene expression. Moreover, the large number of redundant features in a high dimensional dataset such as gene expression would introduce bias in the c...

Full description

Saved in:
Bibliographic Details
Main Authors: Ng, Wen Xin, Chan, Weng Howe
Format: Article
Language:English
Published: Penerbit UTM Press 2021
Subjects:
Online Access:http://eprints.utm.my/id/eprint/97780/1/NgWenXin2021_IdentificationofPotentialBiomarkersusingImproved.pdf
http://eprints.utm.my/id/eprint/97780/
http://dx.doi.org/10.11113/ijic.v11n1.288
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.97780
record_format eprints
spelling my.utm.977802022-10-31T08:46:45Z http://eprints.utm.my/id/eprint/97780/ Identification of potential biomarkers using improved ranked guided iterative feature elimination Ng, Wen Xin Chan, Weng Howe QA75 Electronic computers. Computer science In healthcare, biomarkers serve an important role in disease classification. Many existing works are focusing in identifying potential biomarkers from gene expression. Moreover, the large number of redundant features in a high dimensional dataset such as gene expression would introduce bias in the classifier and reduce the classifier’s performance. Embedded feature selection methods such as ranked guided iterative feature elimination have been widely adopted owing to the good performance in identification of informative features. However, method like ranked guided iterative feature elimination does not consider the redundancy of the features. Thus, this paper proposes an improved ranked guided iterative feature elimination method by introducing an additional filter selection based on minimum redundancy maximum relevance to filter out redundant features and maintain the relevant feature subset to be ranked and used for classification. Experiments are done using two gene expression datasets for prostate cancer and central nervous system. The performance of the classification is measured in terms of accuracy and compared with existing methods. Meanwhile, biological context verification of the identified features is done through available knowledge databases. Our method shows improved classification accuracy, and the selected genes were found to have relationship with the diseases. Penerbit UTM Press 2021-06 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/97780/1/NgWenXin2021_IdentificationofPotentialBiomarkersusingImproved.pdf Ng, Wen Xin and Chan, Weng Howe (2021) Identification of potential biomarkers using improved ranked guided iterative feature elimination. International Journal of Innovative Computing, 11 (1). pp. 35-43. ISSN 2180-4370 http://dx.doi.org/10.11113/ijic.v11n1.288 DOI:10.11113/ijic.v11n1.288
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Ng, Wen Xin
Chan, Weng Howe
Identification of potential biomarkers using improved ranked guided iterative feature elimination
description In healthcare, biomarkers serve an important role in disease classification. Many existing works are focusing in identifying potential biomarkers from gene expression. Moreover, the large number of redundant features in a high dimensional dataset such as gene expression would introduce bias in the classifier and reduce the classifier’s performance. Embedded feature selection methods such as ranked guided iterative feature elimination have been widely adopted owing to the good performance in identification of informative features. However, method like ranked guided iterative feature elimination does not consider the redundancy of the features. Thus, this paper proposes an improved ranked guided iterative feature elimination method by introducing an additional filter selection based on minimum redundancy maximum relevance to filter out redundant features and maintain the relevant feature subset to be ranked and used for classification. Experiments are done using two gene expression datasets for prostate cancer and central nervous system. The performance of the classification is measured in terms of accuracy and compared with existing methods. Meanwhile, biological context verification of the identified features is done through available knowledge databases. Our method shows improved classification accuracy, and the selected genes were found to have relationship with the diseases.
format Article
author Ng, Wen Xin
Chan, Weng Howe
author_facet Ng, Wen Xin
Chan, Weng Howe
author_sort Ng, Wen Xin
title Identification of potential biomarkers using improved ranked guided iterative feature elimination
title_short Identification of potential biomarkers using improved ranked guided iterative feature elimination
title_full Identification of potential biomarkers using improved ranked guided iterative feature elimination
title_fullStr Identification of potential biomarkers using improved ranked guided iterative feature elimination
title_full_unstemmed Identification of potential biomarkers using improved ranked guided iterative feature elimination
title_sort identification of potential biomarkers using improved ranked guided iterative feature elimination
publisher Penerbit UTM Press
publishDate 2021
url http://eprints.utm.my/id/eprint/97780/1/NgWenXin2021_IdentificationofPotentialBiomarkersusingImproved.pdf
http://eprints.utm.my/id/eprint/97780/
http://dx.doi.org/10.11113/ijic.v11n1.288
_version_ 1748703143766523904
score 13.18916