An Efficient Prediction System for Coronary Heart Disease Risk Using Selected Principal Components and Hyperparameter Optimization

Medical science-related studies have reinforced that the prevalence of coronary heart disease which is associated with the heart and blood vessels has been the most significant cause of health loss and death globally. Recently, data mining and machine learning have been used to detect diseases based...

Full description

Saved in:
Bibliographic Details
Main Authors: Reddy, K.V.V., Elamvazuthi, I., Aziz, A.A., Paramasivam, S., Chua, H.N., Pranavanand, S.
Format: Article
Published: 2023
Online Access:http://scholars.utp.edu.my/id/eprint/34289/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85145698341&doi=10.3390%2fapp13010118&partnerID=40&md5=5d7a120d188f59249e86e5557a9151b4
Tags: Add Tag
No Tags, Be the first to tag this record!
id oai:scholars.utp.edu.my:34289
record_format eprints
spelling oai:scholars.utp.edu.my:342892023-01-17T13:35:34Z http://scholars.utp.edu.my/id/eprint/34289/ An Efficient Prediction System for Coronary Heart Disease Risk Using Selected Principal Components and Hyperparameter Optimization Reddy, K.V.V. Elamvazuthi, I. Aziz, A.A. Paramasivam, S. Chua, H.N. Pranavanand, S. Medical science-related studies have reinforced that the prevalence of coronary heart disease which is associated with the heart and blood vessels has been the most significant cause of health loss and death globally. Recently, data mining and machine learning have been used to detect diseases based on the unique characteristics of a person. However, these techniques have often posed challenges due to the complexity in understanding the objective of the datasets, the existence of too many factors to analyze as well as lack of performance accuracy. This research work is of two-fold effort: firstly, feature extraction and selection. This entails extraction of the principal components, and consequently, the Correlation-based Feature Selection (CFS) method was applied to select the finest principal components of the combined (Cleveland and Statlog) heart dataset. Secondly, by applying datasets to three single and three ensemble classifiers, the best hyperparameters that reflect the pre-eminent predictive outcomes were investigated. The experimental result reveals that hyperparameter optimization has improved the accuracy of all the models. In the comparative studies, the proposed work outperformed related works with an accuracy of 97.91, and an AUC of 0.996 by employing six optimal principal components selected from the CFS method and optimizing parameters of the Rotation Forest ensemble classifier. © 2022 by the authors. 2023 Article NonPeerReviewed Reddy, K.V.V. and Elamvazuthi, I. and Aziz, A.A. and Paramasivam, S. and Chua, H.N. and Pranavanand, S. (2023) An Efficient Prediction System for Coronary Heart Disease Risk Using Selected Principal Components and Hyperparameter Optimization. Applied Sciences (Switzerland), 13 (1). https://www.scopus.com/inward/record.uri?eid=2-s2.0-85145698341&doi=10.3390%2fapp13010118&partnerID=40&md5=5d7a120d188f59249e86e5557a9151b4 10.3390/app13010118 10.3390/app13010118 10.3390/app13010118
institution Universiti Teknologi Petronas
building UTP Resource Centre
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Petronas
content_source UTP Institutional Repository
url_provider http://eprints.utp.edu.my/
description Medical science-related studies have reinforced that the prevalence of coronary heart disease which is associated with the heart and blood vessels has been the most significant cause of health loss and death globally. Recently, data mining and machine learning have been used to detect diseases based on the unique characteristics of a person. However, these techniques have often posed challenges due to the complexity in understanding the objective of the datasets, the existence of too many factors to analyze as well as lack of performance accuracy. This research work is of two-fold effort: firstly, feature extraction and selection. This entails extraction of the principal components, and consequently, the Correlation-based Feature Selection (CFS) method was applied to select the finest principal components of the combined (Cleveland and Statlog) heart dataset. Secondly, by applying datasets to three single and three ensemble classifiers, the best hyperparameters that reflect the pre-eminent predictive outcomes were investigated. The experimental result reveals that hyperparameter optimization has improved the accuracy of all the models. In the comparative studies, the proposed work outperformed related works with an accuracy of 97.91, and an AUC of 0.996 by employing six optimal principal components selected from the CFS method and optimizing parameters of the Rotation Forest ensemble classifier. © 2022 by the authors.
format Article
author Reddy, K.V.V.
Elamvazuthi, I.
Aziz, A.A.
Paramasivam, S.
Chua, H.N.
Pranavanand, S.
spellingShingle Reddy, K.V.V.
Elamvazuthi, I.
Aziz, A.A.
Paramasivam, S.
Chua, H.N.
Pranavanand, S.
An Efficient Prediction System for Coronary Heart Disease Risk Using Selected Principal Components and Hyperparameter Optimization
author_facet Reddy, K.V.V.
Elamvazuthi, I.
Aziz, A.A.
Paramasivam, S.
Chua, H.N.
Pranavanand, S.
author_sort Reddy, K.V.V.
title An Efficient Prediction System for Coronary Heart Disease Risk Using Selected Principal Components and Hyperparameter Optimization
title_short An Efficient Prediction System for Coronary Heart Disease Risk Using Selected Principal Components and Hyperparameter Optimization
title_full An Efficient Prediction System for Coronary Heart Disease Risk Using Selected Principal Components and Hyperparameter Optimization
title_fullStr An Efficient Prediction System for Coronary Heart Disease Risk Using Selected Principal Components and Hyperparameter Optimization
title_full_unstemmed An Efficient Prediction System for Coronary Heart Disease Risk Using Selected Principal Components and Hyperparameter Optimization
title_sort efficient prediction system for coronary heart disease risk using selected principal components and hyperparameter optimization
publishDate 2023
url http://scholars.utp.edu.my/id/eprint/34289/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85145698341&doi=10.3390%2fapp13010118&partnerID=40&md5=5d7a120d188f59249e86e5557a9151b4
_version_ 1755874790788300800
score 13.214268