Predicting 30-day mortality after an acute coronary syndrome (ACS) using machine learning methods for feature selection, classification and visualisation
Hybrid combinations of feature selection, classification and visualisation using machine learning (ham) methods have the potential for enhanced understanding and 30-day mortality prediction of patients with cardiovascular disease using population-specific data. Identifying a feature selection method...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Published: |
Penerbit Universiti Kebangsaan Malaysia
2021
|
Subjects: | |
Online Access: | http://eprints.um.edu.my/28045/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.um.eprints.28045 |
---|---|
record_format |
eprints |
spelling |
my.um.eprints.280452022-07-19T02:31:13Z http://eprints.um.edu.my/28045/ Predicting 30-day mortality after an acute coronary syndrome (ACS) using machine learning methods for feature selection, classification and visualisation Aziida, Nanyonga Malek, Sorayya Aziz, Firdaus Ibrahim, Khairul Shafiq Kasim, Sazzli Q Science (General) QH301 Biology R Medicine (General) Hybrid combinations of feature selection, classification and visualisation using machine learning (ham) methods have the potential for enhanced understanding and 30-day mortality prediction of patients with cardiovascular disease using population-specific data. Identifying a feature selection method with a classifier algorithm that produces high performance in mortality studies is essential and has not been reported before. Feature selection methods such as Boruta, Random Forest (RF), Elastic Net (EN), Recursive Feature Elimination (RFE), learning vector quantization (LVQ), Genetic Algorithm (GA), Cluster Dendrogram (CD), Support Vector Machine (SI/M) and Logistic Regression (LR) were combined with RE SLM, LR, and EN classifiers for 30-day mortality prediction. ML models were constructed using 302 patients and 54 input variables from the Malaysian National Cardiovascular Disease Database. Validation of the best ML model was performed against Thrombolysis in Myocardial Infarction (TMI) using an additional dataset of 102 patients. The Self-Organising Feature Map (SOM) was used to visualise mortality-related factors post ACS. The performance of ML models using the area under the curve (AUC) ranged from 0.48 to 0.80. The best-performing model (AUC = 0.80) was a hybrid combination of the RF variable importance method, the sequential backward selection and the RF classifier using five predictors (age, triglyceride, creatinine, troponin, and total cholesterol). Comparison with T1M1 using an additional dataset resulted in the best ML model outperforming the TiM! score (AUC = 0.75 vs. AUC 0.60). The findings of this study will provide a basis for developing an online ML-based population-specific risk scoring calculator. Penerbit Universiti Kebangsaan Malaysia 2021-03 Article PeerReviewed Aziida, Nanyonga and Malek, Sorayya and Aziz, Firdaus and Ibrahim, Khairul Shafiq and Kasim, Sazzli (2021) Predicting 30-day mortality after an acute coronary syndrome (ACS) using machine learning methods for feature selection, classification and visualisation. Sains Malaysiana, 50 (3). pp. 753-768. ISSN 0126-6039, DOI https://doi.org/10.17576/jsm-2021-5003-17 <https://doi.org/10.17576/jsm-2021-5003-17>. 10.17576/jsm-2021-5003-17 |
institution |
Universiti Malaya |
building |
UM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaya |
content_source |
UM Research Repository |
url_provider |
http://eprints.um.edu.my/ |
topic |
Q Science (General) QH301 Biology R Medicine (General) |
spellingShingle |
Q Science (General) QH301 Biology R Medicine (General) Aziida, Nanyonga Malek, Sorayya Aziz, Firdaus Ibrahim, Khairul Shafiq Kasim, Sazzli Predicting 30-day mortality after an acute coronary syndrome (ACS) using machine learning methods for feature selection, classification and visualisation |
description |
Hybrid combinations of feature selection, classification and visualisation using machine learning (ham) methods have the potential for enhanced understanding and 30-day mortality prediction of patients with cardiovascular disease using population-specific data. Identifying a feature selection method with a classifier algorithm that produces high performance in mortality studies is essential and has not been reported before. Feature selection methods such as Boruta, Random Forest (RF), Elastic Net (EN), Recursive Feature Elimination (RFE), learning vector quantization (LVQ), Genetic Algorithm (GA), Cluster Dendrogram (CD), Support Vector Machine (SI/M) and Logistic Regression (LR) were combined with RE SLM, LR, and EN classifiers for 30-day mortality prediction. ML models were constructed using 302 patients and 54 input variables from the Malaysian National Cardiovascular Disease Database. Validation of the best ML model was performed against Thrombolysis in Myocardial Infarction (TMI) using an additional dataset of 102 patients. The Self-Organising Feature Map (SOM) was used to visualise mortality-related factors post ACS. The performance of ML models using the area under the curve (AUC) ranged from 0.48 to 0.80. The best-performing model (AUC = 0.80) was a hybrid combination of the RF variable importance method, the sequential backward selection and the RF classifier using five predictors (age, triglyceride, creatinine, troponin, and total cholesterol). Comparison with T1M1 using an additional dataset resulted in the best ML model outperforming the TiM! score (AUC = 0.75 vs. AUC 0.60). The findings of this study will provide a basis for developing an online ML-based population-specific risk scoring calculator. |
format |
Article |
author |
Aziida, Nanyonga Malek, Sorayya Aziz, Firdaus Ibrahim, Khairul Shafiq Kasim, Sazzli |
author_facet |
Aziida, Nanyonga Malek, Sorayya Aziz, Firdaus Ibrahim, Khairul Shafiq Kasim, Sazzli |
author_sort |
Aziida, Nanyonga |
title |
Predicting 30-day mortality after an acute coronary syndrome (ACS) using machine learning methods for feature selection, classification and visualisation |
title_short |
Predicting 30-day mortality after an acute coronary syndrome (ACS) using machine learning methods for feature selection, classification and visualisation |
title_full |
Predicting 30-day mortality after an acute coronary syndrome (ACS) using machine learning methods for feature selection, classification and visualisation |
title_fullStr |
Predicting 30-day mortality after an acute coronary syndrome (ACS) using machine learning methods for feature selection, classification and visualisation |
title_full_unstemmed |
Predicting 30-day mortality after an acute coronary syndrome (ACS) using machine learning methods for feature selection, classification and visualisation |
title_sort |
predicting 30-day mortality after an acute coronary syndrome (acs) using machine learning methods for feature selection, classification and visualisation |
publisher |
Penerbit Universiti Kebangsaan Malaysia |
publishDate |
2021 |
url |
http://eprints.um.edu.my/28045/ |
_version_ |
1739828429294927872 |
score |
13.211869 |