An effective heart disease detection and severity level classification model using machine learning and hyperparameter optimization methods

Cardiovascular disease (CVD) is the leading cause of death worldwide. A Machine Learning (ML) system can predict CVD in the early stages to mitigate mortality rates based on clinical data. Recently, many research works utilized different machine learning approaches to detect CVD or identify the pati...

Full description

Saved in:
Bibliographic Details
Main Authors: Abdellatif, Abdallah, Abdellatef, Hamdan, Kanesan, Jeevan, Chow, Chee-Onn, Chuah, Joon Huang, Gheni, Hassan Muwafaq
Format: Article
Published: Institute of Electrical and Electronics Engineers 2022
Subjects:
Online Access:http://eprints.um.edu.my/41777/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Cardiovascular disease (CVD) is the leading cause of death worldwide. A Machine Learning (ML) system can predict CVD in the early stages to mitigate mortality rates based on clinical data. Recently, many research works utilized different machine learning approaches to detect CVD or identify the patient's severity level. Although these works obtained promising results, none focused on employing optimization methods to improve the ML model performance for CVD detection and severity-level classification. This study provides an effective method based on the Synthetic Minority Oversampling Technique (SMOTE) to handle imbalance distribution issue, six different ML classifiers to detect the patient status, and Hyperparameter Optimization (HPO) to find the best hyperparameter for ML classifier together with SMOTE. Two public datasets were used to build and test the model using all features. The results show that SMOTE and Extra Trees (ET) optimized using hyperband achieved higher results than other models and outperformed the state-of-the-art works by achieving 99.2% and 98.52% in CVD detection, respectively. Also, the developed model converged to 95.73% severity classification using the Cleveland dataset. The proposed model can help doctors determine a patient's current heart disease status. As a result, it is possible to prevent heart disease-related mortality by implementing early therapy.