Handling imbalanced class problem of measles infectionrisk prediction model

Measles is an emerging infectious disease with increasing number of reported cases. It is a vaccine-preventable disease;thus, it is common to have imbalanced class problem in the dataset. This study aims to resolve the imbalanced class problem for the prediction of measles infection risk and to comp...

Full description

Saved in:
Bibliographic Details
Main Authors: Wan Ahmad W.M.T., Ghani N., Drus S.M.
Other Authors: 57211662334
Format: Article
Published: Blue Eyes Intelligence Engineering and Sciences Publication 2023
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.uniten.dspace-24388
record_format dspace
spelling my.uniten.dspace-243882023-05-29T15:23:10Z Handling imbalanced class problem of measles infectionrisk prediction model Wan Ahmad W.M.T. Ghani N. Drus S.M. 57211662334 56940219600 56330463900 Measles is an emerging infectious disease with increasing number of reported cases. It is a vaccine-preventable disease;thus, it is common to have imbalanced class problem in the dataset. This study aims to resolve the imbalanced class problem for the prediction of measles infection risk and to compare the predictive results on a balanced dataset based on three machine learningtechniques. The data that was utilized in this study contained 37,884 records of suspected measles casesthat were highly imbalanced towards negative measles cases. The Synthetic Minority Over-Sampling Technique (SMOTE) was performed to balance thedistribution of the target attribute. The balanced dataset was then modelled using logistic regression, decision tree and Na�ve Bayes. The predicted results indicated that logistic regression executed on the balanced dataset by SMOTE has the highest and most accurateclassification with 94.5% overall accuracy, 93.9% true positive rate, 5.8% false positive rate and 5.1% false negative rate. Therefore, SMOTE and other over-sampling approaches may be applicable to overcome imbalanced class issues in the medical dataset. � BEIESP. Final 2023-05-29T07:23:10Z 2023-05-29T07:23:10Z 2019 Article 10.35940/ijeat.A2649.109119 2-s2.0-85074788142 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85074788142&doi=10.35940%2fijeat.A2649.109119&partnerID=40&md5=8a6c1db2c6e5281263f29c7ebff9d646 https://irepository.uniten.edu.my/handle/123456789/24388 9 1 3431 3435 All Open Access, Bronze Blue Eyes Intelligence Engineering and Sciences Publication Scopus
institution Universiti Tenaga Nasional
building UNITEN Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Tenaga Nasional
content_source UNITEN Institutional Repository
url_provider http://dspace.uniten.edu.my/
description Measles is an emerging infectious disease with increasing number of reported cases. It is a vaccine-preventable disease;thus, it is common to have imbalanced class problem in the dataset. This study aims to resolve the imbalanced class problem for the prediction of measles infection risk and to compare the predictive results on a balanced dataset based on three machine learningtechniques. The data that was utilized in this study contained 37,884 records of suspected measles casesthat were highly imbalanced towards negative measles cases. The Synthetic Minority Over-Sampling Technique (SMOTE) was performed to balance thedistribution of the target attribute. The balanced dataset was then modelled using logistic regression, decision tree and Na�ve Bayes. The predicted results indicated that logistic regression executed on the balanced dataset by SMOTE has the highest and most accurateclassification with 94.5% overall accuracy, 93.9% true positive rate, 5.8% false positive rate and 5.1% false negative rate. Therefore, SMOTE and other over-sampling approaches may be applicable to overcome imbalanced class issues in the medical dataset. � BEIESP.
author2 57211662334
author_facet 57211662334
Wan Ahmad W.M.T.
Ghani N.
Drus S.M.
format Article
author Wan Ahmad W.M.T.
Ghani N.
Drus S.M.
spellingShingle Wan Ahmad W.M.T.
Ghani N.
Drus S.M.
Handling imbalanced class problem of measles infectionrisk prediction model
author_sort Wan Ahmad W.M.T.
title Handling imbalanced class problem of measles infectionrisk prediction model
title_short Handling imbalanced class problem of measles infectionrisk prediction model
title_full Handling imbalanced class problem of measles infectionrisk prediction model
title_fullStr Handling imbalanced class problem of measles infectionrisk prediction model
title_full_unstemmed Handling imbalanced class problem of measles infectionrisk prediction model
title_sort handling imbalanced class problem of measles infectionrisk prediction model
publisher Blue Eyes Intelligence Engineering and Sciences Publication
publishDate 2023
_version_ 1806426358090301440
score 13.214268