Data mining techniques for disease risk prediction model: A systematic literature review

Risk prediction model estimates event occurrence based on related data. Conventional statistical metrics that utilized primary data generates simple descriptive analysis that often provide insufficient knowledge for decision making. In contrast, data mining techniques that have the capability to fin...

Full description

Saved in:
Bibliographic Details
Main Authors: Ahmad, W.M.T.W., Ghani, N.L.A., Drus, S.M.
Format: Conference Paper
Language:English
Published: 2020
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.uniten.dspace-13249
record_format dspace
spelling my.uniten.dspace-132492020-03-17T05:22:33Z Data mining techniques for disease risk prediction model: A systematic literature review Ahmad, W.M.T.W. Ghani, N.L.A. Drus, S.M. Risk prediction model estimates event occurrence based on related data. Conventional statistical metrics that utilized primary data generates simple descriptive analysis that often provide insufficient knowledge for decision making. In contrast, data mining techniques that have the capability to find hidden pattern from the secondary data in large databases and create prediction for de- sired output has become a popular approach to develop any risk prediction model. In healthcare particularly, data mining techniques can be applied in disease risk prediction model to provide reliable prediction on the possibility of acquiring the disease based on individual’s clinical and non-clinical data. Due to the increased use of data mining in healthcare, this study aims at identifying the data mining techniques and algorithms that are commonly implemented in studies related to various disease risk prediction model as well as finding the accuracy of the algorithms. The accuracy evaluation consists of various method, but this paper is focusing on overall accuracy which is measured by the total number of correctly predicted output over the total number of prediction. A systematic literature review approach that search across five databases found 170 articles, of which 7 articles were selected in the final process. This review found that most prediction model used classification technique, with a focus on decision tree, neural network, support vector machines, and Naïve Bayes algorithms where heart-related disease is commonly studied. Further research can apply similar algorithms to develop risk prediction model for other types of diseases, such as infectious disease prediction. © Springer Nature Switzerland AG 2019. 2020-02-03T03:31:21Z 2020-02-03T03:31:21Z 2019 Conference Paper 10.1007/978-3-319-99007-1_4 en
institution Universiti Tenaga Nasional
building UNITEN Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Tenaga Nasional
content_source UNITEN Institutional Repository
url_provider http://dspace.uniten.edu.my/
language English
description Risk prediction model estimates event occurrence based on related data. Conventional statistical metrics that utilized primary data generates simple descriptive analysis that often provide insufficient knowledge for decision making. In contrast, data mining techniques that have the capability to find hidden pattern from the secondary data in large databases and create prediction for de- sired output has become a popular approach to develop any risk prediction model. In healthcare particularly, data mining techniques can be applied in disease risk prediction model to provide reliable prediction on the possibility of acquiring the disease based on individual’s clinical and non-clinical data. Due to the increased use of data mining in healthcare, this study aims at identifying the data mining techniques and algorithms that are commonly implemented in studies related to various disease risk prediction model as well as finding the accuracy of the algorithms. The accuracy evaluation consists of various method, but this paper is focusing on overall accuracy which is measured by the total number of correctly predicted output over the total number of prediction. A systematic literature review approach that search across five databases found 170 articles, of which 7 articles were selected in the final process. This review found that most prediction model used classification technique, with a focus on decision tree, neural network, support vector machines, and Naïve Bayes algorithms where heart-related disease is commonly studied. Further research can apply similar algorithms to develop risk prediction model for other types of diseases, such as infectious disease prediction. © Springer Nature Switzerland AG 2019.
format Conference Paper
author Ahmad, W.M.T.W.
Ghani, N.L.A.
Drus, S.M.
spellingShingle Ahmad, W.M.T.W.
Ghani, N.L.A.
Drus, S.M.
Data mining techniques for disease risk prediction model: A systematic literature review
author_facet Ahmad, W.M.T.W.
Ghani, N.L.A.
Drus, S.M.
author_sort Ahmad, W.M.T.W.
title Data mining techniques for disease risk prediction model: A systematic literature review
title_short Data mining techniques for disease risk prediction model: A systematic literature review
title_full Data mining techniques for disease risk prediction model: A systematic literature review
title_fullStr Data mining techniques for disease risk prediction model: A systematic literature review
title_full_unstemmed Data mining techniques for disease risk prediction model: A systematic literature review
title_sort data mining techniques for disease risk prediction model: a systematic literature review
publishDate 2020
_version_ 1662758835877576704
score 13.214268