Staff View: Identification of significant features and data mining techniques in predicting heart disease

Identification of significant features and data mining techniques in predicting heart disease

Cardiovascular disease is one of the biggest cause for morbidity and mortality among the population of the world. Prediction of cardiovascular disease is regarded as one of the most important subject in the section of clinical data analysis. The amount of data in the healthcare industry is huge. Dat...

Full description

Saved in:

Bibliographic Details
Main Authors:	Amin, Mohammad Shafenoor, Chiam, Yin Kia, Varathan, Kasturi Dewi
Format:	Article
Published:	Elsevier 2019
Subjects:	QA75 Electronic computers. Computer science QA76 Computer software
Online Access:	http://eprints.um.edu.my/20121/ https://doi.org/10.1016/j.tele.2018.11.007
Tags:	Add Tag No Tags, Be the first to tag this record!

id	my.um.eprints.20121
record_format	eprints
spelling	my.um.eprints.201212019-01-23T04:34:34Z http://eprints.um.edu.my/20121/ Identification of significant features and data mining techniques in predicting heart disease Amin, Mohammad Shafenoor Chiam, Yin Kia Varathan, Kasturi Dewi QA75 Electronic computers. Computer science QA76 Computer software Cardiovascular disease is one of the biggest cause for morbidity and mortality among the population of the world. Prediction of cardiovascular disease is regarded as one of the most important subject in the section of clinical data analysis. The amount of data in the healthcare industry is huge. Data mining turns the large collection of raw healthcare data into information that can help to make informed decision and prediction. There are some existing studies that applied data mining techniques in heart disease prediction. Nonetheless, studies that have given attention towards the significant features that play a vital role in predicting cardiovascular disease are limited. It is crucial to select the correct combination of significant features that can improve the performance of the prediction models. This research aims to identify significant features and data mining techniques that can improve the accuracy of predicting cardiovascular disease. Prediction models were developed using different combination of features, and seven classification techniques: k-NN, Decision Tree, Naive Bayes, Logistic Regression (LR), Support Vector Machine (SVM), Neural Network and Vote (a hybrid technique with Naïve Bayes and Logistic Regression). Experiment results show that the heart disease prediction model developed using the identified significant features and the best-performing data mining technique (i.e. Vote) achieves an accuracy of 87.4% in heart disease prediction. Elsevier 2019 Article PeerReviewed Amin, Mohammad Shafenoor and Chiam, Yin Kia and Varathan, Kasturi Dewi (2019) Identification of significant features and data mining techniques in predicting heart disease. Telematics and Informatics, 36. pp. 82-93. ISSN 0736-5853 https://doi.org/10.1016/j.tele.2018.11.007 doi:10.1016/j.tele.2018.11.007
institution	Universiti Malaya
building	UM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Malaya
content_source	UM Research Repository
url_provider	http://eprints.um.edu.my/
topic	QA75 Electronic computers. Computer science QA76 Computer software
spellingShingle	QA75 Electronic computers. Computer science QA76 Computer software Amin, Mohammad Shafenoor Chiam, Yin Kia Varathan, Kasturi Dewi Identification of significant features and data mining techniques in predicting heart disease
description	Cardiovascular disease is one of the biggest cause for morbidity and mortality among the population of the world. Prediction of cardiovascular disease is regarded as one of the most important subject in the section of clinical data analysis. The amount of data in the healthcare industry is huge. Data mining turns the large collection of raw healthcare data into information that can help to make informed decision and prediction. There are some existing studies that applied data mining techniques in heart disease prediction. Nonetheless, studies that have given attention towards the significant features that play a vital role in predicting cardiovascular disease are limited. It is crucial to select the correct combination of significant features that can improve the performance of the prediction models. This research aims to identify significant features and data mining techniques that can improve the accuracy of predicting cardiovascular disease. Prediction models were developed using different combination of features, and seven classification techniques: k-NN, Decision Tree, Naive Bayes, Logistic Regression (LR), Support Vector Machine (SVM), Neural Network and Vote (a hybrid technique with Naïve Bayes and Logistic Regression). Experiment results show that the heart disease prediction model developed using the identified significant features and the best-performing data mining technique (i.e. Vote) achieves an accuracy of 87.4% in heart disease prediction.
format	Article
author	Amin, Mohammad Shafenoor Chiam, Yin Kia Varathan, Kasturi Dewi
author_facet	Amin, Mohammad Shafenoor Chiam, Yin Kia Varathan, Kasturi Dewi
author_sort	Amin, Mohammad Shafenoor
title	Identification of significant features and data mining techniques in predicting heart disease
title_short	Identification of significant features and data mining techniques in predicting heart disease
title_full	Identification of significant features and data mining techniques in predicting heart disease
title_fullStr	Identification of significant features and data mining techniques in predicting heart disease
title_full_unstemmed	Identification of significant features and data mining techniques in predicting heart disease
title_sort	identification of significant features and data mining techniques in predicting heart disease
publisher	Elsevier
publishDate	2019
url	http://eprints.um.edu.my/20121/ https://doi.org/10.1016/j.tele.2018.11.007
_version_	1643691185639784448
score	13.18916

Identification of significant features and data mining techniques in predicting heart disease

Similar Items