Classification of cervical cancer using random forest

Cervical cancer is the second most common cancer among Malaysian women between 15 to 44 although the morbidity and the mortality of cervical cancer have been decreasing in recent years. Developing supervised models for cervical cancer is a challenging task. By gleaning deeper insights from the data,...

Full description

Saved in:
Bibliographic Details
Main Authors: Bahirah, Mohd Bashah, Ku Muhammad Naim, Ku Khalif, Nor Azuana, Ramli
Format: Conference or Workshop Item
Language:English
Published: 2022
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/36667/1/Classification%20of%20cervical%20cancer%20using%20random%20forest.pdf
http://umpir.ump.edu.my/id/eprint/36667/
https://ncon-pgr.ump.edu.my/index.php/en/?option=com_fileman&view=file&routed=1&name=E-BOOK%20NCON%202022%20.pdf&folder=E-BOOK%20NCON%202022&container=fileman-files
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Cervical cancer is the second most common cancer among Malaysian women between 15 to 44 although the morbidity and the mortality of cervical cancer have been decreasing in recent years. Developing supervised models for cervical cancer is a challenging task. By gleaning deeper insights from the data, data mining knowledge has capability to learn from data, identify the patterns with meaningful in that they lead to some advantages in many real-world applications. In this research, the cervical cancer risk classification model was used by using data mining approach which consider Decision Tree and Random Forest algorithm. These two models have been implemented by using JupyterLab on the UCI datasets. Model evaluation has been conducted to identify the robust data mining algorithm in the prediction of cervical cancer risk. The model gives 67% for the precision and 95% of accuracy. However, decision tree is the best method compared to Random Forest since Random Forest has the lowest AUC which indicated that it is the worse model. To improve this study, other method such as Artificial Neural Network, Support Vector Machine or ensemble classifiers can be applied to the dataset to see if there is a better model to predict cervical cancer.