Comparison on some machine learning techniques in breast cancer classification

Breast cancer is the second most common cancer after lung cancer and one of the main causes of death worldwide. Women have a higher risk of breast cancer as compared to men. Thus, one of the early diagnosis with an accurate and reliable system is critical in breast cancer treatment. Machine learning...

Full description

Saved in:
Bibliographic Details
Main Authors: Mashudi, N. A., Rossli, S. A., Ahmad, N., Mohd. Noor, N.
Format: Conference or Workshop Item
Language:English
Published: 2021
Subjects:
Online Access:http://eprints.utm.my/id/eprint/95683/1/NurulAmirahMashudi2021_ComparisononSomeMachineLearningTechniques.pdf
http://eprints.utm.my/id/eprint/95683/
http://dx.doi.org/10.1109/IECBES48179.2021.9398837
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.95683
record_format eprints
spelling my.utm.956832022-05-31T13:04:34Z http://eprints.utm.my/id/eprint/95683/ Comparison on some machine learning techniques in breast cancer classification Mashudi, N. A. Rossli, S. A. Ahmad, N. Mohd. Noor, N. T Technology (General) Breast cancer is the second most common cancer after lung cancer and one of the main causes of death worldwide. Women have a higher risk of breast cancer as compared to men. Thus, one of the early diagnosis with an accurate and reliable system is critical in breast cancer treatment. Machine learning techniques are well known and popular among researchers, especially for classification and prediction. An investigation was conducted to evaluate the performance of breast cancer classification for malignant tumors and benign tumors using various machine learning techniques, namely k-Nearest Neighbors (k-NN), Random Forest, and Support Vector Machine (SVM) and ensemble techniques to compute the prediction of the breast cancer survival by implementing 10-fold cross validation. Additionally, the proposed methods are classified using 2-fold, 3-fold, and 5-fold cross validation to meet the best accuracy rate. This study used a dataset obtained from Wisconsin Diagnostic Breast Cancer (WDBC) with 23 selected attributes measured from 569 patients, from which 212 patients have malignant tumors and 357 patients have benign tumors. The performance evaluation of the proposed methods was computed to obtain accuracy, sensitivity, and specificity. Comparison results between all methods show that AdaBoost ensemble methods gave the highest accuracy at 98.77% for 10-fold cross validation, while 2-fold and 3-fold cross validation at 98.41% and 98.24%, respectively. Nevertheless, the result with 5-fold cross validation show SVM produced the best accuracy rate at 98.60% with the lowest error rate. 2021 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/95683/1/NurulAmirahMashudi2021_ComparisononSomeMachineLearningTechniques.pdf Mashudi, N. A. and Rossli, S. A. and Ahmad, N. and Mohd. Noor, N. (2021) Comparison on some machine learning techniques in breast cancer classification. In: 2020 IEEE EMBS Conference on Biomedical Engineering and Sciences, IECBES 2020, 1 - 3 March 2021, Virtual, Langkawi Island. http://dx.doi.org/10.1109/IECBES48179.2021.9398837
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic T Technology (General)
spellingShingle T Technology (General)
Mashudi, N. A.
Rossli, S. A.
Ahmad, N.
Mohd. Noor, N.
Comparison on some machine learning techniques in breast cancer classification
description Breast cancer is the second most common cancer after lung cancer and one of the main causes of death worldwide. Women have a higher risk of breast cancer as compared to men. Thus, one of the early diagnosis with an accurate and reliable system is critical in breast cancer treatment. Machine learning techniques are well known and popular among researchers, especially for classification and prediction. An investigation was conducted to evaluate the performance of breast cancer classification for malignant tumors and benign tumors using various machine learning techniques, namely k-Nearest Neighbors (k-NN), Random Forest, and Support Vector Machine (SVM) and ensemble techniques to compute the prediction of the breast cancer survival by implementing 10-fold cross validation. Additionally, the proposed methods are classified using 2-fold, 3-fold, and 5-fold cross validation to meet the best accuracy rate. This study used a dataset obtained from Wisconsin Diagnostic Breast Cancer (WDBC) with 23 selected attributes measured from 569 patients, from which 212 patients have malignant tumors and 357 patients have benign tumors. The performance evaluation of the proposed methods was computed to obtain accuracy, sensitivity, and specificity. Comparison results between all methods show that AdaBoost ensemble methods gave the highest accuracy at 98.77% for 10-fold cross validation, while 2-fold and 3-fold cross validation at 98.41% and 98.24%, respectively. Nevertheless, the result with 5-fold cross validation show SVM produced the best accuracy rate at 98.60% with the lowest error rate.
format Conference or Workshop Item
author Mashudi, N. A.
Rossli, S. A.
Ahmad, N.
Mohd. Noor, N.
author_facet Mashudi, N. A.
Rossli, S. A.
Ahmad, N.
Mohd. Noor, N.
author_sort Mashudi, N. A.
title Comparison on some machine learning techniques in breast cancer classification
title_short Comparison on some machine learning techniques in breast cancer classification
title_full Comparison on some machine learning techniques in breast cancer classification
title_fullStr Comparison on some machine learning techniques in breast cancer classification
title_full_unstemmed Comparison on some machine learning techniques in breast cancer classification
title_sort comparison on some machine learning techniques in breast cancer classification
publishDate 2021
url http://eprints.utm.my/id/eprint/95683/1/NurulAmirahMashudi2021_ComparisononSomeMachineLearningTechniques.pdf
http://eprints.utm.my/id/eprint/95683/
http://dx.doi.org/10.1109/IECBES48179.2021.9398837
_version_ 1735386834435833856
score 13.19449