The Effectiveness of Data Augmentation for Melanoma Skin Cancer Prediction Using Convolutional Neural Networks

Melanoma skin cancer has been a serious threat due to its high fatality. For this reason, early detection and treatments are given more attention as countermeasures. In recent years, skin cancer detection has been utilizing artificial intelligence techniques, specifically deep convolutional neural n...

Full description

Saved in:
Bibliographic Details
Main Authors: Kin, Wai Lee, Ka, Renee Yin Chin
Format: Conference or Workshop Item
Language:English
English
Published: IEEE Xplore 2020
Subjects:
Online Access:https://eprints.ums.edu.my/id/eprint/30132/1/The%20Effectiveness%20of%20Data%20Augmentation%20for%20Melanoma%20Skin%20Cancer%20Prediction%20Using%20Convolutional%20Neural%20Networks%20abstract.pdf
https://eprints.ums.edu.my/id/eprint/30132/2/The%20Effectiveness%20of%20Data%20Augmentation%20for%20Melanoma%20Skin%20Cancer%20Prediction%20Using%20Convolutional%20Neural%20Networks.pdf
https://eprints.ums.edu.my/id/eprint/30132/
https://ieeexplore.ieee.org/document/9257859
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Melanoma skin cancer has been a serious threat due to its high fatality. For this reason, early detection and treatments are given more attention as countermeasures. In recent years, skin cancer detection has been utilizing artificial intelligence techniques, specifically deep convolutional neural network. However, the performance of the convolutional neural network is highly vulnerable to different data constraints, such as the quality and quantity of the data. Therefore, this study explores the synthetization of training data using different data augmentation methods. The work presented in this paper utilizes four different categories of data augmentation methods, which include geometrical transformation, noise addition, color transformation, and image mix. Multiple layers data augmentation approach is also explored. Dataset expansion strategies and optimized dataset expansion scale are determined to improve the performance of the skin cancer classification. The core findings in our study revealed that single-layer augmentation has better performance than multiple layers augmentation approaches, where region of interest (ROI) image mix has the highest effectiveness compared to other methods. In addition, the best dataset expansion strategy is random ROI image mix. Finally, the optimized dataset expansion is determined at 300%, which yielded the best overall test accuracy at 82.9%, 4.6% improvement compared to unprocessed raw dataset.