Optimized support vector regression predicting treatment duration among tuberculosis patients in Malaysia

Machine learning models have emerged as an advanced tool for predicting diseases and their outcomes. This study developed a machine learning model to predict the treatment duration for Tuberculosis patients in Malaysia based on a real-life patient dataset. Six regression models, namely Support Vecto...

Full description

Saved in:
Bibliographic Details
Main Authors: Balakrishnan, Vimala, Ramanathan, Ghayathri, Zhou, Siyi, Wong, Chee Kuan
Format: Article
Published: Springer 2024
Subjects:
Online Access:http://eprints.um.edu.my/44989/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.eprints.44989
record_format eprints
spelling my.um.eprints.449892024-04-16T04:50:39Z http://eprints.um.edu.my/44989/ Optimized support vector regression predicting treatment duration among tuberculosis patients in Malaysia Balakrishnan, Vimala Ramanathan, Ghayathri Zhou, Siyi Wong, Chee Kuan R Medicine Machine learning models have emerged as an advanced tool for predicting diseases and their outcomes. This study developed a machine learning model to predict the treatment duration for Tuberculosis patients in Malaysia based on a real-life patient dataset. Six regression models, namely Support Vector Regression, Linear Regression, Lasso Regression, Ridge Regression, Random Forest Regression, and Gradient Boosting Regression were initially developed and then optimized through hyperparameter tuning to determine the best predictive model. Using a dataset of 435 Malaysian Tuberculosis patients, we compared our results with data from countries with high Tuberculosis prevalence rates, namely Belarus, Nigeria, and Georgia. Experimentations revealed Support Vector Regression emerged as the best performing model as it can predict treatment duration with the lowest error rates (Mean Absolute Error = 69.70; Root Mean Squared Error = 109.49). Eight significant risk factors were identified for the Malaysian dataset through Pearson correlation, namely, treatment outcome, treatment status, fixed dose combination dosage, maintenance phase regimen, chest X-ray findings, tuberculin skin test, location of treatment initiation, and levofloxacin-based regimen. Comparison with data from other countries confirmed the consistent performance of the optimized Support Vector Regression model in predicting Tuberculosis treatment duration, hence rendering the model generalizable. To the best of our knowledge, this is the first study to demonstrates the effectiveness of machine learning in predicting Tuberculosis treatment duration based on potential risk factors. These findings will help clinicians make informed decisions about the optimal treatment duration, prepare patients' expectations, and estimate the cost of Tuberculosis treatment. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature. Springer 2024 Article PeerReviewed Balakrishnan, Vimala and Ramanathan, Ghayathri and Zhou, Siyi and Wong, Chee Kuan (2024) Optimized support vector regression predicting treatment duration among tuberculosis patients in Malaysia. Multimedia Tools and Applications, 83 (4). 11831 – 11844. ISSN 1380-7501, DOI https://doi.org/10.1007/s11042-023-16028-y <https://doi.org/10.1007/s11042-023-16028-y>. 10.1007/s11042-023-16028-y
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic R Medicine
spellingShingle R Medicine
Balakrishnan, Vimala
Ramanathan, Ghayathri
Zhou, Siyi
Wong, Chee Kuan
Optimized support vector regression predicting treatment duration among tuberculosis patients in Malaysia
description Machine learning models have emerged as an advanced tool for predicting diseases and their outcomes. This study developed a machine learning model to predict the treatment duration for Tuberculosis patients in Malaysia based on a real-life patient dataset. Six regression models, namely Support Vector Regression, Linear Regression, Lasso Regression, Ridge Regression, Random Forest Regression, and Gradient Boosting Regression were initially developed and then optimized through hyperparameter tuning to determine the best predictive model. Using a dataset of 435 Malaysian Tuberculosis patients, we compared our results with data from countries with high Tuberculosis prevalence rates, namely Belarus, Nigeria, and Georgia. Experimentations revealed Support Vector Regression emerged as the best performing model as it can predict treatment duration with the lowest error rates (Mean Absolute Error = 69.70; Root Mean Squared Error = 109.49). Eight significant risk factors were identified for the Malaysian dataset through Pearson correlation, namely, treatment outcome, treatment status, fixed dose combination dosage, maintenance phase regimen, chest X-ray findings, tuberculin skin test, location of treatment initiation, and levofloxacin-based regimen. Comparison with data from other countries confirmed the consistent performance of the optimized Support Vector Regression model in predicting Tuberculosis treatment duration, hence rendering the model generalizable. To the best of our knowledge, this is the first study to demonstrates the effectiveness of machine learning in predicting Tuberculosis treatment duration based on potential risk factors. These findings will help clinicians make informed decisions about the optimal treatment duration, prepare patients' expectations, and estimate the cost of Tuberculosis treatment. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
format Article
author Balakrishnan, Vimala
Ramanathan, Ghayathri
Zhou, Siyi
Wong, Chee Kuan
author_facet Balakrishnan, Vimala
Ramanathan, Ghayathri
Zhou, Siyi
Wong, Chee Kuan
author_sort Balakrishnan, Vimala
title Optimized support vector regression predicting treatment duration among tuberculosis patients in Malaysia
title_short Optimized support vector regression predicting treatment duration among tuberculosis patients in Malaysia
title_full Optimized support vector regression predicting treatment duration among tuberculosis patients in Malaysia
title_fullStr Optimized support vector regression predicting treatment duration among tuberculosis patients in Malaysia
title_full_unstemmed Optimized support vector regression predicting treatment duration among tuberculosis patients in Malaysia
title_sort optimized support vector regression predicting treatment duration among tuberculosis patients in malaysia
publisher Springer
publishDate 2024
url http://eprints.um.edu.my/44989/
_version_ 1797906866029723648
score 13.1944895