Leptospirosis modelling using hydrometeorological indices and random forest machine learning

Leptospirosis is a zoonosis that has been linked to hydrometeorological variability. Hydrometeorological averages and extremes have been used before as drivers in the statistical prediction of disease. However, their importance and predictive capacity are still little known. In this study, the use o...

Full description

Saved in:
Bibliographic Details
Main Authors: Jayaramu, Veianthan, Zulkafli, Zed, De Stercke, Simon, Buytaert, Wouter, Rahmat, Fariq, Abdul Rahman, Ribhan Zafira, Ishak, Asnor Juraiza, Tahir, Wardah, Ab Rahman, Jamalludin, Mohd Fuzi, Nik Mohd Hafiz
Format: Article
Published: Springer Science and Business Media 2023
Online Access:http://psasir.upm.edu.my/id/eprint/109550/
https://link.springer.com/article/10.1007/s00484-022-02422-y?error=cookies_not_supported&code=b68bc8b6-2a6c-44d6-9266-49d454508008
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.upm.eprints.109550
record_format eprints
spelling my.upm.eprints.1095502024-12-17T03:48:33Z http://psasir.upm.edu.my/id/eprint/109550/ Leptospirosis modelling using hydrometeorological indices and random forest machine learning Jayaramu, Veianthan Zulkafli, Zed De Stercke, Simon Buytaert, Wouter Rahmat, Fariq Abdul Rahman, Ribhan Zafira Ishak, Asnor Juraiza Tahir, Wardah Ab Rahman, Jamalludin Mohd Fuzi, Nik Mohd Hafiz Leptospirosis is a zoonosis that has been linked to hydrometeorological variability. Hydrometeorological averages and extremes have been used before as drivers in the statistical prediction of disease. However, their importance and predictive capacity are still little known. In this study, the use of a random forest classifier was explored to analyze the relative importance of hydrometeorological indices in developing the leptospirosis model and to evaluate the performance of models based on the type of indices used, using case data from three districts in Kelantan, Malaysia, that experience annual monsoonal rainfall and flooding. First, hydrometeorological data including rainfall, streamflow, water level, relative humidity, and temperature were transformed into 164 weekly average and extreme indices in accordance with the Expert Team on Climate Change Detection and Indices (ETCCDI). Then, weekly case occurrences were classified into binary classes “high” and “low” based on an average threshold. Seventeen models based on “average,” “extreme,” and “mixed” indices were trained by optimizing the feature subsets based on the model computed mean decrease Gini (MDG) scores. The variable importance was assessed through cross-correlation analysis and the MDG score. The average and extreme models showed similar prediction accuracy ranges (61.5–76.1 and 72.3–77.0) while the mixed models showed an improvement (71.7–82.6 prediction accuracy). An extreme model was the most sensitive while an average model was the most specific. The time lag associated with the driving indices agreed with the seasonality of the monsoon. The rainfall variable (extreme) was the most important in classifying the leptospirosis occurrence while streamflow was the least important despite showing higher correlations with leptospirosis. Springer Science and Business Media 2023-01-31 Article PeerReviewed Jayaramu, Veianthan and Zulkafli, Zed and De Stercke, Simon and Buytaert, Wouter and Rahmat, Fariq and Abdul Rahman, Ribhan Zafira and Ishak, Asnor Juraiza and Tahir, Wardah and Ab Rahman, Jamalludin and Mohd Fuzi, Nik Mohd Hafiz (2023) Leptospirosis modelling using hydrometeorological indices and random forest machine learning. International Journal of Biometeorology, 67 (3). pp. 423-437. ISSN 0020-7128; eISSN: 1432-1254 https://link.springer.com/article/10.1007/s00484-022-02422-y?error=cookies_not_supported&code=b68bc8b6-2a6c-44d6-9266-49d454508008 10.1007/s00484-022-02422-y
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
description Leptospirosis is a zoonosis that has been linked to hydrometeorological variability. Hydrometeorological averages and extremes have been used before as drivers in the statistical prediction of disease. However, their importance and predictive capacity are still little known. In this study, the use of a random forest classifier was explored to analyze the relative importance of hydrometeorological indices in developing the leptospirosis model and to evaluate the performance of models based on the type of indices used, using case data from three districts in Kelantan, Malaysia, that experience annual monsoonal rainfall and flooding. First, hydrometeorological data including rainfall, streamflow, water level, relative humidity, and temperature were transformed into 164 weekly average and extreme indices in accordance with the Expert Team on Climate Change Detection and Indices (ETCCDI). Then, weekly case occurrences were classified into binary classes “high” and “low” based on an average threshold. Seventeen models based on “average,” “extreme,” and “mixed” indices were trained by optimizing the feature subsets based on the model computed mean decrease Gini (MDG) scores. The variable importance was assessed through cross-correlation analysis and the MDG score. The average and extreme models showed similar prediction accuracy ranges (61.5–76.1 and 72.3–77.0) while the mixed models showed an improvement (71.7–82.6 prediction accuracy). An extreme model was the most sensitive while an average model was the most specific. The time lag associated with the driving indices agreed with the seasonality of the monsoon. The rainfall variable (extreme) was the most important in classifying the leptospirosis occurrence while streamflow was the least important despite showing higher correlations with leptospirosis.
format Article
author Jayaramu, Veianthan
Zulkafli, Zed
De Stercke, Simon
Buytaert, Wouter
Rahmat, Fariq
Abdul Rahman, Ribhan Zafira
Ishak, Asnor Juraiza
Tahir, Wardah
Ab Rahman, Jamalludin
Mohd Fuzi, Nik Mohd Hafiz
spellingShingle Jayaramu, Veianthan
Zulkafli, Zed
De Stercke, Simon
Buytaert, Wouter
Rahmat, Fariq
Abdul Rahman, Ribhan Zafira
Ishak, Asnor Juraiza
Tahir, Wardah
Ab Rahman, Jamalludin
Mohd Fuzi, Nik Mohd Hafiz
Leptospirosis modelling using hydrometeorological indices and random forest machine learning
author_facet Jayaramu, Veianthan
Zulkafli, Zed
De Stercke, Simon
Buytaert, Wouter
Rahmat, Fariq
Abdul Rahman, Ribhan Zafira
Ishak, Asnor Juraiza
Tahir, Wardah
Ab Rahman, Jamalludin
Mohd Fuzi, Nik Mohd Hafiz
author_sort Jayaramu, Veianthan
title Leptospirosis modelling using hydrometeorological indices and random forest machine learning
title_short Leptospirosis modelling using hydrometeorological indices and random forest machine learning
title_full Leptospirosis modelling using hydrometeorological indices and random forest machine learning
title_fullStr Leptospirosis modelling using hydrometeorological indices and random forest machine learning
title_full_unstemmed Leptospirosis modelling using hydrometeorological indices and random forest machine learning
title_sort leptospirosis modelling using hydrometeorological indices and random forest machine learning
publisher Springer Science and Business Media
publishDate 2023
url http://psasir.upm.edu.my/id/eprint/109550/
https://link.springer.com/article/10.1007/s00484-022-02422-y?error=cookies_not_supported&code=b68bc8b6-2a6c-44d6-9266-49d454508008
_version_ 1818835930686947328
score 13.223943