Machine learning algorithms for high-resolution prediction of spatiotemporal distribution of air pollution from meteorological and soil parameters

This study uses machine learning (ML) models for a high-resolution prediction (0.1°×0.1°) of air fine particular matter (PM2.5) concentration, the most harmful to human health, from meteorological and soil data. Iraq was considered the study area to implement the method. Different lags and the chang...

Full description

Saved in:
Bibliographic Details
Main Authors: Tao, Hai, Jawad, Ali H., Shather, A. H., Al-Khafaji, Zainab, A. Rashid, Tarik, Ali, Mumtaz, Al-Ansari, Nadhir, Marhoon, Haydar Abdulameer, Shahid, Shamsuddin, Yaseen, Zaher Mundher
Format: Article
Language:English
Published: Elsevier Ltd 2023
Subjects:
Online Access:http://eprints.utm.my/106853/1/ShamsuddinShahid2023_MachineLearningAlgorithmsForHighResolution.pdf
http://eprints.utm.my/106853/
http://dx.doi.org/10.1016/j.envint.2023.107931
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.106853
record_format eprints
spelling my.utm.1068532024-08-04T06:59:58Z http://eprints.utm.my/106853/ Machine learning algorithms for high-resolution prediction of spatiotemporal distribution of air pollution from meteorological and soil parameters Tao, Hai Jawad, Ali H. Shather, A. H. Al-Khafaji, Zainab A. Rashid, Tarik Ali, Mumtaz Al-Ansari, Nadhir Marhoon, Haydar Abdulameer Shahid, Shamsuddin Yaseen, Zaher Mundher TA Engineering (General). Civil engineering (General) This study uses machine learning (ML) models for a high-resolution prediction (0.1°×0.1°) of air fine particular matter (PM2.5) concentration, the most harmful to human health, from meteorological and soil data. Iraq was considered the study area to implement the method. Different lags and the changing patterns of four European Reanalysis (ERA5) meteorological variables, rainfall, mean temperature, wind speed and relative humidity, and one soil parameter, the soil moisture, were used to select the suitable set of predictors using a non-greedy algorithm known as simulated annealing (SA). The selected predictors were used to simulate the temporal and spatial variability of air PM2.5 concentration over Iraq during the early summer (May-July), the most polluted months, using three advanced ML models, extremely randomized trees (ERT), stochastic gradient descent backpropagation (SGD-BP) and long short-term memory (LSTM) integrated with Bayesian optimizer. The spatial distribution of the annual average PM2.5 revealed the population of the whole of Iraq is exposed to a pollution level above the standard limit. The changes in temperature and soil moisture and the mean wind speed and humidity of the month before the early summer can predict the temporal and spatial variability of PM2.5 over Iraq during May-July. Results revealed the higher performance of LSTM with normalized root-mean-square error and Kling-Gupta efficiency of 13.4% and 0.89, compared to 16.02% and 0.81 for SDG-BP and 17.9% and 0.74 for ERT. The LSTM could also reconstruct the observed spatial distribution of PM2.5 with MapCurve and Cramer's V values of 0.95 and 0.91, compared to 0.9 and 0.86 for SGD-BP and 0.83 and 0.76 for ERT. The study provided a methodology for forecasting spatial variability of PM2.5 concentration at high resolution during the peak pollution months from freely available data, which can be replicated in other regions for generating high-resolution PM2.5 forecasting maps. Elsevier Ltd 2023 Article PeerReviewed application/pdf en http://eprints.utm.my/106853/1/ShamsuddinShahid2023_MachineLearningAlgorithmsForHighResolution.pdf Tao, Hai and Jawad, Ali H. and Shather, A. H. and Al-Khafaji, Zainab and A. Rashid, Tarik and Ali, Mumtaz and Al-Ansari, Nadhir and Marhoon, Haydar Abdulameer and Shahid, Shamsuddin and Yaseen, Zaher Mundher (2023) Machine learning algorithms for high-resolution prediction of spatiotemporal distribution of air pollution from meteorological and soil parameters. Environment International, 175 (NA). pp. 1-17. ISSN 0160-4120 http://dx.doi.org/10.1016/j.envint.2023.107931 DOI : 10.1016/j.envint.2023.107931
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic TA Engineering (General). Civil engineering (General)
spellingShingle TA Engineering (General). Civil engineering (General)
Tao, Hai
Jawad, Ali H.
Shather, A. H.
Al-Khafaji, Zainab
A. Rashid, Tarik
Ali, Mumtaz
Al-Ansari, Nadhir
Marhoon, Haydar Abdulameer
Shahid, Shamsuddin
Yaseen, Zaher Mundher
Machine learning algorithms for high-resolution prediction of spatiotemporal distribution of air pollution from meteorological and soil parameters
description This study uses machine learning (ML) models for a high-resolution prediction (0.1°×0.1°) of air fine particular matter (PM2.5) concentration, the most harmful to human health, from meteorological and soil data. Iraq was considered the study area to implement the method. Different lags and the changing patterns of four European Reanalysis (ERA5) meteorological variables, rainfall, mean temperature, wind speed and relative humidity, and one soil parameter, the soil moisture, were used to select the suitable set of predictors using a non-greedy algorithm known as simulated annealing (SA). The selected predictors were used to simulate the temporal and spatial variability of air PM2.5 concentration over Iraq during the early summer (May-July), the most polluted months, using three advanced ML models, extremely randomized trees (ERT), stochastic gradient descent backpropagation (SGD-BP) and long short-term memory (LSTM) integrated with Bayesian optimizer. The spatial distribution of the annual average PM2.5 revealed the population of the whole of Iraq is exposed to a pollution level above the standard limit. The changes in temperature and soil moisture and the mean wind speed and humidity of the month before the early summer can predict the temporal and spatial variability of PM2.5 over Iraq during May-July. Results revealed the higher performance of LSTM with normalized root-mean-square error and Kling-Gupta efficiency of 13.4% and 0.89, compared to 16.02% and 0.81 for SDG-BP and 17.9% and 0.74 for ERT. The LSTM could also reconstruct the observed spatial distribution of PM2.5 with MapCurve and Cramer's V values of 0.95 and 0.91, compared to 0.9 and 0.86 for SGD-BP and 0.83 and 0.76 for ERT. The study provided a methodology for forecasting spatial variability of PM2.5 concentration at high resolution during the peak pollution months from freely available data, which can be replicated in other regions for generating high-resolution PM2.5 forecasting maps.
format Article
author Tao, Hai
Jawad, Ali H.
Shather, A. H.
Al-Khafaji, Zainab
A. Rashid, Tarik
Ali, Mumtaz
Al-Ansari, Nadhir
Marhoon, Haydar Abdulameer
Shahid, Shamsuddin
Yaseen, Zaher Mundher
author_facet Tao, Hai
Jawad, Ali H.
Shather, A. H.
Al-Khafaji, Zainab
A. Rashid, Tarik
Ali, Mumtaz
Al-Ansari, Nadhir
Marhoon, Haydar Abdulameer
Shahid, Shamsuddin
Yaseen, Zaher Mundher
author_sort Tao, Hai
title Machine learning algorithms for high-resolution prediction of spatiotemporal distribution of air pollution from meteorological and soil parameters
title_short Machine learning algorithms for high-resolution prediction of spatiotemporal distribution of air pollution from meteorological and soil parameters
title_full Machine learning algorithms for high-resolution prediction of spatiotemporal distribution of air pollution from meteorological and soil parameters
title_fullStr Machine learning algorithms for high-resolution prediction of spatiotemporal distribution of air pollution from meteorological and soil parameters
title_full_unstemmed Machine learning algorithms for high-resolution prediction of spatiotemporal distribution of air pollution from meteorological and soil parameters
title_sort machine learning algorithms for high-resolution prediction of spatiotemporal distribution of air pollution from meteorological and soil parameters
publisher Elsevier Ltd
publishDate 2023
url http://eprints.utm.my/106853/1/ShamsuddinShahid2023_MachineLearningAlgorithmsForHighResolution.pdf
http://eprints.utm.my/106853/
http://dx.doi.org/10.1016/j.envint.2023.107931
_version_ 1806442418262769664
score 13.18916