Staff View: Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction

Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction

High level of tropospheric ozone concentration, exceeding allowable level has been frequently reported in Malaysia. This study proposes accurate model based on Machine Learning algorithms to predict Tropospheric ozone concentration in major cities located in Kuala Lumpur and Selangor, Malaysia. The...

Full description

Saved in:

Bibliographic Details
Main Authors:	Jumin E., Zaini N., Ahmed A.N., Abdullah S., Ismail M., Sherif M., Sefelnasr A., El-Shafie A.
Other Authors:	57216831084
Format:	Article
Published:	Taylor and Francis Ltd. 2023
Tags:	Add Tag No Tags, Be the first to tag this record!

id	my.uniten.dspace-25733
record_format	dspace
spelling	my.uniten.dspace-257332023-05-29T16:13:33Z Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction Jumin E. Zaini N. Ahmed A.N. Abdullah S. Ismail M. Sherif M. Sefelnasr A. El-Shafie A. 57216831084 56905328500 57214837520 56509029800 57210403363 7005414714 6505592467 16068189400 High level of tropospheric ozone concentration, exceeding allowable level has been frequently reported in Malaysia. This study proposes accurate model based on Machine Learning algorithms to predict Tropospheric ozone concentration in major cities located in Kuala Lumpur and Selangor, Malaysia. The proposed models were developed using three-year of historical data for different parameters as input to predict 24-hour and 12-hour of tropospheric ozone concentration. Different Machine Learning algorithms have been investigated, viz. Linear Regression, Neural Network and Boosted Decision Tree. The results revealed that wind speed, humidity, Nitrogen Oxide, Carbon Monoxide and Nitrogen Dioxide have significant influence on ozone formation. Boosted Decision Tree outperformed Linear regression and Neural Network algorithms for all stations. The performance of the proposed model improved by using 12-hours dataset instead of the 24-hour where R2 values were equal to 0.91, 0.88 and 0.87 for the three investigated stations. To assess the uncertainties of the Boosted Decision Tree model, 95% prediction uncertainties (95PPU) d-factors were introduced.95PPU showed about 94.4, 93.4, 96.7% and the d-factors were 0.001015, 0.001016 and 0.001124 which relate to S1, S2 and S3, respectively. The obtained results provide a reliable prediction model to mimic actual ozone concentration in different locations in Malaysia. � 2020, � 2020 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. Final 2023-05-29T08:13:32Z 2023-05-29T08:13:32Z 2020 Article 10.1080/19942060.2020.1758792 2-s2.0-85084835158 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85084835158&doi=10.1080%2f19942060.2020.1758792&partnerID=40&md5=333f6763da33364053ddf6f0c5a1df34 https://irepository.uniten.edu.my/handle/123456789/25733 14 1 713 725 All Open Access, Gold Taylor and Francis Ltd. Scopus
institution	Universiti Tenaga Nasional
building	UNITEN Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Tenaga Nasional
content_source	UNITEN Institutional Repository
url_provider	http://dspace.uniten.edu.my/
description	High level of tropospheric ozone concentration, exceeding allowable level has been frequently reported in Malaysia. This study proposes accurate model based on Machine Learning algorithms to predict Tropospheric ozone concentration in major cities located in Kuala Lumpur and Selangor, Malaysia. The proposed models were developed using three-year of historical data for different parameters as input to predict 24-hour and 12-hour of tropospheric ozone concentration. Different Machine Learning algorithms have been investigated, viz. Linear Regression, Neural Network and Boosted Decision Tree. The results revealed that wind speed, humidity, Nitrogen Oxide, Carbon Monoxide and Nitrogen Dioxide have significant influence on ozone formation. Boosted Decision Tree outperformed Linear regression and Neural Network algorithms for all stations. The performance of the proposed model improved by using 12-hours dataset instead of the 24-hour where R2 values were equal to 0.91, 0.88 and 0.87 for the three investigated stations. To assess the uncertainties of the Boosted Decision Tree model, 95% prediction uncertainties (95PPU) d-factors were introduced.95PPU showed about 94.4, 93.4, 96.7% and the d-factors were 0.001015, 0.001016 and 0.001124 which relate to S1, S2 and S3, respectively. The obtained results provide a reliable prediction model to mimic actual ozone concentration in different locations in Malaysia. � 2020, � 2020 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.
author2	57216831084
author_facet	57216831084 Jumin E. Zaini N. Ahmed A.N. Abdullah S. Ismail M. Sherif M. Sefelnasr A. El-Shafie A.
format	Article
author	Jumin E. Zaini N. Ahmed A.N. Abdullah S. Ismail M. Sherif M. Sefelnasr A. El-Shafie A.
spellingShingle	Jumin E. Zaini N. Ahmed A.N. Abdullah S. Ismail M. Sherif M. Sefelnasr A. El-Shafie A. Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction
author_sort	Jumin E.
title	Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction
title_short	Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction
title_full	Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction
title_fullStr	Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction
title_full_unstemmed	Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction
title_sort	machine learning versus linear regression modelling approach for accurate ozone concentrations prediction
publisher	Taylor and Francis Ltd.
publishDate	2023
_version_	1806424558396243968
score	13.244413

Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction

Similar Items