Hybrid deep learning model for ozone concentration prediction: Comprehensive evaluation and comparison with various machine and deep learning algorithms

Bibliographic Details
Main Authors: Yafouz, Ayman; Ahmed, Ali Najah; Zaini, Nur'atiah; Sherif, Mohsen; Sefelnasr, Ahmed; El-Shafie, Ahmed
Format: Article
Published: Taylor & Francis 2021
Online Access: http://eprints.um.edu.my/28354/
Description
Summary: Accurate prediction of tropospheric ozone (O3) concentration requires investigating the performance of a variety of artificial intelligence techniques, including machine learning, deep learning and hybrid models. This research aims to predict the hourly ozone trend effectively using fewer input variables. The prediction is performed on diverse data comprising air pollutants (NO2, NOx, CO, SO2) and meteorological parameters (wind speed and humidity). The historical datasets were collected from three sites in Malaysia. The methodology follows two paths, standalone and hybrid models, in which hourly-averaged datasets are applied under an analysis scenario of five time horizons with different input combinations. For evaluation, all models are tested using five performance indicators and illustrated on a modified Taylor diagram. The sensitivity of the input variables is quantified. Additionally, uncertainty analysis is conducted to assess the models' confidence level in association with the Willmott Index. Based on R², the results indicate that XGBoost is more accurate than MLP and SVR, while LSTM and CNN outperform XGBoost. In terms of robustness and accuracy, the proposed hybrid model outperforms all of the above-mentioned techniques. The proposed model achieved the highest R² (93.48%), the highest 95% confidence degree (98.16%), and a narrower confidence interval width (0.0014195).
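Note: The summary names R² and the Willmott Index among its evaluation measures but does not give their formulas. As a rough illustration only, the sketch below computes the standard coefficient of determination and Willmott's index of agreement in its original 1981 form. The function names and the sample values are hypothetical and are not taken from the article, and the authors may have used different variants (for example, a squared Pearson correlation for R², or a refined Willmott index).

```python
from statistics import mean


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot (assumed definition)."""
    obs_mean = mean(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - obs_mean) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def willmott_index(observed, predicted):
    """Willmott's index of agreement d (1981 form); 1.0 means perfect agreement."""
    obs_mean = mean(observed)
    numerator = sum((p - o) ** 2 for o, p in zip(observed, predicted))
    denominator = sum(
        (abs(p - obs_mean) + abs(o - obs_mean)) ** 2
        for o, p in zip(observed, predicted)
    )
    return 1.0 - numerator / denominator


# Made-up hourly ozone values for illustration; NOT data from the study.
observed = [30.0, 42.0, 55.0, 61.0, 48.0, 35.0]
predicted = [32.0, 40.0, 53.0, 64.0, 47.0, 33.0]

print("R^2:", r_squared(observed, predicted))
print("Willmott index:", willmott_index(observed, predicted))
```

Both measures equal 1 for a perfect prediction. Under this definition, R² falls to 0 when the model does no better than predicting the observed mean.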