A novel stacked long short-term memory approach of deep learning for streamflow simulation

Rainfall-Runoff simulation is the backbone of all hydrological and climate change studies. This study proposes a novel stochastic model for daily rainfall-runoff simulation called Stacked Long Short-Term Memory (SLSTM) relying on machine learning technology. The SLSTM model utilizes only the rainfal...

Full description

Saved in:
Bibliographic Details
Main Authors: Mirzaei, Majid, Yu, Haoxuan, Dehghani, Adnan, Galavi, Hadi, Shokri, Vahid, Mohsenzadeh Karimi, Sahar, Sookhak, Mehdi
Format: Article
Published: MDPI 2021
Subjects:
Online Access:http://eprints.um.edu.my/28564/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.eprints.28564
record_format eprints
spelling my.um.eprints.285642022-08-16T06:26:58Z http://eprints.um.edu.my/28564/ A novel stacked long short-term memory approach of deep learning for streamflow simulation Mirzaei, Majid Yu, Haoxuan Dehghani, Adnan Galavi, Hadi Shokri, Vahid Mohsenzadeh Karimi, Sahar Sookhak, Mehdi TA Engineering (General). Civil engineering (General) Rainfall-Runoff simulation is the backbone of all hydrological and climate change studies. This study proposes a novel stochastic model for daily rainfall-runoff simulation called Stacked Long Short-Term Memory (SLSTM) relying on machine learning technology. The SLSTM model utilizes only the rainfall-runoff data in its modelling approach and the hydrology system is deemed a blackbox. Conversely, the distributed and physically-based hydrological models, e.g., SWAT (Soil and Water Assessment Tool) preserve the physical aspect of hydrological variables and their inter-relations while taking a wide range of data. The two model types provide specific applications that interest modelers, who can apply them according to their project specification and objectives. However, sparse distribution of point-data may hinder physical models' performance, which may not be the case in data-driven models. This study proposes a specific SLSTM model and investigates the SLSTM and SWAT models' data dependency in terms of their spatial distribution. The study was conducted in the two distinct river basins of Samarahan and Trusan, Malaysia, with over 20 years of hydro-climate data. The Trusan basin's rain gauges are scattered downstream of the basin outlet and Samarahan's are located around the basin, with one station within each basin's limits. The SWAT was developed and calibrated following its general modelling approach, however, the SLSTM performance was also tested using data preprocessing with principal component analysis (PCA). Results showed that the SWAT performance for daily streamflow simulation at Samarahan has been superior to that of Trusan. Both the SLSTM and PCA-SLSTM models, however, showed better performance at Trusan with PCA-SLSTM outperforming the SLSTM. This demonstrates that the SWAT model is greatly affected by the spatial distribution of its input data, while data-driven models, irrespective of the spatial distribution of their entry data, can perform well if the data adequacy condition is met. However, considering the structural difference between the two models, each has its specific application in a water resources context. The study of catchments' response to changes in the hydrology cycle requires a physically-based model like SWAT with proper spatial and temporal distribution of its entry data. However, the study of a specific phenomenon without considering the underlying processes can be done using data-driven models like SLSTM, where improper spatial distribution of data cannot be a restricting factor. MDPI 2021-12 Article PeerReviewed Mirzaei, Majid and Yu, Haoxuan and Dehghani, Adnan and Galavi, Hadi and Shokri, Vahid and Mohsenzadeh Karimi, Sahar and Sookhak, Mehdi (2021) A novel stacked long short-term memory approach of deep learning for streamflow simulation. Sustainability, 13 (23). ISSN 2071-1050, DOI https://doi.org/10.3390/su132313384 <https://doi.org/10.3390/su132313384>. 10.3390/su132313384
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic TA Engineering (General). Civil engineering (General)
spellingShingle TA Engineering (General). Civil engineering (General)
Mirzaei, Majid
Yu, Haoxuan
Dehghani, Adnan
Galavi, Hadi
Shokri, Vahid
Mohsenzadeh Karimi, Sahar
Sookhak, Mehdi
A novel stacked long short-term memory approach of deep learning for streamflow simulation
description Rainfall-Runoff simulation is the backbone of all hydrological and climate change studies. This study proposes a novel stochastic model for daily rainfall-runoff simulation called Stacked Long Short-Term Memory (SLSTM) relying on machine learning technology. The SLSTM model utilizes only the rainfall-runoff data in its modelling approach and the hydrology system is deemed a blackbox. Conversely, the distributed and physically-based hydrological models, e.g., SWAT (Soil and Water Assessment Tool) preserve the physical aspect of hydrological variables and their inter-relations while taking a wide range of data. The two model types provide specific applications that interest modelers, who can apply them according to their project specification and objectives. However, sparse distribution of point-data may hinder physical models' performance, which may not be the case in data-driven models. This study proposes a specific SLSTM model and investigates the SLSTM and SWAT models' data dependency in terms of their spatial distribution. The study was conducted in the two distinct river basins of Samarahan and Trusan, Malaysia, with over 20 years of hydro-climate data. The Trusan basin's rain gauges are scattered downstream of the basin outlet and Samarahan's are located around the basin, with one station within each basin's limits. The SWAT was developed and calibrated following its general modelling approach, however, the SLSTM performance was also tested using data preprocessing with principal component analysis (PCA). Results showed that the SWAT performance for daily streamflow simulation at Samarahan has been superior to that of Trusan. Both the SLSTM and PCA-SLSTM models, however, showed better performance at Trusan with PCA-SLSTM outperforming the SLSTM. This demonstrates that the SWAT model is greatly affected by the spatial distribution of its input data, while data-driven models, irrespective of the spatial distribution of their entry data, can perform well if the data adequacy condition is met. However, considering the structural difference between the two models, each has its specific application in a water resources context. The study of catchments' response to changes in the hydrology cycle requires a physically-based model like SWAT with proper spatial and temporal distribution of its entry data. However, the study of a specific phenomenon without considering the underlying processes can be done using data-driven models like SLSTM, where improper spatial distribution of data cannot be a restricting factor.
format Article
author Mirzaei, Majid
Yu, Haoxuan
Dehghani, Adnan
Galavi, Hadi
Shokri, Vahid
Mohsenzadeh Karimi, Sahar
Sookhak, Mehdi
author_facet Mirzaei, Majid
Yu, Haoxuan
Dehghani, Adnan
Galavi, Hadi
Shokri, Vahid
Mohsenzadeh Karimi, Sahar
Sookhak, Mehdi
author_sort Mirzaei, Majid
title A novel stacked long short-term memory approach of deep learning for streamflow simulation
title_short A novel stacked long short-term memory approach of deep learning for streamflow simulation
title_full A novel stacked long short-term memory approach of deep learning for streamflow simulation
title_fullStr A novel stacked long short-term memory approach of deep learning for streamflow simulation
title_full_unstemmed A novel stacked long short-term memory approach of deep learning for streamflow simulation
title_sort novel stacked long short-term memory approach of deep learning for streamflow simulation
publisher MDPI
publishDate 2021
url http://eprints.um.edu.my/28564/
_version_ 1744649127128465408
score 13.160551