Multiple imputations by chained equations for recovering missing daily streamflow observations: A case study of Langat River basin in Malaysia

Missing values in hydrological studies are a common issue for hydrologists, especially in statistical analyses as a complete dataset is required. This work evaluates the performance of the multiple imputations by chained equations (MICE) approach to predicting recurrence in streamflow datasets. To e...

Full description

Saved in:
Bibliographic Details
Main Authors: Hamzah, Fatimah Bibi, Mohamad Hamzah, Firdaus, Mohd Razali, Siti Fatin, El-Shafie, Ahmed
Format: Article
Published: Taylor & Francis 2022
Subjects:
Online Access:http://eprints.um.edu.my/33487/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Missing values in hydrological studies are a common issue for hydrologists, especially in statistical analyses as a complete dataset is required. This work evaluates the performance of the multiple imputations by chained equations (MICE) approach to predicting recurrence in streamflow datasets. To evaluate and verify the effectiveness of the MICE approach in treating missing streamflow data, complete historical daily streamflow data from 2012 to 2014 were used. Later, MICE methods coupled with multiple linear regression (MLR) were applied to restore streamflow rates in Malaysia's Langat River basin from 1978 to 2016. The best estimation methods are validated with tests such as adjusted R-squared (Adj R-2), residual standard error (RSE), and mean absolute percentage error (MAPE). The findings revealed that the classification and regression tree (CART) method combined with MLR outperformed the other approaches tested, with the highest Adj R-2 value and the lowest RSE and MAPE values observed regardless of missing conditions.