Neural network-based correlation and statistical identification of data outliers in H2S-alkanolamine-H2O and CO2-alkanolamine-H2O datasets
Throughout the published literature for phase equilibrium data of CO2-alkanolamine-H2O and H2S-alkanolamine-H2O systems, it is common to find some discrepant data, called data outliers. The presence of these erroneous values induces inaccuracies and prediction errors in the models and simulation stu...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Article |
Published: |
Springer Science and Business Media Deutschland GmbH
2022
|
Online Access: | http://scholars.utp.edu.my/id/eprint/33932/ https://www.scopus.com/inward/record.uri?eid=2-s2.0-85139545410&doi=10.1007%2fs00521-022-07904-z&partnerID=40&md5=97469bda6b87592a38118dbb42645fc5 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Throughout the published literature for phase equilibrium data of CO2-alkanolamine-H2O and H2S-alkanolamine-H2O systems, it is common to find some discrepant data, called data outliers. The presence of these erroneous values induces inaccuracies and prediction errors in the models and simulation studies developed using such experimental datasets. Hence, it is important that the data outliers are identified and later corrected or removed before developing a model or simulation. This study proposes a modified approach to identifying data outliers present in the phase equilibrium data of CO2-alkanolamine-H2O and H2S-alkanolamine-H2O systems using an artificial neural network and data outlier identification methods. Firstly, the suggested approach correlates the experimental phase equilibrium data (2152 data points) of CO2 and H2S-loaded monoethanolamine, diethanolamine, and N-methyldiethanolamine solutions by developing an artificial neural network. Following this, the data outliers are identified by applying a modified IQR method and compared graphically to 2.5 standard deviation method. The identified data outliers can then be truncated or winsorised for developing reliable and accurate models/simulations. The modified IQR method coupled with a neural network (based on the normalised data values) can robustly identify data outliers within a large experimental dataset. The proposed approach is superior to the previous data outlier identification techniques that used 2.5 standard deviations method, as it alleviates the need for a human decision in determining the congruence of experimental values. The results also indicate that the developed method can be reliably extended to other/larger non-linear experimental datasets having similar correlative complexity. © 2022, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature. |
---|