Developing an ensembled machine learning prediction model for marine fish and aquaculture production

The fishing industry is identified as a strategic sector to raise domestic protein production and supply in Malaysia. Global changes in climatic variables have impacted and continue to impact marine fish and aquaculture production, where machine learning (ML) methods are yet to be extensively used t...

Full description

Saved in:
Bibliographic Details
Main Authors: Rahman L.F., Marufuzzaman M., Alam L., Bari M.A., Sumaila U.R., Sidek L.M.
Other Authors: 36984229900
Format: Article
Published: MDPI AG 2023
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.uniten.dspace-26043
record_format dspace
spelling my.uniten.dspace-260432023-05-29T17:06:18Z Developing an ensembled machine learning prediction model for marine fish and aquaculture production Rahman L.F. Marufuzzaman M. Alam L. Bari M.A. Sumaila U.R. Sidek L.M. 36984229900 57205234835 37053462100 55639915700 6701840163 35070506500 The fishing industry is identified as a strategic sector to raise domestic protein production and supply in Malaysia. Global changes in climatic variables have impacted and continue to impact marine fish and aquaculture production, where machine learning (ML) methods are yet to be extensively used to study aquatic systems in Malaysia. ML-based algorithms could be paired with feature importance, i.e., (features that have the most predictive power) to achieve better prediction accuracy and can provide new insights on fish production. This research aims to develop an MLbased prediction of marine fish and aquaculture production. Based on the feature importance scores, we select the group of climatic variables for three different ML models: linear, gradient boosting, and random forest regression. The past 20 years (2000�2019) of climatic variables and fish production data were used to train and test the ML models. Finally, an ensemble approach named voting regression combines those three ML models. Performance matrices are generated and the results showed that the ensembled ML model obtains R2 values of 0.75, 0.81, and 0.55 for marine water, freshwater, and brackish water, respectively, which outperforms the single ML model in predicting all three types of fish production (in tons) in Malaysia. � 2021 by the authors. Licensee MDPI, Basel, Switzerland. Final 2023-05-29T09:06:17Z 2023-05-29T09:06:17Z 2021 Article 10.3390/su13169124 2-s2.0-85113807873 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85113807873&doi=10.3390%2fsu13169124&partnerID=40&md5=0ab683d17c71fab595ffe7ac46d714de https://irepository.uniten.edu.my/handle/123456789/26043 13 16 9124 All Open Access, Gold MDPI AG Scopus
institution Universiti Tenaga Nasional
building UNITEN Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Tenaga Nasional
content_source UNITEN Institutional Repository
url_provider http://dspace.uniten.edu.my/
description The fishing industry is identified as a strategic sector to raise domestic protein production and supply in Malaysia. Global changes in climatic variables have impacted and continue to impact marine fish and aquaculture production, where machine learning (ML) methods are yet to be extensively used to study aquatic systems in Malaysia. ML-based algorithms could be paired with feature importance, i.e., (features that have the most predictive power) to achieve better prediction accuracy and can provide new insights on fish production. This research aims to develop an MLbased prediction of marine fish and aquaculture production. Based on the feature importance scores, we select the group of climatic variables for three different ML models: linear, gradient boosting, and random forest regression. The past 20 years (2000�2019) of climatic variables and fish production data were used to train and test the ML models. Finally, an ensemble approach named voting regression combines those three ML models. Performance matrices are generated and the results showed that the ensembled ML model obtains R2 values of 0.75, 0.81, and 0.55 for marine water, freshwater, and brackish water, respectively, which outperforms the single ML model in predicting all three types of fish production (in tons) in Malaysia. � 2021 by the authors. Licensee MDPI, Basel, Switzerland.
author2 36984229900
author_facet 36984229900
Rahman L.F.
Marufuzzaman M.
Alam L.
Bari M.A.
Sumaila U.R.
Sidek L.M.
format Article
author Rahman L.F.
Marufuzzaman M.
Alam L.
Bari M.A.
Sumaila U.R.
Sidek L.M.
spellingShingle Rahman L.F.
Marufuzzaman M.
Alam L.
Bari M.A.
Sumaila U.R.
Sidek L.M.
Developing an ensembled machine learning prediction model for marine fish and aquaculture production
author_sort Rahman L.F.
title Developing an ensembled machine learning prediction model for marine fish and aquaculture production
title_short Developing an ensembled machine learning prediction model for marine fish and aquaculture production
title_full Developing an ensembled machine learning prediction model for marine fish and aquaculture production
title_fullStr Developing an ensembled machine learning prediction model for marine fish and aquaculture production
title_full_unstemmed Developing an ensembled machine learning prediction model for marine fish and aquaculture production
title_sort developing an ensembled machine learning prediction model for marine fish and aquaculture production
publisher MDPI AG
publishDate 2023
_version_ 1806427347711164416
score 13.214268