Missing value imputation designs and methods of nature-inspired metaheuristic techniques: A systematic review

Missing values are highly undesirable in real-world datasets. The missing values should be estimated and treated during the preprocessing stage. With the expansion of nature-inspired metaheuristic techniques, interest in missing value imputation (MVI) has increased. The main goal of this literature...

Full description

Saved in:
Bibliographic Details
Main Authors: Chiu, Po Chan, Selamat, Ali, Krejcar, Ondrej, Kuok, King Kuok, Abdul Bujang, Siti Dianah, Fujita, Hamido
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2022
Subjects:
Online Access:http://eprints.utm.my/104382/1/AliSelamat2022_MissingValueImputationDesignsandMethods.pdf
http://eprints.utm.my/104382/
http://dx.doi.org/10.1109/ACCESS.2022.3172319
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.104382
record_format eprints
spelling my.utm.1043822024-02-04T09:42:20Z http://eprints.utm.my/104382/ Missing value imputation designs and methods of nature-inspired metaheuristic techniques: A systematic review Chiu, Po Chan Selamat, Ali Krejcar, Ondrej Kuok, King Kuok Abdul Bujang, Siti Dianah Fujita, Hamido T Technology (General) Missing values are highly undesirable in real-world datasets. The missing values should be estimated and treated during the preprocessing stage. With the expansion of nature-inspired metaheuristic techniques, interest in missing value imputation (MVI) has increased. The main goal of this literature is to identify and review the existing research on missing value imputation (MVI) in terms of nature-inspired metaheuristic approaches, dataset designs, missingness mechanisms, and missing rates, as well as the most used evaluation metrics between 2011 and 2021. This study ultimately gives insight into how the MVI plan can be incorporated into the experimental design. Using the systematic literature review (SLR) guidelines designed by Kitchenham, this study utilizes renowned scientific databases to retrieve and analyze all relevant articles during the search process. A total of 48 related articles from 2011 to 2021 were selected to assess the review questions. This review indicated that the synthetic missing dataset is the most popular baseline test dataset to evaluate the effectiveness of the imputation strategy. The study revealed that missing at random (MAR) is the most common proposed missing mechanism in the datasets. This review also indicated that the hybridizations of metaheuristics with clustering or neural networks are popular among researchers. The superior performance of the hybrid approaches is significantly attributed to the power of optimized learning in MVI models. In addition, perspectives, challenges, and opportunities in MVI are also addressed in this literature. The outcome of this review serves as a toolkit for the researchers to develop effective MVI models. Institute of Electrical and Electronics Engineers Inc. 2022 Article PeerReviewed application/pdf en http://eprints.utm.my/104382/1/AliSelamat2022_MissingValueImputationDesignsandMethods.pdf Chiu, Po Chan and Selamat, Ali and Krejcar, Ondrej and Kuok, King Kuok and Abdul Bujang, Siti Dianah and Fujita, Hamido (2022) Missing value imputation designs and methods of nature-inspired metaheuristic techniques: A systematic review. IEEE Access, 10 (NA). pp. 61544-61566. ISSN 2169-3536 http://dx.doi.org/10.1109/ACCESS.2022.3172319 DOI : 10.1109/ACCESS.2022.3172319
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic T Technology (General)
spellingShingle T Technology (General)
Chiu, Po Chan
Selamat, Ali
Krejcar, Ondrej
Kuok, King Kuok
Abdul Bujang, Siti Dianah
Fujita, Hamido
Missing value imputation designs and methods of nature-inspired metaheuristic techniques: A systematic review
description Missing values are highly undesirable in real-world datasets. The missing values should be estimated and treated during the preprocessing stage. With the expansion of nature-inspired metaheuristic techniques, interest in missing value imputation (MVI) has increased. The main goal of this literature is to identify and review the existing research on missing value imputation (MVI) in terms of nature-inspired metaheuristic approaches, dataset designs, missingness mechanisms, and missing rates, as well as the most used evaluation metrics between 2011 and 2021. This study ultimately gives insight into how the MVI plan can be incorporated into the experimental design. Using the systematic literature review (SLR) guidelines designed by Kitchenham, this study utilizes renowned scientific databases to retrieve and analyze all relevant articles during the search process. A total of 48 related articles from 2011 to 2021 were selected to assess the review questions. This review indicated that the synthetic missing dataset is the most popular baseline test dataset to evaluate the effectiveness of the imputation strategy. The study revealed that missing at random (MAR) is the most common proposed missing mechanism in the datasets. This review also indicated that the hybridizations of metaheuristics with clustering or neural networks are popular among researchers. The superior performance of the hybrid approaches is significantly attributed to the power of optimized learning in MVI models. In addition, perspectives, challenges, and opportunities in MVI are also addressed in this literature. The outcome of this review serves as a toolkit for the researchers to develop effective MVI models.
format Article
author Chiu, Po Chan
Selamat, Ali
Krejcar, Ondrej
Kuok, King Kuok
Abdul Bujang, Siti Dianah
Fujita, Hamido
author_facet Chiu, Po Chan
Selamat, Ali
Krejcar, Ondrej
Kuok, King Kuok
Abdul Bujang, Siti Dianah
Fujita, Hamido
author_sort Chiu, Po Chan
title Missing value imputation designs and methods of nature-inspired metaheuristic techniques: A systematic review
title_short Missing value imputation designs and methods of nature-inspired metaheuristic techniques: A systematic review
title_full Missing value imputation designs and methods of nature-inspired metaheuristic techniques: A systematic review
title_fullStr Missing value imputation designs and methods of nature-inspired metaheuristic techniques: A systematic review
title_full_unstemmed Missing value imputation designs and methods of nature-inspired metaheuristic techniques: A systematic review
title_sort missing value imputation designs and methods of nature-inspired metaheuristic techniques: a systematic review
publisher Institute of Electrical and Electronics Engineers Inc.
publishDate 2022
url http://eprints.utm.my/104382/1/AliSelamat2022_MissingValueImputationDesignsandMethods.pdf
http://eprints.utm.my/104382/
http://dx.doi.org/10.1109/ACCESS.2022.3172319
_version_ 1792147715495297024
score 13.209306