Staff View: Improving the classification performance on imbalanced data sets via new hybrid parameterisation model

Improving the classification performance on imbalanced data sets via new hybrid parameterisation model

The aim of this work is to analyse the performance of the new proposed hybrid parameterisation model in handling problematic data. Three types of problematic data will be highlighted in this paper: i) big data set, ii) uncertain and inconsistent data set and iii) imbalanced data set. The proposed hy...

Full description

Saved in:

Bibliographic Details
Main Authors:	Mohamad, M., Selamat, A., Subroto, I. M., Krejcar, O.
Format:	Article
Language:	English
Published:	King Saud bin Abdulaziz University 2021
Subjects:	QA75 Electronic computers. Computer science
Online Access:	http://eprints.utm.my/id/eprint/95554/1/AliSelamat2021_ImprovingtheClassificationPerformance.pdf http://eprints.utm.my/id/eprint/95554/ http://dx.doi.org/10.1016/j.jksuci.2019.04.009
Tags:	Add Tag No Tags, Be the first to tag this record!

id	my.utm.95554
record_format	eprints
spelling	my.utm.955542022-05-31T12:46:21Z http://eprints.utm.my/id/eprint/95554/ Improving the classification performance on imbalanced data sets via new hybrid parameterisation model Mohamad, M. Selamat, A. Subroto, I. M. Krejcar, O. QA75 Electronic computers. Computer science The aim of this work is to analyse the performance of the new proposed hybrid parameterisation model in handling problematic data. Three types of problematic data will be highlighted in this paper: i) big data set, ii) uncertain and inconsistent data set and iii) imbalanced data set. The proposed hybrid model is an integration of three main phases which consist of the data decomposition, parameter reduction and parameter selection phases. Three main methods, which are soft set and rough set theories, were implemented to reduce and to select the optimised parameter set, while a neural network was used to classify the optimised data set. This proposed model can process a data set that might contain uncertain, inconsistent and imbalanced data. Therefore, one additional phase, data decomposition, was introduced and executed after the pre-processing task was completed in order to manage the big data issue. Imbalanced data sets were used to evaluate the capability of the proposed hybrid model in handling problematic data. The experimental results demonstrate that the proposed hybrid model has the potential to be implemented with any type of data set in a classification task, especially with complex data sets. King Saud bin Abdulaziz University 2021 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/95554/1/AliSelamat2021_ImprovingtheClassificationPerformance.pdf Mohamad, M. and Selamat, A. and Subroto, I. M. and Krejcar, O. (2021) Improving the classification performance on imbalanced data sets via new hybrid parameterisation model. Journal of King Saud University - Computer and Information Sciences, 33 (7). ISSN 1319-1578 http://dx.doi.org/10.1016/j.jksuci.2019.04.009 DOI: 10.1016/j.jksuci.2019.04.009
institution	Universiti Teknologi Malaysia
building	UTM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Teknologi Malaysia
content_source	UTM Institutional Repository
url_provider	http://eprints.utm.my/
language	English
topic	QA75 Electronic computers. Computer science
spellingShingle	QA75 Electronic computers. Computer science Mohamad, M. Selamat, A. Subroto, I. M. Krejcar, O. Improving the classification performance on imbalanced data sets via new hybrid parameterisation model
description	The aim of this work is to analyse the performance of the new proposed hybrid parameterisation model in handling problematic data. Three types of problematic data will be highlighted in this paper: i) big data set, ii) uncertain and inconsistent data set and iii) imbalanced data set. The proposed hybrid model is an integration of three main phases which consist of the data decomposition, parameter reduction and parameter selection phases. Three main methods, which are soft set and rough set theories, were implemented to reduce and to select the optimised parameter set, while a neural network was used to classify the optimised data set. This proposed model can process a data set that might contain uncertain, inconsistent and imbalanced data. Therefore, one additional phase, data decomposition, was introduced and executed after the pre-processing task was completed in order to manage the big data issue. Imbalanced data sets were used to evaluate the capability of the proposed hybrid model in handling problematic data. The experimental results demonstrate that the proposed hybrid model has the potential to be implemented with any type of data set in a classification task, especially with complex data sets.
format	Article
author	Mohamad, M. Selamat, A. Subroto, I. M. Krejcar, O.
author_facet	Mohamad, M. Selamat, A. Subroto, I. M. Krejcar, O.
author_sort	Mohamad, M.
title	Improving the classification performance on imbalanced data sets via new hybrid parameterisation model
title_short	Improving the classification performance on imbalanced data sets via new hybrid parameterisation model
title_full	Improving the classification performance on imbalanced data sets via new hybrid parameterisation model
title_fullStr	Improving the classification performance on imbalanced data sets via new hybrid parameterisation model
title_full_unstemmed	Improving the classification performance on imbalanced data sets via new hybrid parameterisation model
title_sort	improving the classification performance on imbalanced data sets via new hybrid parameterisation model
publisher	King Saud bin Abdulaziz University
publishDate	2021
url	http://eprints.utm.my/id/eprint/95554/1/AliSelamat2021_ImprovingtheClassificationPerformance.pdf http://eprints.utm.my/id/eprint/95554/ http://dx.doi.org/10.1016/j.jksuci.2019.04.009
_version_	1735386818478604288
score	13.251813

Improving the classification performance on imbalanced data sets via new hybrid parameterisation model

Similar Items