An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction

Improving machine learning algorithms has been the interest of data scientists and researchers for the past few years. Among the performance problems raised is the classification imbalance issues listed as the top ten. The present study makes comparison and analysis of 5 state-of-art classifiers, 5...

Full description

Saved in:
Bibliographic Details
Main Authors: Hassan, H., Ahmad, N. B., Sallehuddin, R.
Format: Conference or Workshop Item
Published: 2021
Subjects:
Online Access:http://eprints.utm.my/id/eprint/95767/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.95767
record_format eprints
spelling my.utm.957672022-05-31T13:18:57Z http://eprints.utm.my/id/eprint/95767/ An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction Hassan, H. Ahmad, N. B. Sallehuddin, R. QA75 Electronic computers. Computer science Improving machine learning algorithms has been the interest of data scientists and researchers for the past few years. Among the performance problems raised is the classification imbalance issues listed as the top ten. The present study makes comparison and analysis of 5 state-of-art classifiers, 5 ensembles classifiers and 10 resampling techniques for data imbalance. This is done via the used 4413 instances consisting of demographic, economic, and behavioural data from student information systems and e-learning, as well as engineering faculty for the first semester 2017/2018. The use of three sampling types was adapted for the analysis: oversampling, undersampling and hybrid. The experimental results prove to model students’ behaviour from e-learning data and bagging decision tree ensemble classifier produces the highest results. Lastly, a hybrid resampling technique, SMOTEENN consistently shows the top result compared to other resampling techniques. 2021 Conference or Workshop Item PeerReviewed Hassan, H. and Ahmad, N. B. and Sallehuddin, R. (2021) An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction. In: 7th International Conference on Computational Science and Technology, ICCST 2020, 29 August 2020 - 30 August 2020, Pattaya, Thailand.
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Hassan, H.
Ahmad, N. B.
Sallehuddin, R.
An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction
description Improving machine learning algorithms has been the interest of data scientists and researchers for the past few years. Among the performance problems raised is the classification imbalance issues listed as the top ten. The present study makes comparison and analysis of 5 state-of-art classifiers, 5 ensembles classifiers and 10 resampling techniques for data imbalance. This is done via the used 4413 instances consisting of demographic, economic, and behavioural data from student information systems and e-learning, as well as engineering faculty for the first semester 2017/2018. The use of three sampling types was adapted for the analysis: oversampling, undersampling and hybrid. The experimental results prove to model students’ behaviour from e-learning data and bagging decision tree ensemble classifier produces the highest results. Lastly, a hybrid resampling technique, SMOTEENN consistently shows the top result compared to other resampling techniques.
format Conference or Workshop Item
author Hassan, H.
Ahmad, N. B.
Sallehuddin, R.
author_facet Hassan, H.
Ahmad, N. B.
Sallehuddin, R.
author_sort Hassan, H.
title An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction
title_short An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction
title_full An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction
title_fullStr An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction
title_full_unstemmed An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction
title_sort empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction
publishDate 2021
url http://eprints.utm.my/id/eprint/95767/
_version_ 1735386844750675968
score 13.160551