An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction
Improving machine learning algorithms has been the interest of data scientists and researchers for the past few years. Among the performance problems raised is the classification imbalance issues listed as the top ten. The present study makes comparison and analysis of 5 state-of-art classifiers, 5...
Saved in:
Main Authors: | , , |
---|---|
Format: | Conference or Workshop Item |
Published: |
2021
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/95767/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utm.95767 |
---|---|
record_format |
eprints |
spelling |
my.utm.957672022-05-31T13:18:57Z http://eprints.utm.my/id/eprint/95767/ An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction Hassan, H. Ahmad, N. B. Sallehuddin, R. QA75 Electronic computers. Computer science Improving machine learning algorithms has been the interest of data scientists and researchers for the past few years. Among the performance problems raised is the classification imbalance issues listed as the top ten. The present study makes comparison and analysis of 5 state-of-art classifiers, 5 ensembles classifiers and 10 resampling techniques for data imbalance. This is done via the used 4413 instances consisting of demographic, economic, and behavioural data from student information systems and e-learning, as well as engineering faculty for the first semester 2017/2018. The use of three sampling types was adapted for the analysis: oversampling, undersampling and hybrid. The experimental results prove to model students’ behaviour from e-learning data and bagging decision tree ensemble classifier produces the highest results. Lastly, a hybrid resampling technique, SMOTEENN consistently shows the top result compared to other resampling techniques. 2021 Conference or Workshop Item PeerReviewed Hassan, H. and Ahmad, N. B. and Sallehuddin, R. (2021) An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction. In: 7th International Conference on Computational Science and Technology, ICCST 2020, 29 August 2020 - 30 August 2020, Pattaya, Thailand. |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Hassan, H. Ahmad, N. B. Sallehuddin, R. An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction |
description |
Improving machine learning algorithms has been the interest of data scientists and researchers for the past few years. Among the performance problems raised is the classification imbalance issues listed as the top ten. The present study makes comparison and analysis of 5 state-of-art classifiers, 5 ensembles classifiers and 10 resampling techniques for data imbalance. This is done via the used 4413 instances consisting of demographic, economic, and behavioural data from student information systems and e-learning, as well as engineering faculty for the first semester 2017/2018. The use of three sampling types was adapted for the analysis: oversampling, undersampling and hybrid. The experimental results prove to model students’ behaviour from e-learning data and bagging decision tree ensemble classifier produces the highest results. Lastly, a hybrid resampling technique, SMOTEENN consistently shows the top result compared to other resampling techniques. |
format |
Conference or Workshop Item |
author |
Hassan, H. Ahmad, N. B. Sallehuddin, R. |
author_facet |
Hassan, H. Ahmad, N. B. Sallehuddin, R. |
author_sort |
Hassan, H. |
title |
An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction |
title_short |
An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction |
title_full |
An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction |
title_fullStr |
An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction |
title_full_unstemmed |
An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction |
title_sort |
empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction |
publishDate |
2021 |
url |
http://eprints.utm.my/id/eprint/95767/ |
_version_ |
1735386844750675968 |
score |
13.160551 |