Comparisons of automated machine learning (AutoML) in predicting whistleblowing of academic dishonesty with demographic and theory of planned behavior

Machine learning has been very promising in solving real problems, but the implementation involved difficulties mainly for the inexpert data scientists. Therefore, this paper presents an automated machine learning (AutoML) to simplify and accelerate the modeling tasks. Focused on Python and RapidMin...

Full description

Saved in:
Bibliographic Details
Main Authors: Rahman, R.A., Masrom, S., Mohamad, M., Sari, E.N., Saragih, F., Rahman, A.S.A.
Format: Article
Published: Elsevier B.V. 2023
Online Access:http://scholars.utp.edu.my/id/eprint/37269/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85171322687&doi=10.1016%2fj.mex.2023.102364&partnerID=40&md5=2a8e87d40f36edebb10d81f9d465695b
Tags: Add Tag
No Tags, Be the first to tag this record!
id oai:scholars.utp.edu.my:37269
record_format eprints
spelling oai:scholars.utp.edu.my:372692023-10-04T08:36:23Z http://scholars.utp.edu.my/id/eprint/37269/ Comparisons of automated machine learning (AutoML) in predicting whistleblowing of academic dishonesty with demographic and theory of planned behavior Rahman, R.A. Masrom, S. Mohamad, M. Sari, E.N. Saragih, F. Rahman, A.S.A. Machine learning has been very promising in solving real problems, but the implementation involved difficulties mainly for the inexpert data scientists. Therefore, this paper presents an automated machine learning (AutoML) to simplify and accelerate the modeling tasks. Focused on Python and RapidMiner rapid modeling tools, Tree-based Pipeline Optimization Tool (TPOT) and AutoModel were used. This paper presents a comprehensive comparison between these tools with regard to the prediction accuracy and Area Under Curve (AUC) in classifying real cases of whistleblowing academic dishonesty among undergraduate students of two universities in Indonesia. Additionally, the correlations weight from demographic and Theory of Planned Behavior (TOB) attributes in the different machine learning models are also discussed. All the machine learning algorithms from TPOT and AutoModel are considerable powerful to generate good accuracy level (between 70�93 of AUC) in classifying both cases of whistleblowing and non-whistleblowing on the hold-out samples from the testing process. Generally, based on the validation results of the prediction models, demographic attributes presented more importance than the TBP attributes. The findings of this study will be a great interest of many research scholars to conduct a more in-depth analysis on AutoML for many domains mainly in education and academic misconduct fields. � AutoML is the first of its kind to be empirically compared between TPOT and AutoModel in an application to predict academic dishonesty whistleblowing. � Besides accuracy performances of the AutoML, the proportion of the variance of each attribute from demographic and Theory of Planned Behavior (TPB) is also presented in the prediction models of academic dishonesty whistleblowing. � AutoML is a convenient and reproducible rapid modeling method of machine learning to be used in many kinds of prediction problem. © 2023 Elsevier B.V. 2023 Article NonPeerReviewed Rahman, R.A. and Masrom, S. and Mohamad, M. and Sari, E.N. and Saragih, F. and Rahman, A.S.A. (2023) Comparisons of automated machine learning (AutoML) in predicting whistleblowing of academic dishonesty with demographic and theory of planned behavior. MethodsX, 11. ISSN 22150161 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85171322687&doi=10.1016%2fj.mex.2023.102364&partnerID=40&md5=2a8e87d40f36edebb10d81f9d465695b 10.1016/j.mex.2023.102364 10.1016/j.mex.2023.102364 10.1016/j.mex.2023.102364
institution Universiti Teknologi Petronas
building UTP Resource Centre
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Petronas
content_source UTP Institutional Repository
url_provider http://eprints.utp.edu.my/
description Machine learning has been very promising in solving real problems, but the implementation involved difficulties mainly for the inexpert data scientists. Therefore, this paper presents an automated machine learning (AutoML) to simplify and accelerate the modeling tasks. Focused on Python and RapidMiner rapid modeling tools, Tree-based Pipeline Optimization Tool (TPOT) and AutoModel were used. This paper presents a comprehensive comparison between these tools with regard to the prediction accuracy and Area Under Curve (AUC) in classifying real cases of whistleblowing academic dishonesty among undergraduate students of two universities in Indonesia. Additionally, the correlations weight from demographic and Theory of Planned Behavior (TOB) attributes in the different machine learning models are also discussed. All the machine learning algorithms from TPOT and AutoModel are considerable powerful to generate good accuracy level (between 70�93 of AUC) in classifying both cases of whistleblowing and non-whistleblowing on the hold-out samples from the testing process. Generally, based on the validation results of the prediction models, demographic attributes presented more importance than the TBP attributes. The findings of this study will be a great interest of many research scholars to conduct a more in-depth analysis on AutoML for many domains mainly in education and academic misconduct fields. � AutoML is the first of its kind to be empirically compared between TPOT and AutoModel in an application to predict academic dishonesty whistleblowing. � Besides accuracy performances of the AutoML, the proportion of the variance of each attribute from demographic and Theory of Planned Behavior (TPB) is also presented in the prediction models of academic dishonesty whistleblowing. � AutoML is a convenient and reproducible rapid modeling method of machine learning to be used in many kinds of prediction problem. © 2023
format Article
author Rahman, R.A.
Masrom, S.
Mohamad, M.
Sari, E.N.
Saragih, F.
Rahman, A.S.A.
spellingShingle Rahman, R.A.
Masrom, S.
Mohamad, M.
Sari, E.N.
Saragih, F.
Rahman, A.S.A.
Comparisons of automated machine learning (AutoML) in predicting whistleblowing of academic dishonesty with demographic and theory of planned behavior
author_facet Rahman, R.A.
Masrom, S.
Mohamad, M.
Sari, E.N.
Saragih, F.
Rahman, A.S.A.
author_sort Rahman, R.A.
title Comparisons of automated machine learning (AutoML) in predicting whistleblowing of academic dishonesty with demographic and theory of planned behavior
title_short Comparisons of automated machine learning (AutoML) in predicting whistleblowing of academic dishonesty with demographic and theory of planned behavior
title_full Comparisons of automated machine learning (AutoML) in predicting whistleblowing of academic dishonesty with demographic and theory of planned behavior
title_fullStr Comparisons of automated machine learning (AutoML) in predicting whistleblowing of academic dishonesty with demographic and theory of planned behavior
title_full_unstemmed Comparisons of automated machine learning (AutoML) in predicting whistleblowing of academic dishonesty with demographic and theory of planned behavior
title_sort comparisons of automated machine learning (automl) in predicting whistleblowing of academic dishonesty with demographic and theory of planned behavior
publisher Elsevier B.V.
publishDate 2023
url http://scholars.utp.edu.my/id/eprint/37269/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85171322687&doi=10.1016%2fj.mex.2023.102364&partnerID=40&md5=2a8e87d40f36edebb10d81f9d465695b
_version_ 1779441357619724288
score 13.214268