A Systematic Literature Review: Are Automated Essay Scoring Systems Competent in Real-Life Education Scenarios?

Artificial intelligence technology is becoming increasingly essential to education. The outbreak of COVID-19 in recent years has led many schools to launch online education. Automated online assessments have become a hot topic of interest, and an increasing number of researchers are studying Automat...

Bibliographic Details
Main Authors: Xu, Wenbo, Mahmud, Rohana, Hoo, Wai Lam
Format: Article
Published: Institute of Electrical and Electronics Engineers 2024
Subjects: QA75 Electronic computers. Computer science
Online Access: http://eprints.um.edu.my/45904/
https://doi.org/10.1109/ACCESS.2024.3399163
id my.um.eprints.45904
record_format eprints
spelling my.um.eprints.45904 2024-11-14T04:16:52Z http://eprints.um.edu.my/45904/ Xu, Wenbo and Mahmud, Rohana and Hoo, Wai Lam (2024) A Systematic Literature Review: Are Automated Essay Scoring Systems Competent in Real-Life Education Scenarios? IEEE Access, 12. pp. 77639-77657. ISSN 2169-3536, DOI https://doi.org/10.1109/ACCESS.2024.3399163. Article, PeerReviewed.
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic QA75 Electronic computers. Computer science
description Artificial intelligence technology is becoming increasingly essential to education. The outbreak of COVID-19 in recent years has led many schools to launch online education. Automated online assessment has become a topic of considerable interest, and an increasing number of researchers are studying Automated Essay Scoring (AES). This work seeks to summarise the characteristics of current AES systems used in English writing assessment, identify their strengths and weaknesses, and analyse the limitations of recent studies and current research trends. Search strings were used to retrieve papers on AES systems published from 2018 to 2023 from four databases; after study selection and quality evaluation, 104 papers were judged able to address the research aims. The review concludes that existing AES systems, although they achieve good accuracy in specific contexts, are unable to meet the needs of teachers and students in real teaching scenarios. The improvements these systems need relate to their scalability in assessing essays on different topics or in different styles, the accuracy of the models' predicted scores, and the reliability of outcomes: improving the robustness of AES models against adversarial inputs, enriching AES system functionality, and developing AES assist tools.
format Article
author Xu, Wenbo
Mahmud, Rohana
Hoo, Wai Lam
title A Systematic Literature Review: Are Automated Essay Scoring Systems Competent in Real-Life Education Scenarios?
publisher Institute of Electrical and Electronics Engineers
publishDate 2024
url http://eprints.um.edu.my/45904/
https://doi.org/10.1109/ACCESS.2024.3399163