Multi-Class Multi-Level Classification of Mental Health Disorders Based on Textual Data from Social Media

Mental health disorders pose a significant global public health challenge. Social media data provides insights into these conditions. Analysing text can help identify indications of mental health disorders. However, despite the large number of studies on the analysis of...

Bibliographic Details
Main Authors: Sutranggono, Abi Nizar, Sarno, Riyanarto, Ghozali, Imam
Format: Article
Language: English
Published: Universiti Utara Malaysia Press 2024
Subjects: QA75 Electronic computers. Computer science
Online Access: https://repo.uum.edu.my/id/eprint/30349/1/JICT%2023%2001%202024%2077-104.pdf
https://doi.org/10.32890/jict2024.23.1.4
https://repo.uum.edu.my/id/eprint/30349/
https://e-journal.uum.edu.my/index.php/jict/article/view/19042
id my.uum.repo.30349
record_format eprints
spelling my.uum.repo.30349 2024-02-01T13:58:30Z Sutranggono, Abi Nizar and Sarno, Riyanarto and Ghozali, Imam (2024) Multi-Class Multi-Level Classification of Mental Health Disorders Based on Textual Data from Social Media. Journal of Information and Communication Technology, 23 (1). pp. 77-104. ISSN 2180-3862 PeerReviewed application/pdf en cc4_by
institution Universiti Utara Malaysia
building UUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Utara Malaysia
content_source UUM Institutional Repository
url_provider http://repo.uum.edu.my/
language English
topic QA75 Electronic computers. Computer science
description Mental health disorders pose a significant global public health challenge, and text from social media can reveal indications of these conditions. However, despite the large number of studies on the analysis of mental health disorders, the predominant algorithm in the existing literature is the Multi-Class Single-Level (MCSL) classification algorithm, which is often used for simple classification tasks involving a limited number of classes. Typically, these classes are binary, representing either an unhealthy or a healthy mental state. This paper uses English text data from Reddit to classify mental health disorders. The Multi-Class Multi-Level (MCML) classification algorithm was applied to perform detailed classification and address this limited scope, using machine learning, deep learning, and transfer learning approaches. Two pre-processing scenarios were proposed to handle unstructured text data, one of the most challenging aspects of classifying text from social media. The experimental results show that the MCML classification algorithm performs detailed classification and produces promising results at each classification level. The proposed pre-processing scenarios influence the performance of each classifier and improve classification accuracy. The best accuracy was obtained with the Robustly Optimised BERT Pre-training Approach (RoBERTa) classifier: 0.98 at level 1 and 0.85 at level 2. Overall, the results suggest that the MCML classification algorithm can serve as a benchmark for early detection of mental health disorders from text.
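The multi-level idea in the description can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not define the two levels, so the sketch assumes that level 1 separates posts with and without signs of a disorder and that level 2 assigns a specific disorder. The class names, label names, and toy posts are hypothetical, and a tiny Naive Bayes model stands in for the machine learning, deep learning, and RoBERTa classifiers the paper evaluates.

```python
import math
from collections import Counter, defaultdict


class NaiveBayes:
    """Tiny multinomial Naive Bayes over whitespace tokens (illustrative only)."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(text.lower().split())
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)  # add-one smoothing
            score = math.log(n_docs / total_docs)
            for word in text.lower().split():
                if word in self.vocab:  # ignore words never seen in training
                    score += math.log((counts[word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


class TwoLevelClassifier:
    """Level 1 picks a coarse class; level 2 refines it with a per-class model."""

    def fit(self, texts, level1_labels, level2_labels):
        self.level1 = NaiveBayes().fit(texts, level1_labels)
        by_parent = defaultdict(lambda: ([], []))
        for text, parent, child in zip(texts, level1_labels, level2_labels):
            if child is not None:  # some coarse classes have no sub-classes
                by_parent[parent][0].append(text)
                by_parent[parent][1].append(child)
        self.level2 = {
            parent: NaiveBayes().fit(child_texts, child_labels)
            for parent, (child_texts, child_labels) in by_parent.items()
        }
        return self

    def predict(self, text):
        parent = self.level1.predict(text)
        child_model = self.level2.get(parent)
        child = child_model.predict(text) if child_model else None
        return parent, child


# Hypothetical toy posts and labels, invented for this sketch.
TEXTS = [
    "i feel hopeless and empty every day",
    "nothing brings me joy and i feel empty",
    "my heart races and i panic in crowds",
    "constant worry makes me panic at night",
    "had a great hike with friends today",
    "enjoyed cooking dinner with my family",
]
LEVEL1 = ["disorder", "disorder", "disorder", "disorder", "no_disorder", "no_disorder"]
LEVEL2 = ["depression", "depression", "anxiety", "anxiety", None, None]

model = TwoLevelClassifier().fit(TEXTS, LEVEL1, LEVEL2)
print(model.predict("i panic and worry all night"))
```

Routing each post through a coarse model before a finer one means an error at level 1 carries over to level 2, which may be one reason the reported level 2 accuracy (0.85) is lower than the level 1 accuracy (0.98).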