Sentiment analysis on Malay-English mixed language text using artificial neural network

Sentiment analysis (SA) is the study of people's emotions and attitudes toward a particular topic. It is beneficial for monitoring and analyzing social media text in order to gather public opinion. Despite the fact that there are SA applications for monolingual text such as English and non-Engl...

Full description

Saved in:
Bibliographic Details
Main Authors: Yann, Lim May, Zahri, N. A. H., Amir, Amiza, Romli, R., Ghazali, N. H., Anwar, S. A., Hashim, Nik Mohd Zarifie
Format: Conference or Workshop Item
Language:en
Published: 2024
Online Access:http://eprints.utem.edu.my/id/eprint/28811/1/Sentiment%20analysis%20on%20Malay-English%20mixed%20language%20text%20using%20artificial%20neural%20network.pdf
http://eprints.utem.edu.my/id/eprint/28811/
https://doi.org/10.1063/5.0192401
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1834508777715400704
author Yann, Lim May
Zahri, N. A. H.
Amir, Amiza
Romli, R.
Ghazali, N. H.
Anwar, S. A.
Hashim, Nik Mohd Zarifie
author_facet Yann, Lim May
Zahri, N. A. H.
Amir, Amiza
Romli, R.
Ghazali, N. H.
Anwar, S. A.
Hashim, Nik Mohd Zarifie
author_sort Yann, Lim May
building UTEM Library
collection Institutional Repository
content_provider Universiti Teknikal Malaysia Melaka
content_source UTEM Institutional Repository
continent Asia
country Malaysia
description Sentiment analysis (SA) is the study of people's emotions and attitudes toward a particular topic. It is beneficial for monitoring and analyzing social media text in order to gather public opinion. Despite the fact that there are SA applications for monolingual text such as English and non-English languages like Hindi, Chinese and French, the Malay language has far fewer works, not to mention the mixed language such as Malay-English (also known as Manglish). Other than comments and posts from websites and social media, the emoji used by internet users can also help to provide better insights into how they truly feel about a particular topic. Our work focuses on Malay-English mixed language comments and posts on how Malaysians feel about daily new cases of Covid-19 in Malaysia. We proposed a neural network framework to perform SA on languages spoken by Malaysians, namely Malay, English, and Malay-English, by also taking into account the emoji used by internet users. The data was pre-processed to remove noises and then transformed into word vector representation using word embedding technique. Then we propose a framework that involves training and testing mixed language textual data along with emoji analysis by using bidirectional Long Short Term Memory (biLSTM) neural network. To compare with the proposed method, several machine learning models and Long Short Term Memory (LSTM) with word vectorization was used. Finally, compared to the machine learning model such as Naïve Bayes and Logistic Regression, neural networks such as LSTM, the proposed method; biLSTM with tuned hyper-parameter for Malay-English mixed language achieved the highest accuracy of 76.6%, and macro F1-score of 69.6%.
format Conference or Workshop Item
id my.utem.eprints-28811
institution Universiti Teknikal Malaysia Melaka
language en
publishDate 2024
record_format eprints
spelling my.utem.eprints-288112025-06-05T10:19:43Z http://eprints.utem.edu.my/id/eprint/28811/ Sentiment analysis on Malay-English mixed language text using artificial neural network Yann, Lim May Zahri, N. A. H. Amir, Amiza Romli, R. Ghazali, N. H. Anwar, S. A. Hashim, Nik Mohd Zarifie Sentiment analysis (SA) is the study of people's emotions and attitudes toward a particular topic. It is beneficial for monitoring and analyzing social media text in order to gather public opinion. Despite the fact that there are SA applications for monolingual text such as English and non-English languages like Hindi, Chinese and French, the Malay language has far fewer works, not to mention the mixed language such as Malay-English (also known as Manglish). Other than comments and posts from websites and social media, the emoji used by internet users can also help to provide better insights into how they truly feel about a particular topic. Our work focuses on Malay-English mixed language comments and posts on how Malaysians feel about daily new cases of Covid-19 in Malaysia. We proposed a neural network framework to perform SA on languages spoken by Malaysians, namely Malay, English, and Malay-English, by also taking into account the emoji used by internet users. The data was pre-processed to remove noises and then transformed into word vector representation using word embedding technique. Then we propose a framework that involves training and testing mixed language textual data along with emoji analysis by using bidirectional Long Short Term Memory (biLSTM) neural network. To compare with the proposed method, several machine learning models and Long Short Term Memory (LSTM) with word vectorization was used. Finally, compared to the machine learning model such as Naïve Bayes and Logistic Regression, neural networks such as LSTM, the proposed method; biLSTM with tuned hyper-parameter for Malay-English mixed language achieved the highest accuracy of 76.6%, and macro F1-score of 69.6%. 2024 Conference or Workshop Item PeerReviewed text en http://eprints.utem.edu.my/id/eprint/28811/1/Sentiment%20analysis%20on%20Malay-English%20mixed%20language%20text%20using%20artificial%20neural%20network.pdf Yann, Lim May and Zahri, N. A. H. and Amir, Amiza and Romli, R. and Ghazali, N. H. and Anwar, S. A. and Hashim, Nik Mohd Zarifie (2024) Sentiment analysis on Malay-English mixed language text using artificial neural network. In: 6th International Conference on Electronic Design, ICED 2022, 29 August 2022, Perlis. https://doi.org/10.1063/5.0192401
spellingShingle Yann, Lim May
Zahri, N. A. H.
Amir, Amiza
Romli, R.
Ghazali, N. H.
Anwar, S. A.
Hashim, Nik Mohd Zarifie
Sentiment analysis on Malay-English mixed language text using artificial neural network
title Sentiment analysis on Malay-English mixed language text using artificial neural network
title_full Sentiment analysis on Malay-English mixed language text using artificial neural network
title_fullStr Sentiment analysis on Malay-English mixed language text using artificial neural network
title_full_unstemmed Sentiment analysis on Malay-English mixed language text using artificial neural network
title_short Sentiment analysis on Malay-English mixed language text using artificial neural network
title_sort sentiment analysis on malay-english mixed language text using artificial neural network
url http://eprints.utem.edu.my/id/eprint/28811/1/Sentiment%20analysis%20on%20Malay-English%20mixed%20language%20text%20using%20artificial%20neural%20network.pdf
http://eprints.utem.edu.my/id/eprint/28811/
https://doi.org/10.1063/5.0192401
url_provider http://eprints.utem.edu.my/