Hybrid Feature Extraction for Multi-Label Emotion Classification in English Text Messages

Emotions are vital for identifying an individual�s attitude and mental condition. Detecting and classifying emotions in Natural Language Processing applications can improve Human�Computer Interaction systems, leading to effective decision making in organizations. Several studies on emotion class...

Full description

Saved in:
Bibliographic Details
Main Authors: Ahanin, Z., Ismail, M.A., Singh, N.S.S., AL-Ashmori, A.
Format: Article
Published: Multidisciplinary Digital Publishing Institute (MDPI) 2023
Online Access:http://scholars.utp.edu.my/id/eprint/37421/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85169150020&doi=10.3390%2fsu151612539&partnerID=40&md5=8985e6e6b20628d2366925d67d0bf345
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Emotions are vital for identifying an individual�s attitude and mental condition. Detecting and classifying emotions in Natural Language Processing applications can improve Human�Computer Interaction systems, leading to effective decision making in organizations. Several studies on emotion classification have employed word embedding as a feature extraction method, but they do not consider the sentiment polarity of words. Moreover, relying exclusively on deep learning models to extract linguistic features may result in misclassifications due to the small training dataset. In this paper, we present a hybrid feature extraction model using human-engineered features combined with deep learning based features for emotion classification in English text. The proposed model uses data augmentation, captures contextual information, integrates knowledge from lexical resources, and employs deep learning models, including Bidirectional Long Short-Term Memory (Bi-LSTM) and Bidirectional Encoder Representation and Transformer (BERT), to address the issues mentioned above. The proposed model with hybrid features attained the highest Jaccard accuracy on two of the benchmark datasets, with 68.40 on SemEval-2018 and 53.45 on the GoEmotions dataset. The results show the significance of the proposed technique, and we can conclude that the incorporation of the hybrid features improves the performance of the baseline models. © 2023 by the authors.