HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification

We present the findings of our participation in the SemEval-2023 Task 10: Explainable Detection of Online Sexism (EDOS) task, a shared task on offensive language (sexism) detection on English Gab and Reddit dataset. We investigated the effects of transferring two language models: XLM-T (sentiment cl...

Full description

Saved in:
Bibliographic Details
Main Authors: Aliyu, S.M., Abdulmumin, I., Muhammad, S.H., Ahmad, I.S., Salahudeen, S.A., Yusuf, A., Lawan, F.I.
Format: Conference or Workshop Item
Published: Association for Computational Linguistics 2023
Online Access:http://scholars.utp.edu.my/id/eprint/38026/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85175400718&partnerID=40&md5=50bd07eed227e02d3ae47e0fe7e50f81
Tags: Add Tag
No Tags, Be the first to tag this record!
id oai:scholars.utp.edu.my:38026
record_format eprints
spelling oai:scholars.utp.edu.my:380262023-12-11T03:01:29Z http://scholars.utp.edu.my/id/eprint/38026/ HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification Aliyu, S.M. Abdulmumin, I. Muhammad, S.H. Ahmad, I.S. Salahudeen, S.A. Yusuf, A. Lawan, F.I. We present the findings of our participation in the SemEval-2023 Task 10: Explainable Detection of Online Sexism (EDOS) task, a shared task on offensive language (sexism) detection on English Gab and Reddit dataset. We investigated the effects of transferring two language models: XLM-T (sentiment classification) and HateBERT (same domain - Reddit) for multilevel classification into Sexist or not Sexist, and other subsequent sub-classifications of the sexist data. We also use synthetic classification of unlabelled dataset and intermediary class information to maximize the performance of our models. We submitted a system in Task A, and it ranked 49th with F1-score of 0.82. This result showed to be competitive as it only under-performed the best system by 0.052 F1-score. © 2023 Association for Computational Linguistics. Association for Computational Linguistics 2023 Conference or Workshop Item NonPeerReviewed Aliyu, S.M. and Abdulmumin, I. and Muhammad, S.H. and Ahmad, I.S. and Salahudeen, S.A. and Yusuf, A. and Lawan, F.I. (2023) HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification. In: UNSPECIFIED. https://www.scopus.com/inward/record.uri?eid=2-s2.0-85175400718&partnerID=40&md5=50bd07eed227e02d3ae47e0fe7e50f81
institution Universiti Teknologi Petronas
building UTP Resource Centre
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Petronas
content_source UTP Institutional Repository
url_provider http://eprints.utp.edu.my/
description We present the findings of our participation in the SemEval-2023 Task 10: Explainable Detection of Online Sexism (EDOS) task, a shared task on offensive language (sexism) detection on English Gab and Reddit dataset. We investigated the effects of transferring two language models: XLM-T (sentiment classification) and HateBERT (same domain - Reddit) for multilevel classification into Sexist or not Sexist, and other subsequent sub-classifications of the sexist data. We also use synthetic classification of unlabelled dataset and intermediary class information to maximize the performance of our models. We submitted a system in Task A, and it ranked 49th with F1-score of 0.82. This result showed to be competitive as it only under-performed the best system by 0.052 F1-score. © 2023 Association for Computational Linguistics.
format Conference or Workshop Item
author Aliyu, S.M.
Abdulmumin, I.
Muhammad, S.H.
Ahmad, I.S.
Salahudeen, S.A.
Yusuf, A.
Lawan, F.I.
spellingShingle Aliyu, S.M.
Abdulmumin, I.
Muhammad, S.H.
Ahmad, I.S.
Salahudeen, S.A.
Yusuf, A.
Lawan, F.I.
HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification
author_facet Aliyu, S.M.
Abdulmumin, I.
Muhammad, S.H.
Ahmad, I.S.
Salahudeen, S.A.
Yusuf, A.
Lawan, F.I.
author_sort Aliyu, S.M.
title HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification
title_short HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification
title_full HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification
title_fullStr HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification
title_full_unstemmed HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification
title_sort hausanlp at semeval-2023 task 10: transfer learning, synthetic data and side-information for multi-level sexism classification
publisher Association for Computational Linguistics
publishDate 2023
url http://scholars.utp.edu.my/id/eprint/38026/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85175400718&partnerID=40&md5=50bd07eed227e02d3ae47e0fe7e50f81
_version_ 1787138257057742848
score 13.222552