HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification
We present the findings of our participation in the SemEval-2023 Task 10: Explainable Detection of Online Sexism (EDOS) task, a shared task on offensive language (sexism) detection on English Gab and Reddit dataset. We investigated the effects of transferring two language models: XLM-T (sentiment cl...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Conference or Workshop Item |
Published: |
Association for Computational Linguistics
2023
|
Online Access: | http://scholars.utp.edu.my/id/eprint/38026/ https://www.scopus.com/inward/record.uri?eid=2-s2.0-85175400718&partnerID=40&md5=50bd07eed227e02d3ae47e0fe7e50f81 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
oai:scholars.utp.edu.my:38026 |
---|---|
record_format |
eprints |
spelling |
oai:scholars.utp.edu.my:380262023-12-11T03:01:29Z http://scholars.utp.edu.my/id/eprint/38026/ HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification Aliyu, S.M. Abdulmumin, I. Muhammad, S.H. Ahmad, I.S. Salahudeen, S.A. Yusuf, A. Lawan, F.I. We present the findings of our participation in the SemEval-2023 Task 10: Explainable Detection of Online Sexism (EDOS) task, a shared task on offensive language (sexism) detection on English Gab and Reddit dataset. We investigated the effects of transferring two language models: XLM-T (sentiment classification) and HateBERT (same domain - Reddit) for multilevel classification into Sexist or not Sexist, and other subsequent sub-classifications of the sexist data. We also use synthetic classification of unlabelled dataset and intermediary class information to maximize the performance of our models. We submitted a system in Task A, and it ranked 49th with F1-score of 0.82. This result showed to be competitive as it only under-performed the best system by 0.052 F1-score. © 2023 Association for Computational Linguistics. Association for Computational Linguistics 2023 Conference or Workshop Item NonPeerReviewed Aliyu, S.M. and Abdulmumin, I. and Muhammad, S.H. and Ahmad, I.S. and Salahudeen, S.A. and Yusuf, A. and Lawan, F.I. (2023) HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification. In: UNSPECIFIED. https://www.scopus.com/inward/record.uri?eid=2-s2.0-85175400718&partnerID=40&md5=50bd07eed227e02d3ae47e0fe7e50f81 |
institution |
Universiti Teknologi Petronas |
building |
UTP Resource Centre |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Petronas |
content_source |
UTP Institutional Repository |
url_provider |
http://eprints.utp.edu.my/ |
description |
We present the findings of our participation in the SemEval-2023 Task 10: Explainable Detection of Online Sexism (EDOS) task, a shared task on offensive language (sexism) detection on English Gab and Reddit dataset. We investigated the effects of transferring two language models: XLM-T (sentiment classification) and HateBERT (same domain - Reddit) for multilevel classification into Sexist or not Sexist, and other subsequent sub-classifications of the sexist data. We also use synthetic classification of unlabelled dataset and intermediary class information to maximize the performance of our models. We submitted a system in Task A, and it ranked 49th with F1-score of 0.82. This result showed to be competitive as it only under-performed the best system by 0.052 F1-score. © 2023 Association for Computational Linguistics. |
format |
Conference or Workshop Item |
author |
Aliyu, S.M. Abdulmumin, I. Muhammad, S.H. Ahmad, I.S. Salahudeen, S.A. Yusuf, A. Lawan, F.I. |
spellingShingle |
Aliyu, S.M. Abdulmumin, I. Muhammad, S.H. Ahmad, I.S. Salahudeen, S.A. Yusuf, A. Lawan, F.I. HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification |
author_facet |
Aliyu, S.M. Abdulmumin, I. Muhammad, S.H. Ahmad, I.S. Salahudeen, S.A. Yusuf, A. Lawan, F.I. |
author_sort |
Aliyu, S.M. |
title |
HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification |
title_short |
HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification |
title_full |
HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification |
title_fullStr |
HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification |
title_full_unstemmed |
HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification |
title_sort |
hausanlp at semeval-2023 task 10: transfer learning, synthetic data and side-information for multi-level sexism classification |
publisher |
Association for Computational Linguistics |
publishDate |
2023 |
url |
http://scholars.utp.edu.my/id/eprint/38026/ https://www.scopus.com/inward/record.uri?eid=2-s2.0-85175400718&partnerID=40&md5=50bd07eed227e02d3ae47e0fe7e50f81 |
_version_ |
1787138257057742848 |
score |
13.222552 |