Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model

Question and answer websites such as Quora, Stack Overflow, Yahoo Answers and Answer Bag are used by professionals. Multiple users post questions on these websites to get the answers from domain specific professionals. These websites are multilingual meaning they are available in many different lang...

Full description

Saved in:
Bibliographic Details
Main Authors: Naveed, H., Sohail, A., Zain, J.M., Saleem, N., Ali, R.F., Anwar, S.
Format: Article
Published: 2023
Online Access:http://scholars.utp.edu.my/id/eprint/34244/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85135005700&doi=10.32604%2fiasc.2023.023277&partnerID=40&md5=248d0a5695a8d7f8c9425cd718f8a96a
Tags: Add Tag
No Tags, Be the first to tag this record!
id oai:scholars.utp.edu.my:34244
record_format eprints
spelling oai:scholars.utp.edu.my:342442023-01-04T02:55:19Z http://scholars.utp.edu.my/id/eprint/34244/ Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model Naveed, H. Sohail, A. Zain, J.M. Saleem, N. Ali, R.F. Anwar, S. Question and answer websites such as Quora, Stack Overflow, Yahoo Answers and Answer Bag are used by professionals. Multiple users post questions on these websites to get the answers from domain specific professionals. These websites are multilingual meaning they are available in many different languages. Current problem for these types of websites is to handle meaningless and irrelevant content. In this paper we have worked on the Quora insincere questions (questions which are based on false assumptions or questions which are trying to make a statement rather than seeking for helpful answers) dataset in order to identify user insincere questions, so that Quora can eliminate those questions from their platform and ultimately improve the communication among users over the platform. Previously, a research was carried out with recurrent neural network and pretrained glove word embeddings, that achieved the F1 score of 0.69. The proposed study has used a pre-trained ULMFiT model. This model has outperformed the previous model with an F1 score of 0.91, which is much higher than the previous studies. © 2023, Tech Science Press. All rights reserved. 2023 Article NonPeerReviewed Naveed, H. and Sohail, A. and Zain, J.M. and Saleem, N. and Ali, R.F. and Anwar, S. (2023) Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model. Intelligent Automation and Soft Computing, 35 (1). pp. 15-30. https://www.scopus.com/inward/record.uri?eid=2-s2.0-85135005700&doi=10.32604%2fiasc.2023.023277&partnerID=40&md5=248d0a5695a8d7f8c9425cd718f8a96a 10.32604/iasc.2023.023277 10.32604/iasc.2023.023277 10.32604/iasc.2023.023277
institution Universiti Teknologi Petronas
building UTP Resource Centre
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Petronas
content_source UTP Institutional Repository
url_provider http://eprints.utp.edu.my/
description Question and answer websites such as Quora, Stack Overflow, Yahoo Answers and Answer Bag are used by professionals. Multiple users post questions on these websites to get the answers from domain specific professionals. These websites are multilingual meaning they are available in many different languages. Current problem for these types of websites is to handle meaningless and irrelevant content. In this paper we have worked on the Quora insincere questions (questions which are based on false assumptions or questions which are trying to make a statement rather than seeking for helpful answers) dataset in order to identify user insincere questions, so that Quora can eliminate those questions from their platform and ultimately improve the communication among users over the platform. Previously, a research was carried out with recurrent neural network and pretrained glove word embeddings, that achieved the F1 score of 0.69. The proposed study has used a pre-trained ULMFiT model. This model has outperformed the previous model with an F1 score of 0.91, which is much higher than the previous studies. © 2023, Tech Science Press. All rights reserved.
format Article
author Naveed, H.
Sohail, A.
Zain, J.M.
Saleem, N.
Ali, R.F.
Anwar, S.
spellingShingle Naveed, H.
Sohail, A.
Zain, J.M.
Saleem, N.
Ali, R.F.
Anwar, S.
Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model
author_facet Naveed, H.
Sohail, A.
Zain, J.M.
Saleem, N.
Ali, R.F.
Anwar, S.
author_sort Naveed, H.
title Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model
title_short Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model
title_full Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model
title_fullStr Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model
title_full_unstemmed Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model
title_sort detection of toxic content on social networking platforms using fine tuned ulmfit model
publishDate 2023
url http://scholars.utp.edu.my/id/eprint/34244/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85135005700&doi=10.32604%2fiasc.2023.023277&partnerID=40&md5=248d0a5695a8d7f8c9425cd718f8a96a
_version_ 1754532147046121472
score 13.18916