Classification of sexual harassment on Facebook using term weighting schemes

Facebook is the largest Social Network Service, and its users are growing rapidly. Facebook has become one of the main sources of information for individuals and organizations; and this exponential increase of information has raised the issue of information security. In United States alone, 62% of o...

Full description

Saved in:
Bibliographic Details
Main Authors: Sirat @ Md. Siraj, Maheyzah, Alkatheri, Amer Saeed Ali
Format: Article
Published: International Journal of Innovative Computing (IJIC) 2018
Subjects:
Online Access:http://eprints.utm.my/id/eprint/82158/
http://dx.doi.org/10.11113/ijic.v8n1.157
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.82158
record_format eprints
spelling my.utm.821582019-11-06T03:37:24Z http://eprints.utm.my/id/eprint/82158/ Classification of sexual harassment on Facebook using term weighting schemes Sirat @ Md. Siraj, Maheyzah Alkatheri, Amer Saeed Ali QA75 Electronic computers. Computer science Facebook is the largest Social Network Service, and its users are growing rapidly. Facebook has become one of the main sources of information for individuals and organizations; and this exponential increase of information has raised the issue of information security. In United States alone, 62% of online abuses occurred through Facebook and the most common form of online abuse is sexual harassment with 44%. Victims to online sexual harassment are living under pressure, because the harasser has an ability to propagate messages at any time under any identity. Several content filtering tools for web-based especially Facebook has been proposed. Most of these approaches are not suitable and has limitations when applied to current Social Network Services such as Facebook. As a result, the content-based technique which includes deeper understanding of the semantics of text would probably perform better to forbid illegal post contents. In this project, three terms weighting schemes namely Entropy, TF-IDF, and Modified TF-IDF are used as feature selection process in filtering Facebook posts. The performance of these techniques will be examined via datasets, and the accuracy of their result is measured by Support Vector Machine (SVM). Entropy, TF-IDF, and Modified TF-IDF are judged based on accuracy, precision, recall and F score. Results showed that Modified TF-IDF performed better than Entropy and TFIDF. It is hoped that this study would give other researchers an insight especially who want to work with similar area. International Journal of Innovative Computing (IJIC) 2018 Article PeerReviewed Sirat @ Md. Siraj, Maheyzah and Alkatheri, Amer Saeed Ali (2018) Classification of sexual harassment on Facebook using term weighting schemes. International Journal of Innovative Computing (IJIC), 8 (1). pp. 15-19. ISSN 2180-4370 http://dx.doi.org/10.11113/ijic.v8n1.157 DOI: 10.11113/ijic.v8n1.157
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Sirat @ Md. Siraj, Maheyzah
Alkatheri, Amer Saeed Ali
Classification of sexual harassment on Facebook using term weighting schemes
description Facebook is the largest Social Network Service, and its users are growing rapidly. Facebook has become one of the main sources of information for individuals and organizations; and this exponential increase of information has raised the issue of information security. In United States alone, 62% of online abuses occurred through Facebook and the most common form of online abuse is sexual harassment with 44%. Victims to online sexual harassment are living under pressure, because the harasser has an ability to propagate messages at any time under any identity. Several content filtering tools for web-based especially Facebook has been proposed. Most of these approaches are not suitable and has limitations when applied to current Social Network Services such as Facebook. As a result, the content-based technique which includes deeper understanding of the semantics of text would probably perform better to forbid illegal post contents. In this project, three terms weighting schemes namely Entropy, TF-IDF, and Modified TF-IDF are used as feature selection process in filtering Facebook posts. The performance of these techniques will be examined via datasets, and the accuracy of their result is measured by Support Vector Machine (SVM). Entropy, TF-IDF, and Modified TF-IDF are judged based on accuracy, precision, recall and F score. Results showed that Modified TF-IDF performed better than Entropy and TFIDF. It is hoped that this study would give other researchers an insight especially who want to work with similar area.
format Article
author Sirat @ Md. Siraj, Maheyzah
Alkatheri, Amer Saeed Ali
author_facet Sirat @ Md. Siraj, Maheyzah
Alkatheri, Amer Saeed Ali
author_sort Sirat @ Md. Siraj, Maheyzah
title Classification of sexual harassment on Facebook using term weighting schemes
title_short Classification of sexual harassment on Facebook using term weighting schemes
title_full Classification of sexual harassment on Facebook using term weighting schemes
title_fullStr Classification of sexual harassment on Facebook using term weighting schemes
title_full_unstemmed Classification of sexual harassment on Facebook using term weighting schemes
title_sort classification of sexual harassment on facebook using term weighting schemes
publisher International Journal of Innovative Computing (IJIC)
publishDate 2018
url http://eprints.utm.my/id/eprint/82158/
http://dx.doi.org/10.11113/ijic.v8n1.157
_version_ 1654960001002766336
score 13.18916