Approaches for preserving content integrity of sensitive online Arabic content: A survey and research challenges

Trends in Internet usage and accessing online content in different languages and formats are proliferating at a considerable speed. There is a vast amount of digital online content available in different formats that are sensitive in nature with respect to writing styles and arrangement of diacritic...

Bibliographic Details
Main Authors: Hakak, Saqib Iqbal, Kamsin, Amirrudin, Tayan, Omar, Idris, Mohd Yamani Idna, Gilkar, Gulshan Amin
Format: Article
Published: Elsevier 2019
Subjects: QA75 Electronic computers. Computer science
Online Access: http://eprints.um.edu.my/20080/
https://doi.org/10.1016/j.ipm.2017.08.004
id my.um.eprints.20080
record_format eprints
citation Hakak, Saqib Iqbal and Kamsin, Amirrudin and Tayan, Omar and Idris, Mohd Yamani Idna and Gilkar, Gulshan Amin (2019) Approaches for preserving content integrity of sensitive online Arabic content: A survey and research challenges. Information Processing & Management, 56 (2). pp. 367-380. ISSN 0306-4573 (peer reviewed; record timestamp 2019-01-22T02:35:37Z) https://doi.org/10.1016/j.ipm.2017.08.004
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic QA75 Electronic computers. Computer science
description Internet usage and access to online content in different languages and formats are growing rapidly. A vast amount of digital online content, available in different formats, is sensitive with respect to writing style and the arrangement of diacritics. However, little research has aimed at identifying the techniques suitable for preserving the integrity of such sensitive content, so it remains a challenge to determine which techniques best suit different formats such as image or binary. Preserving and verifying sensitive content is therefore an emerging problem that calls for timely solutions. The digital Holy Qur'an in Arabic is one case of such sensitive content. Because Arabic script uses diacritics (marks written above or below letters), kashidas (elongation strokes that extend letters) and other symbols, the original meaning of a text can easily be altered simply by changing the arrangement of diacritics. This article surveys the approaches currently employed to preserve and verify the content integrity of sensitive online content. We present the state of the art in content integrity verification and address the existing challenges in preserving the integrity of sensitive texts, using the digital Qur'an as a case study. The proposed taxonomy classifies and analyses existing related schemes and their limitations. The paper also discusses recommendations on the expected efficiency of such approaches when applied to digital content integrity. Some of the main findings suggest that approaches combining watermarking and string matching can be used to preserve the content integrity of any sensitive digital content.
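The abstract's point that rearranging diacritics can silently change a text can be illustrated with a minimal Python sketch. It is not taken from the surveyed paper: the function names `digest`, `strip_marks` and `check` are hypothetical, and a plain SHA-256 comparison with a diacritic-insensitive fallback stands in for the watermarking and string matching schemes the survey covers. It assumes that Arabic diacritics are Unicode combining marks (category `Mn`) and that the kashida is U+0640.

```python
import hashlib
import unicodedata

KASHIDA = "\u0640"  # ARABIC TATWEEL, the elongation stroke


def digest(text):
    """SHA-256 of the NFC-normalized text; sensitive to every diacritic."""
    normalized = unicodedata.normalize("NFC", text)
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def strip_marks(text):
    """Drop combining marks (category Mn) and kashidas, keeping base letters."""
    normalized = unicodedata.normalize("NFC", text)
    return "".join(
        ch for ch in normalized
        if unicodedata.category(ch) != "Mn" and ch != KASHIDA
    )


def check(reference, candidate):
    """Classify a candidate text against a trusted reference copy."""
    if digest(reference) == digest(candidate):
        return "intact"
    if strip_marks(reference) == strip_marks(candidate):
        # Same base letters, so only diacritics or kashidas differ.
        return "diacritics-or-kashida-altered"
    return "letters-altered"


# Illustrative words written as escapes: ain, lam, meem with short vowels.
reference = "\u0639\u064E\u0644\u0650\u0645\u064E"   # fatha on the first letter
revowelled = "\u0639\u064F\u0644\u0650\u0645\u064E"  # damma on the first letter
stretched = "\u0639\u064E\u0640\u0644\u0650\u0645\u064E"  # kashida inserted
other = "\u0642\u064E\u0644\u0650\u0645\u064E"       # first letter replaced

for label, text in [("reference", reference), ("revowelled", revowelled),
                    ("stretched", stretched), ("other", other)]:
    print(label, check(reference, text))
```

A hash alone only reports that something changed; the fallback comparison is what separates a diacritic-level alteration from a change to the letters themselves.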
format Article
author Hakak, Saqib Iqbal
Kamsin, Amirrudin
Tayan, Omar
Idris, Mohd Yamani Idna
Gilkar, Gulshan Amin
author_sort Hakak, Saqib Iqbal
title Approaches for preserving content integrity of sensitive online Arabic content: A survey and research challenges
publisher Elsevier
publishDate 2019