Content extraction of historical Malay manuscripts based on Event Ontology Framework

This article aims to explore representation of the content knowledge of historical Malay manuscripts by extracting the event features using an event ontology framework. The manuscript used during the testing is Sulalatus Salatin (Sejarah Melayu ) by Abdul Ahmad Samad and it was published at Univer...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohd Nor, Zahila, M. Khalid, Yanti Idaya Aspura, Abdullah, Noorhidawati
Format: Article
Language:English
English
English
Published: IOS Press 2021
Subjects:
Online Access:http://irep.iium.edu.my/90003/1/90003_Content%20extraction%20of%20historical%20Malay%20manuscripts.pdf
http://irep.iium.edu.my/90003/2/Content%20extraction%20of%20historical%20Malay%20manuscripts.pdf
http://irep.iium.edu.my/90003/13/90003_Content%20extraction%20of%20historical%20Malay%20manuscripts%20based%20on%20Event%20Ontology%20Framework_Scopus.pdf
http://irep.iium.edu.my/90003/
https://content.iospress.com/articles/applied-ontology/ao210247
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.iium.irep.90003
record_format dspace
spelling my.iium.irep.900032021-08-13T00:19:49Z http://irep.iium.edu.my/90003/ Content extraction of historical Malay manuscripts based on Event Ontology Framework Mohd Nor, Zahila M. Khalid, Yanti Idaya Aspura Abdullah, Noorhidawati Z665 Library Science. Information Science This article aims to explore representation of the content knowledge of historical Malay manuscripts by extracting the event features using an event ontology framework. The manuscript used during the testing is Sulalatus Salatin (Sejarah Melayu ) by Abdul Ahmad Samad and it was published at University of Malaya Digital Library database. In aligning to a domain-specific ontology, the Simple Event Model (SEM) model is adopted and an event-based ontology for historical Malay manuscripts is designed. Information extraction approach is done manually to extract events from the manuscript and mapped into Protégé editor. Competency questions were constructed and submitted to the Protégé editor using SPARQL to check the ontology capability of providing answers as well as to examine its correctness. Event-based ontology model assists in discovering and representing the content knowledge of historical Malay manuscripts and supports organisation of knowledge. All the main concepts are extracted from selected Malay manuscript and 17 concepts used to develop the event-based ontology model. The knowledge was verified by three domain experts in Malay manuscript. In the findings, the interrater reliability for Event and Actor instances is 84%, which means 16% of instances and its type are incorrect and need amendment. For Place, interrater reliability is 95% and 99% for Role. Meanwhile, the experts achieved 100% agreement for Time. In addition, the experts agreed that the concepts, properties and instances for Malay Manuscript Ontology and complied with the criteria of consistency, completeness, conciseness, expandability and ease of use. The development of the event-based model of an ontology-based system with a high level of semantic granularity reflects the various cultural riches and intellectual aspect stored in Malay manuscripts. This will enable systematic research of the knowledge embedded in the manuscripts and make it widely and easily accessible by everyone. IOS Press 2021-04-01 Article PeerReviewed application/pdf en http://irep.iium.edu.my/90003/1/90003_Content%20extraction%20of%20historical%20Malay%20manuscripts.pdf application/pdf en http://irep.iium.edu.my/90003/2/Content%20extraction%20of%20historical%20Malay%20manuscripts.pdf application/pdf en http://irep.iium.edu.my/90003/13/90003_Content%20extraction%20of%20historical%20Malay%20manuscripts%20based%20on%20Event%20Ontology%20Framework_Scopus.pdf Mohd Nor, Zahila and M. Khalid, Yanti Idaya Aspura and Abdullah, Noorhidawati (2021) Content extraction of historical Malay manuscripts based on Event Ontology Framework. Applied Ontology, 16 (3). 249 -275. ISSN 1570-5838 E-ISSN 1875-8533 https://content.iospress.com/articles/applied-ontology/ao210247 10.3233/AO-210247
institution Universiti Islam Antarabangsa Malaysia
building IIUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider International Islamic University Malaysia
content_source IIUM Repository (IREP)
url_provider http://irep.iium.edu.my/
language English
English
English
topic Z665 Library Science. Information Science
spellingShingle Z665 Library Science. Information Science
Mohd Nor, Zahila
M. Khalid, Yanti Idaya Aspura
Abdullah, Noorhidawati
Content extraction of historical Malay manuscripts based on Event Ontology Framework
description This article aims to explore representation of the content knowledge of historical Malay manuscripts by extracting the event features using an event ontology framework. The manuscript used during the testing is Sulalatus Salatin (Sejarah Melayu ) by Abdul Ahmad Samad and it was published at University of Malaya Digital Library database. In aligning to a domain-specific ontology, the Simple Event Model (SEM) model is adopted and an event-based ontology for historical Malay manuscripts is designed. Information extraction approach is done manually to extract events from the manuscript and mapped into Protégé editor. Competency questions were constructed and submitted to the Protégé editor using SPARQL to check the ontology capability of providing answers as well as to examine its correctness. Event-based ontology model assists in discovering and representing the content knowledge of historical Malay manuscripts and supports organisation of knowledge. All the main concepts are extracted from selected Malay manuscript and 17 concepts used to develop the event-based ontology model. The knowledge was verified by three domain experts in Malay manuscript. In the findings, the interrater reliability for Event and Actor instances is 84%, which means 16% of instances and its type are incorrect and need amendment. For Place, interrater reliability is 95% and 99% for Role. Meanwhile, the experts achieved 100% agreement for Time. In addition, the experts agreed that the concepts, properties and instances for Malay Manuscript Ontology and complied with the criteria of consistency, completeness, conciseness, expandability and ease of use. The development of the event-based model of an ontology-based system with a high level of semantic granularity reflects the various cultural riches and intellectual aspect stored in Malay manuscripts. This will enable systematic research of the knowledge embedded in the manuscripts and make it widely and easily accessible by everyone.
format Article
author Mohd Nor, Zahila
M. Khalid, Yanti Idaya Aspura
Abdullah, Noorhidawati
author_facet Mohd Nor, Zahila
M. Khalid, Yanti Idaya Aspura
Abdullah, Noorhidawati
author_sort Mohd Nor, Zahila
title Content extraction of historical Malay manuscripts based on Event Ontology Framework
title_short Content extraction of historical Malay manuscripts based on Event Ontology Framework
title_full Content extraction of historical Malay manuscripts based on Event Ontology Framework
title_fullStr Content extraction of historical Malay manuscripts based on Event Ontology Framework
title_full_unstemmed Content extraction of historical Malay manuscripts based on Event Ontology Framework
title_sort content extraction of historical malay manuscripts based on event ontology framework
publisher IOS Press
publishDate 2021
url http://irep.iium.edu.my/90003/1/90003_Content%20extraction%20of%20historical%20Malay%20manuscripts.pdf
http://irep.iium.edu.my/90003/2/Content%20extraction%20of%20historical%20Malay%20manuscripts.pdf
http://irep.iium.edu.my/90003/13/90003_Content%20extraction%20of%20historical%20Malay%20manuscripts%20based%20on%20Event%20Ontology%20Framework_Scopus.pdf
http://irep.iium.edu.my/90003/
https://content.iospress.com/articles/applied-ontology/ao210247
_version_ 1709667138519695360
score 13.2014675