Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain
The process of detection for the head and modifier in Malay sentences from the cultural heritage domain is difficult to identify. This is due to the position of head and modifier which varies in sentences depending on the sentence structures. Hence, there are different point of views about the theor...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Penerbit Universiti Kebangsaan Malaysia
2017
|
Online Access: | http://journalarticle.ukm.my/11840/1/13767-54962-1-PB.pdf http://journalarticle.ukm.my/11840/ http://ejournal.ukm.my/apjitm/issue/view/899 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-ukm.journal.11840 |
---|---|
record_format |
eprints |
spelling |
my-ukm.journal.118402018-07-09T04:05:59Z http://journalarticle.ukm.my/11840/ Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain Suhaimi Ab Rahman, Nazlia Omar, The process of detection for the head and modifier in Malay sentences from the cultural heritage domain is difficult to identify. This is due to the position of head and modifier which varies in sentences depending on the sentence structures. Hence, there are different point of views about the theory and concept of detection for the head and modifier in a compound noun that have been discussed by language experts. Additionally, the existing research is also limited especially in the areas of computational linguistics. Therefore, research should be conducted to identify appropriate methods especially used in the detection of head and modifier which appear in Malay setences from the cultural heritage domain. The aim of this study is to construct a list of heuristic rules to be used for detecting the position of compound nouns in Malay sentences from cultural heritage domain. By using 15 rules, the position of head and modifier that exist in a compound noun can also be detected. These rules are called heuristic rules. The purpose of formulating these 15 rules is to detect the head and modifier that exist in the Malay sentences from the cultural heritage domain. To measure the accuracy of the results, precision, recall and F1-score values are used. Based on the results of the experiments, Sentence Structure of Malay Cultural Heritage Domain (SADWBM) have an F1-score of 80.4% compared to Noun Phrase Structure (SFN) which is 56%. Consequently, SADWBM shows better scores compared to SFN. Therefore it is clear that the approach used in this study is effective in resolving the identified problems. Penerbit Universiti Kebangsaan Malaysia 2017-06 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/11840/1/13767-54962-1-PB.pdf Suhaimi Ab Rahman, and Nazlia Omar, (2017) Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain. Asia-Pacific Journal of Information Technology and Multimedia, 6 (1). pp. 13-21. ISSN 2289-2192 http://ejournal.ukm.my/apjitm/issue/view/899 |
institution |
Universiti Kebangsaan Malaysia |
building |
Perpustakaan Tun Sri Lanang Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Kebangsaan Malaysia |
content_source |
UKM Journal Article Repository |
url_provider |
http://journalarticle.ukm.my/ |
language |
English |
description |
The process of detection for the head and modifier in Malay sentences from the cultural heritage domain is difficult to identify. This is due to the position of head and modifier which varies in sentences depending on the sentence structures. Hence, there are different point of views about the theory and concept of detection for the head and modifier in a compound noun that have been discussed by language experts. Additionally, the existing research is also limited especially in the areas of computational linguistics. Therefore, research should be conducted to identify appropriate methods especially used in the detection of head and modifier which appear in Malay setences from the cultural heritage domain. The aim of this study is to construct a list of heuristic rules to be used for detecting the position of compound nouns in Malay sentences from cultural heritage domain. By using 15 rules, the position of head and modifier that exist in a compound noun can also be detected. These rules are called heuristic rules. The purpose of formulating these 15 rules is to detect the head and modifier that exist in the Malay sentences from the cultural heritage domain. To measure the accuracy of the results, precision, recall and F1-score values are used. Based on the results of the experiments, Sentence Structure of Malay Cultural Heritage Domain (SADWBM) have an F1-score of 80.4% compared to Noun Phrase Structure (SFN) which is 56%. Consequently, SADWBM shows better scores compared to SFN. Therefore it is clear that the approach used in this study is effective in resolving the identified problems. |
format |
Article |
author |
Suhaimi Ab Rahman, Nazlia Omar, |
spellingShingle |
Suhaimi Ab Rahman, Nazlia Omar, Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain |
author_facet |
Suhaimi Ab Rahman, Nazlia Omar, |
author_sort |
Suhaimi Ab Rahman, |
title |
Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain |
title_short |
Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain |
title_full |
Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain |
title_fullStr |
Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain |
title_full_unstemmed |
Heuristics-based method for head and modifier detection in Malay sentences from the cultural heritage domain |
title_sort |
heuristics-based method for head and modifier detection in malay sentences from the cultural heritage domain |
publisher |
Penerbit Universiti Kebangsaan Malaysia |
publishDate |
2017 |
url |
http://journalarticle.ukm.my/11840/1/13767-54962-1-PB.pdf http://journalarticle.ukm.my/11840/ http://ejournal.ukm.my/apjitm/issue/view/899 |
_version_ |
1643738617981435904 |
score |
13.214268 |