Building a dictionary of Malay language part-of-speech tagged words using Bahasa WordNet and Bahasa Indonesia resources / Mohamed Lubani and Rohana Mahmud

Assigning grammatical categories to words in natural text is a vital step in processing natural language. Language resources and text processing tools such as part-of-speech (POS) can be used to assign each word the corresponding grammatical category based on its context. Such resources are availabl...

Full description

Saved in:
Bibliographic Details
Main Authors: Lubani, Mohamed, Mahmud, Rohana
Format: Conference or Workshop Item
Language:English
Published: 2015
Subjects:
Online Access:https://ir.uitm.edu.my/id/eprint/35721/1/35721.pdf
https://ir.uitm.edu.my/id/eprint/35721/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.uitm.ir.35721
record_format eprints
spelling my.uitm.ir.357212022-09-11T23:26:48Z https://ir.uitm.edu.my/id/eprint/35721/ Building a dictionary of Malay language part-of-speech tagged words using Bahasa WordNet and Bahasa Indonesia resources / Mohamed Lubani and Rohana Mahmud Lubani, Mohamed Mahmud, Rohana Educational technology Learning. Learning strategies Assigning grammatical categories to words in natural text is a vital step in processing natural language. Language resources and text processing tools such as part-of-speech (POS) can be used to assign each word the corresponding grammatical category based on its context. Such resources are available for the major languages such as English, Spanish and Japanese. However, the lack of resources for Malay language makes it very hard to develop new processing tools and contribute to the automation of the language processing. In this paper, a Malay POS dictionary is built using Bahasa wordnet and a POS tagged of Indonesian corpus, as well as a monolingual Malay dictionary. The output is a list of 25,778 Malay POS tagged words where each word is assigned all its possible grammatical categories. The proposed process can also be used as a guideline for future improvements 2015-12 Conference or Workshop Item PeerReviewed text en https://ir.uitm.edu.my/id/eprint/35721/1/35721.pdf Building a dictionary of Malay language part-of-speech tagged words using Bahasa WordNet and Bahasa Indonesia resources / Mohamed Lubani and Rohana Mahmud. (2015) In: ICOMHAC2015 eproceedings, 16-17 Disember 2015, Century Helang Hotel, Pulau Langkawi.
institution Universiti Teknologi Mara
building Tun Abdul Razak Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Mara
content_source UiTM Institutional Repository
url_provider http://ir.uitm.edu.my/
language English
topic Educational technology
Learning. Learning strategies
spellingShingle Educational technology
Learning. Learning strategies
Lubani, Mohamed
Mahmud, Rohana
Building a dictionary of Malay language part-of-speech tagged words using Bahasa WordNet and Bahasa Indonesia resources / Mohamed Lubani and Rohana Mahmud
description Assigning grammatical categories to words in natural text is a vital step in processing natural language. Language resources and text processing tools such as part-of-speech (POS) can be used to assign each word the corresponding grammatical category based on its context. Such resources are available for the major languages such as English, Spanish and Japanese. However, the lack of resources for Malay language makes it very hard to develop new processing tools and contribute to the automation of the language processing. In this paper, a Malay POS dictionary is built using Bahasa wordnet and a POS tagged of Indonesian corpus, as well as a monolingual Malay dictionary. The output is a list of 25,778 Malay POS tagged words where each word is assigned all its possible grammatical categories. The proposed process can also be used as a guideline for future improvements
format Conference or Workshop Item
author Lubani, Mohamed
Mahmud, Rohana
author_facet Lubani, Mohamed
Mahmud, Rohana
author_sort Lubani, Mohamed
title Building a dictionary of Malay language part-of-speech tagged words using Bahasa WordNet and Bahasa Indonesia resources / Mohamed Lubani and Rohana Mahmud
title_short Building a dictionary of Malay language part-of-speech tagged words using Bahasa WordNet and Bahasa Indonesia resources / Mohamed Lubani and Rohana Mahmud
title_full Building a dictionary of Malay language part-of-speech tagged words using Bahasa WordNet and Bahasa Indonesia resources / Mohamed Lubani and Rohana Mahmud
title_fullStr Building a dictionary of Malay language part-of-speech tagged words using Bahasa WordNet and Bahasa Indonesia resources / Mohamed Lubani and Rohana Mahmud
title_full_unstemmed Building a dictionary of Malay language part-of-speech tagged words using Bahasa WordNet and Bahasa Indonesia resources / Mohamed Lubani and Rohana Mahmud
title_sort building a dictionary of malay language part-of-speech tagged words using bahasa wordnet and bahasa indonesia resources / mohamed lubani and rohana mahmud
publishDate 2015
url https://ir.uitm.edu.my/id/eprint/35721/1/35721.pdf
https://ir.uitm.edu.my/id/eprint/35721/
_version_ 1744357186599911424
score 13.214268