Compilation of Malay criminological terms from online news

A Malay language corpus has been established by the Institute of Language and Literature (Dewan Bahasa dan Pustaka, DBP in Malaysia). Most of the past research on the Malay language corpus has focused on the description, lexicography and translation of the Malay language. However, in the existing...

Full description

Saved in:
Bibliographic Details
Main Authors: Lee, Joanna Chiew Ling *, Teh, Phoey Lee *, Lau, Sian Lun *, Pak, Irina *
Format: Article
Language:English
Published: Institute of Advanced Engineering and Science 2019
Subjects:
Online Access:http://eprints.sunway.edu.my/930/1/Teh%20Phoey%20Lee%20Compilation%20of%20Malay%20Criminological%20Terms%20from%20Online%20News2305843009226915841.pdf
http://eprints.sunway.edu.my/930/
http://ijeecs.iaescore.com/index.php/IJEECS/article/view/18480/12573
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.sunway.eprints.930
record_format eprints
spelling my.sunway.eprints.9302019-06-11T06:48:12Z http://eprints.sunway.edu.my/930/ Compilation of Malay criminological terms from online news Lee, Joanna Chiew Ling * Teh, Phoey Lee * Lau, Sian Lun * Pak, Irina * PL Languages and literatures of Eastern Asia, Africa, Oceania QA75 Electronic computers. Computer science A Malay language corpus has been established by the Institute of Language and Literature (Dewan Bahasa dan Pustaka, DBP in Malaysia). Most of the past research on the Malay language corpus has focused on the description, lexicography and translation of the Malay language. However, in the existing literature, there is no list of Malay words that categorizes crime terminologies. This study aims to fill that linguistic gap. First, we aggregated the most frequently used crime terminology words from Malaysian online news sources. Five hundred crime-related words were compiled. No automatic machines were in the initial process, but they were subsequently used to verify the data. Four human coders were used to validate the data and ensure the originality of the semantic understanding of the Malay text. Finally, major crime terminologies were outlined from a set of keywords to serve as taggers in our solution. The ultimate goal of this study is to provide a corpus for forensic linguistics, police investigations, and general crime research. This study has established the first corpus of a criminological text in the Malay language. Institute of Advanced Engineering and Science 2019-07 Article PeerReviewed text en cc_by_nc_4 http://eprints.sunway.edu.my/930/1/Teh%20Phoey%20Lee%20Compilation%20of%20Malay%20Criminological%20Terms%20from%20Online%20News2305843009226915841.pdf Lee, Joanna Chiew Ling * and Teh, Phoey Lee * and Lau, Sian Lun * and Pak, Irina * (2019) Compilation of Malay criminological terms from online news. Indonesian Journal of Electrical Engineering and Computer Science, 15 (1). pp. 355-364. ISSN 2502-4752 http://ijeecs.iaescore.com/index.php/IJEECS/article/view/18480/12573
institution Sunway University
building Sunway Campus Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Sunway University
content_source Sunway Institutional Repository
url_provider http://eprints.sunway.edu.my/
language English
topic PL Languages and literatures of Eastern Asia, Africa, Oceania
QA75 Electronic computers. Computer science
spellingShingle PL Languages and literatures of Eastern Asia, Africa, Oceania
QA75 Electronic computers. Computer science
Lee, Joanna Chiew Ling *
Teh, Phoey Lee *
Lau, Sian Lun *
Pak, Irina *
Compilation of Malay criminological terms from online news
description A Malay language corpus has been established by the Institute of Language and Literature (Dewan Bahasa dan Pustaka, DBP in Malaysia). Most of the past research on the Malay language corpus has focused on the description, lexicography and translation of the Malay language. However, in the existing literature, there is no list of Malay words that categorizes crime terminologies. This study aims to fill that linguistic gap. First, we aggregated the most frequently used crime terminology words from Malaysian online news sources. Five hundred crime-related words were compiled. No automatic machines were in the initial process, but they were subsequently used to verify the data. Four human coders were used to validate the data and ensure the originality of the semantic understanding of the Malay text. Finally, major crime terminologies were outlined from a set of keywords to serve as taggers in our solution. The ultimate goal of this study is to provide a corpus for forensic linguistics, police investigations, and general crime research. This study has established the first corpus of a criminological text in the Malay language.
format Article
author Lee, Joanna Chiew Ling *
Teh, Phoey Lee *
Lau, Sian Lun *
Pak, Irina *
author_facet Lee, Joanna Chiew Ling *
Teh, Phoey Lee *
Lau, Sian Lun *
Pak, Irina *
author_sort Lee, Joanna Chiew Ling *
title Compilation of Malay criminological terms from online news
title_short Compilation of Malay criminological terms from online news
title_full Compilation of Malay criminological terms from online news
title_fullStr Compilation of Malay criminological terms from online news
title_full_unstemmed Compilation of Malay criminological terms from online news
title_sort compilation of malay criminological terms from online news
publisher Institute of Advanced Engineering and Science
publishDate 2019
url http://eprints.sunway.edu.my/930/1/Teh%20Phoey%20Lee%20Compilation%20of%20Malay%20Criminological%20Terms%20from%20Online%20News2305843009226915841.pdf
http://eprints.sunway.edu.my/930/
http://ijeecs.iaescore.com/index.php/IJEECS/article/view/18480/12573
_version_ 1644324437247393792
score 13.209306