The potential contribution of general and specialized corpora to research on Malay and Malaysian English.

Today’s linguists are increasingly concerned with high-level properties of texts, and tend to work top-down in some branch of discourse analysis, while corpus linguists are concerned with low-level properties such as grammatical class, syntactic constructions and different kinds of text annotation,...

Bibliographic Details
Main Authors: Mohd. Don, Zuraidah; Gerry, Knowles
Format: Article
Language: English
Published: Penerbit UTM Press 2022
Subjects: L Education (General)
Online Access: http://eprints.utm.my/104505/1/ZuraidahMohdDonGerryKnowles2022_ThePotentialContributionofGeneralandSpecializedCopra.pdf
http://eprints.utm.my/104505/
http://dx.doi.org/10.11113/lspi.v9.19469
id my.utm.104505
record_format eprints
spelling my.utm.104505 2024-02-08T08:16:45Z http://eprints.utm.my/104505/ Mohd. Don, Zuraidah and Gerry, Knowles (2022) The potential contribution of general and specialized corpora to research on Malay and Malaysian English. LSP International Journal, 9 (2). pp. 85-96. ISSN 2601-002X. Penerbit UTM Press, 2022-12-26. Article, PeerReviewed, application/pdf, en. http://dx.doi.org/10.11113/lspi.v9.19469 DOI: 10.11113/lspi.v9.19469
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic L Education (General)
description Today’s linguists are increasingly concerned with high-level properties of texts, and tend to work top-down in some branch of discourse analysis, while corpus linguists are concerned with low-level properties such as grammatical class, syntactic constructions and different kinds of text annotation, and tend to work bottom-up. This paper seeks to close the gap, using a general corpus and a specialised corpus. The point of departure is the assumption that a corpus is compiled to study the language of texts in some language for some special purpose beyond the existence of the corpus itself. The particular languages in mind are Malay and Malaysian English. The introduction deals with matters that have to be considered when a corpus project is planned, and with the problems that can arise, some of which have been reported. The methodology section concentrates on the groundwork that has to be done for just about any corpus-based project: it starts with a project undertaken long before computers were invented, and then describes the role of computational expertise in modern corpus-based projects. The results section reports some preliminary work on a specialised corpus containing the speeches of Tun Mahathir Mohamad, which attempts to go beyond the groundwork to ascertain objectively what the speeches are about. The paper ends with a combined discussion and conclusion that summarises the content of the paper.
format Article
author Mohd. Don, Zuraidah
Gerry, Knowles
title The potential contribution of general and specialized corpora to research on Malay and Malaysian English.
publisher Penerbit UTM Press
publishDate 2022