The potential contribution of general and specialized corpora to research on Malay and Malaysian English.
Today’s linguists are increasingly concerned with high-level properties of texts, and tend to work top-down in some branch of discourse analysis, while corpus linguists are concerned with low-level properties such as grammatical class, syntactic constructions and different kinds of text annotation,...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Penerbit UTM Press
2022
|
Subjects: | |
Online Access: | http://eprints.utm.my/104505/1/ZuraidahMohdDonGerryKnowles2022_ThePotentialContributionofGeneralandSpecializedCopra.pdf http://eprints.utm.my/104505/ http://dx.doi.org/10.11113/lspi.v9.19469 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utm.104505 |
---|---|
record_format |
eprints |
spelling |
my.utm.1045052024-02-08T08:16:45Z http://eprints.utm.my/104505/ The potential contribution of general and specialized corpora to research on Malay and Malaysian English. Mohd. Don, Zuraidah Gerry, Knowles L Education (General) Today’s linguists are increasingly concerned with high-level properties of texts, and tend to work top-down in some branch of discourse analysis, while corpus linguists are concerned with low-level properties such as grammatical class, syntactic constructions and different kinds of text annotation, and tend to work bottom-up. This paper seeks to close the gap, using a general corpus and a specialised corpus. The point of departure is the assumption that a corpus is compiled to study the language of texts in some language for some special purpose beyond the existence of the corpus itself. The particular languages in mind are Malay and Malaysian English. The introduction deals with matters that have to be considered when a corpus project is planned, and with the problems that can arise, some of which have been reported. The methodology section concentrates on the groundwork that has to be done for just about any corpus-based project, and starts with a project undertaken long before computers were invented, and describes the role of computational expertise in modern corpus-based projects. The results section reports some preliminary work on a specialised corpus containing the speeches of Tun Mahathir Mohamed, which attempts to go beyond the groundwork to ascertain objectively what the speeches are about. The paper ends with a combined discussion and conclusion that summarises the content of the paper. Penerbit UTM Press 2022-12-26 Article PeerReviewed application/pdf en http://eprints.utm.my/104505/1/ZuraidahMohdDonGerryKnowles2022_ThePotentialContributionofGeneralandSpecializedCopra.pdf Mohd. Don, Zuraidah and Gerry, Knowles (2022) The potential contribution of general and specialized corpora to research on Malay and Malaysian English. LSP International Journal, 9 (2). pp. 85-96. ISSN 2601–002X http://dx.doi.org/10.11113/lspi.v9.19469 DOI: 10.11113/lspi.v9.19469 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
language |
English |
topic |
L Education (General) |
spellingShingle |
L Education (General) Mohd. Don, Zuraidah Gerry, Knowles The potential contribution of general and specialized corpora to research on Malay and Malaysian English. |
description |
Today’s linguists are increasingly concerned with high-level properties of texts, and tend to work top-down in some branch of discourse analysis, while corpus linguists are concerned with low-level properties such as grammatical class, syntactic constructions and different kinds of text annotation, and tend to work bottom-up. This paper seeks to close the gap, using a general corpus and a specialised corpus. The point of departure is the assumption that a corpus is compiled to study the language of texts in some language for some special purpose beyond the existence of the corpus itself. The particular languages in mind are Malay and Malaysian English. The introduction deals with matters that have to be considered when a corpus project is planned, and with the problems that can arise, some of which have been reported. The methodology section concentrates on the groundwork that has to be done for just about any corpus-based project, and starts with a project undertaken long before computers were invented, and describes the role of computational expertise in modern corpus-based projects. The results section reports some preliminary work on a specialised corpus containing the speeches of Tun Mahathir Mohamed, which attempts to go beyond the groundwork to ascertain objectively what the speeches are about. The paper ends with a combined discussion and conclusion that summarises the content of the paper. |
format |
Article |
author |
Mohd. Don, Zuraidah Gerry, Knowles |
author_facet |
Mohd. Don, Zuraidah Gerry, Knowles |
author_sort |
Mohd. Don, Zuraidah |
title |
The potential contribution of general and specialized corpora to research on Malay and Malaysian English. |
title_short |
The potential contribution of general and specialized corpora to research on Malay and Malaysian English. |
title_full |
The potential contribution of general and specialized corpora to research on Malay and Malaysian English. |
title_fullStr |
The potential contribution of general and specialized corpora to research on Malay and Malaysian English. |
title_full_unstemmed |
The potential contribution of general and specialized corpora to research on Malay and Malaysian English. |
title_sort |
potential contribution of general and specialized corpora to research on malay and malaysian english. |
publisher |
Penerbit UTM Press |
publishDate |
2022 |
url |
http://eprints.utm.my/104505/1/ZuraidahMohdDonGerryKnowles2022_ThePotentialContributionofGeneralandSpecializedCopra.pdf http://eprints.utm.my/104505/ http://dx.doi.org/10.11113/lspi.v9.19469 |
_version_ |
1792147787137155072 |
score |
13.214268 |