Extracting Significant Words In Engineering Texts For Specialised Language Descriptions

The academic discourse of a specialised language is characterised by specialised and technical vocabulary, and lexicogrammar. Studies on language description suggest the need to explore and determine the specific characteristics of the academic discourse of each specialised language, to serve the la...

Full description

Saved in:
Bibliographic Details
Main Authors: Khamis, Noorli, Abdullah, Imran-Ho
Format: Article
Language:English
Published: Blue Eyes Intelligence Engineering & Sciences Publication (BEIESP) 2019
Online Access:http://eprints.utem.edu.my/id/eprint/24301/2/NOORLI%20IJITEE.PDF
http://eprints.utem.edu.my/id/eprint/24301/
https://www.ijitee.org/wp-content/uploads/papers/v8i12/L25171081219.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The academic discourse of a specialised language is characterised by specialised and technical vocabulary, and lexicogrammar. Studies on language description suggest the need to explore and determine the specific characteristics of the academic discourse of each specialised language, to serve the language needs of the learners. This study demonstrates an exploration of this discipline specificity by looking at the nouns used in a specialised language - an Engineering English. It attempts to integrate a multivariate technique, i.e. the Correspondence Analysis (CA), as a tool to extract significant nouns in a specialised language for any further language use scrutiny. CA allows visual representations of the word interrelationships across different genres in a specialised language. To exemplify this, an Engineering English Corpus (E2C) was created. E2C is composed of two sub-corpora (genres): Engineering reference books (RBC) and online journals articles (EJC). The British National Corpus (BNC) was used as the reference corpus. 30 key-key-nouns were identified from the E2C, and the frequency lists of the words were retrieved from all the corpora to run the CA. The CA maps of the nouns display how these corpora are different from each other, as well as, which words characterise not only E2C from a general corpus (BNC), but also the different genres in E2C. Thus, CA proves to be a potential tool to display words which characterise not only a specialised corpus from a general corpus, but also the different genres in that specialised corpus. This study promises more informed descriptions of a specialised language can be made with the identification of specific and significant vocabulary for any academic discourse investigations