Statistical Malay dependency parser for knowledge acquisition based on word dependency relation

One of the common problems faced when processing information gathered from any natural language is the 'semantic gap' where the 'meaning' of the sentences is not exactly extracted. In Malay Natural Language Processing (NLP), as our knowledge, there is no existing Malay Parser tha...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohamed H., Omar N., Aziz M.J.A., Rahman S.A.
Other Authors: 49964168000
Format: Conference paper
Published: 2023
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.uniten.dspace-30375
record_format dspace
spelling my.uniten.dspace-303752023-12-29T15:47:09Z Statistical Malay dependency parser for knowledge acquisition based on word dependency relation Mohamed H. Omar N. Aziz M.J.A. Rahman S.A. 49964168000 23397755200 36089651800 57195320360 Dependency Grammar Dependency Parser Malay corpus Malay Parser Parser Syntactic Relation One of the common problems faced when processing information gathered from any natural language is the 'semantic gap' where the 'meaning' of the sentences is not exactly extracted. In Malay Natural Language Processing (NLP), as our knowledge, there is no existing Malay Parser that can be used to develop a knowledge acquisition feature to extract 'meaning' from Malay articles based-on syntactic relations. This relation is basically the relation between a word and its dependents. This paper will examine the Dependency Grammar (DG) for developing Malay Grammar Parser and discuss the possibilities of developing probabilistic dependency Malay parser using the projected syntactic relation from annotated English corpus. The English side of a parallel corpus, project the analysis to the second language (Malay). Thus, the rules for adaptation from English DG to Malay DG will be defined. The projected tree structure in Malay will be used in training a stochastic analyzer. The training will produce a set of tree lattices which contains chunks of dependency trees for Malay attached with their probability value. A decoder will be developed to test the lattices. A DG for a new Malay sentence is built by combining the pre-determined lattices according to their plausible highest probability of combination. Final 2023-12-29T07:47:09Z 2023-12-29T07:47:09Z 2011 Conference paper 10.1016/j.sbspro.2011.10.597 2-s2.0-83755171540 https://www.scopus.com/inward/record.uri?eid=2-s2.0-83755171540&doi=10.1016%2fj.sbspro.2011.10.597&partnerID=40&md5=c89747a83af914a3394fbea4eee52ec7 https://irepository.uniten.edu.my/handle/123456789/30375 27 188 193 All Open Access; Gold Open Access Scopus
institution Universiti Tenaga Nasional
building UNITEN Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Tenaga Nasional
content_source UNITEN Institutional Repository
url_provider http://dspace.uniten.edu.my/
topic Dependency Grammar
Dependency Parser
Malay corpus
Malay Parser
Parser
Syntactic Relation
spellingShingle Dependency Grammar
Dependency Parser
Malay corpus
Malay Parser
Parser
Syntactic Relation
Mohamed H.
Omar N.
Aziz M.J.A.
Rahman S.A.
Statistical Malay dependency parser for knowledge acquisition based on word dependency relation
description One of the common problems faced when processing information gathered from any natural language is the 'semantic gap' where the 'meaning' of the sentences is not exactly extracted. In Malay Natural Language Processing (NLP), as our knowledge, there is no existing Malay Parser that can be used to develop a knowledge acquisition feature to extract 'meaning' from Malay articles based-on syntactic relations. This relation is basically the relation between a word and its dependents. This paper will examine the Dependency Grammar (DG) for developing Malay Grammar Parser and discuss the possibilities of developing probabilistic dependency Malay parser using the projected syntactic relation from annotated English corpus. The English side of a parallel corpus, project the analysis to the second language (Malay). Thus, the rules for adaptation from English DG to Malay DG will be defined. The projected tree structure in Malay will be used in training a stochastic analyzer. The training will produce a set of tree lattices which contains chunks of dependency trees for Malay attached with their probability value. A decoder will be developed to test the lattices. A DG for a new Malay sentence is built by combining the pre-determined lattices according to their plausible highest probability of combination.
author2 49964168000
author_facet 49964168000
Mohamed H.
Omar N.
Aziz M.J.A.
Rahman S.A.
format Conference paper
author Mohamed H.
Omar N.
Aziz M.J.A.
Rahman S.A.
author_sort Mohamed H.
title Statistical Malay dependency parser for knowledge acquisition based on word dependency relation
title_short Statistical Malay dependency parser for knowledge acquisition based on word dependency relation
title_full Statistical Malay dependency parser for knowledge acquisition based on word dependency relation
title_fullStr Statistical Malay dependency parser for knowledge acquisition based on word dependency relation
title_full_unstemmed Statistical Malay dependency parser for knowledge acquisition based on word dependency relation
title_sort statistical malay dependency parser for knowledge acquisition based on word dependency relation
publishDate 2023
_version_ 1806424377102696448
score 13.214268