Enhanced affixation word stemmer with stemming error reducer to solve affixation stemming errors

A word stemming algorithm (or word stemmer) is an important preprocessing component in information retrieval and text categorization that aims to reduce derived words to their respective root words. Most existing Malay word stemmers adopt a rule-based affix removal method and dictionary loo...

Bibliographic Details
Main Authors: Kassim, Mohamad Nizam; Maarof, Mohd. Aizaini; Zainal, Anazida; Abdul Wahab, Amirudin
Format: Article
Published: Universiti Teknikal Malaysia Melaka 2016
Subjects: QA75 Electronic computers. Computer science
Online Access: http://eprints.utm.my/id/eprint/74132/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-84984815301&partnerID=40&md5=640aa0357eaae8400d5451cae4eafc5e
id my.utm.74132
record_format eprints
spelling my.utm.74132 2017-11-28T05:01:13Z http://eprints.utm.my/id/eprint/74132/ Enhanced affixation word stemmer with stemming error reducer to solve affixation stemming errors Kassim, Mohamad Nizam; Maarof, Mohd. Aizaini; Zainal, Anazida; Abdul Wahab, Amirudin QA75 Electronic computers. Computer science A word stemming algorithm (or word stemmer) is an important preprocessing component in information retrieval and text categorization that aims to reduce derived words to their respective root words. Most existing Malay word stemmers adopt a rule-based affix removal method and dictionary lookup to stem affixation words. Although many stemming approaches have been proposed in past research, existing Malay word stemmers still suffer from affixation stemming errors due to the complexity of Malay morphology. These stemming errors can be classified into over-stemming, under-stemming, unstemmed words, and special variations and exceptions. Hence, this paper presents an enhanced affixation word stemmer that aims to solve these stemming errors. This paper also examines the root causes of these stemming errors in existing Malay stemmers. The experimental results indicate that the enhanced word stemmer is able to stem prefixation, suffixation, confixation and infixation words with better stemming accuracy by using an enhanced Rule Application Order and a Stemming Errors Reducer. Universiti Teknikal Malaysia Melaka 2016 Article PeerReviewed Kassim, Mohamad Nizam and Maarof, Mohd. Aizaini and Zainal, Anazida and Abdul Wahab, Amirudin (2016) Enhanced affixation word stemmer with stemming error reducer to solve affixation stemming errors. Journal of Telecommunication, Electronic and Computer Engineering, 8 (3). pp. 37-41. ISSN 2180-1843 https://www.scopus.com/inward/record.uri?eid=2-s2.0-84984815301&partnerID=40&md5=640aa0357eaae8400d5451cae4eafc5e
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QA75 Electronic computers. Computer science
description A word stemming algorithm (or word stemmer) is an important preprocessing component in information retrieval and text categorization that aims to reduce derived words to their respective root words. Most existing Malay word stemmers adopt a rule-based affix removal method and dictionary lookup to stem affixation words. Although many stemming approaches have been proposed in past research, existing Malay word stemmers still suffer from affixation stemming errors due to the complexity of Malay morphology. These stemming errors can be classified into over-stemming, under-stemming, unstemmed words, and special variations and exceptions. Hence, this paper presents an enhanced affixation word stemmer that aims to solve these stemming errors. This paper also examines the root causes of these stemming errors in existing Malay stemmers. The experimental results indicate that the enhanced word stemmer is able to stem prefixation, suffixation, confixation and infixation words with better stemming accuracy by using an enhanced Rule Application Order and a Stemming Errors Reducer.
format Article
author Kassim, Mohamad Nizam
Maarof, Mohd. Aizaini
Zainal, Anazida
Abdul Wahab, Amirudin
title Enhanced affixation word stemmer with stemming error reducer to solve affixation stemming errors
publisher Universiti Teknikal Malaysia Melaka
publishDate 2016
url http://eprints.utm.my/id/eprint/74132/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-84984815301&partnerID=40&md5=640aa0357eaae8400d5451cae4eafc5e
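
The abstract in this record names rule-based affix removal combined with dictionary lookup as the common approach in Malay stemmers, and over-stemming as one of the resulting error classes. The following is a minimal sketch of that general idea only, not the paper's enhanced Rule Application Order or its Stemming Errors Reducer, whose details are not given in this record. The root list, the affix lists, and the function names `naive_stem`, `candidates`, and `stem` are all illustrative assumptions.

```python
# Illustrative assumptions: a tiny root dictionary and affix lists.
# A real Malay stemmer needs far larger resources and more rules.
ROOTS = frozenset({"makan", "main", "baca", "ajar"})
PREFIXES = ("ber", "per", "di", "me", "ter")
SUFFIXES = ("kan", "an", "i")


def naive_stem(word):
    """Strip the first matching suffix with no dictionary check."""
    for suffix in SUFFIXES:
        if word.endswith(suffix):
            return word[: -len(suffix)]
    return word


def candidates(word):
    """Yield the word with at most one prefix and one suffix removed."""
    prefix_options = [""] + [p for p in PREFIXES if word.startswith(p)]
    suffix_options = [""] + [s for s in SUFFIXES if word.endswith(s)]
    for prefix in prefix_options:
        for suffix in suffix_options:
            core = word[len(prefix): len(word) - len(suffix)]
            if len(core) >= 2:  # skip empty or one-letter remainders
                yield core


def stem(word, roots=ROOTS):
    """Return a dictionary root for the word, or the word unchanged."""
    word = word.lower()
    if word in roots:  # checking the dictionary first guards root words
        return word
    for candidate in candidates(word):
        if candidate in roots:
            return candidate
    return word  # no rule produced a known root: left unstemmed


if __name__ == "__main__":
    for w in ("makan", "makanan", "dimakan", "permainan", "komputer"):
        print(w, "->", stem(w), "| naive:", naive_stem(w))
```

In this sketch the root word "makan" ends in the suffix "kan", so blind suffix removal over-stems it to "ma", while the dictionary check returns it unchanged. A word with no matching root, such as "komputer", is returned as-is, which corresponds to the unstemmed case.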