Study of stemming algorithm for Malay words which begin with alphabets 'M' / Mohd Zawawi Mohd Yunus

This research concerns a study of stemming algorithm for Malay words begin with alphabet 'M'. This research involves a Malay stemming approach called Rules-Application-Order (RAO). The performance of this Malay stemming algorithm is tested using the test collection of 1066 words that start...

Full description

Saved in:
Bibliographic Details
Main Author: Mohd Yunus, Mohd Zawawi
Format: Thesis
Language:English
Published: 2000
Subjects:
Online Access:https://ir.uitm.edu.my/id/eprint/98081/1/98081.pdf
https://ir.uitm.edu.my/id/eprint/98081/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This research concerns a study of stemming algorithm for Malay words begin with alphabet 'M'. This research involves a Malay stemming approach called Rules-Application-Order (RAO). The performance of this Malay stemming algorithm is tested using the test collection of 1066 words that starts with the letter 'M' that have been extracted from 6236 Malay Quran documents. It also used 24 different combinations of Malay affixes that consist of prefix, prefix-suffix, suffix and infix. The results are obtained from the experiments that use the four rules and it combination. The type of errors found in the stemming algorithm is overstemmed, understemmed, spelling exception and unstemmed. These stemming algorithm problems will be solved by doing five experiments such as analysis the existing algorithm, do correction in the file, adding rules, correct the stemming algorithm and use two combination rules. The results of the experiments will show that the algorithm has successfully stemmed all Malay words begin with alphabet 'M' that extracted from Quran documents.