A word stemming algorithm for Hausa language

Bibliographic Details
Main Authors: Muazzam, Bashir; Azilawati, Rozaimee; Wan Malini, Wan Isa
Format: Article
Language: English
Published: 2015
Online Access: http://eprints.unisza.edu.my/5000/1/FH02-FIK-15-04221.pdf
http://eprints.unisza.edu.my/5000/
Description
Summary: Hausa, a highly inflected language, needs an effective stemming approach for efficient information retrieval (IR). However, few or no studies on stemming exist for the language. Stemming is the systematic reduction of a word to its base or root form. It is a crucial step in natural language processing (NLP) tasks such as text summarization and machine translation. This study therefore presents an automatic word stemming system for the Hausa language, with a view to contributing to electronic text processing and to NLP in general. The proposed method is a modification of Porter’s algorithm adapted to Hausa morphological rules. The system achieved an accuracy of 73.8% when tested on 2573 words extracted from four different articles from the Hausa Leadership newspaper. If improved over time (by handling more exceptional cases in future work), it could inspire the development of more tools for the language and so help the language adopt advances in technology more rapidly.