A comparative study of the ensemble and base classifiers performance in Malay text categorization

Automatic text categorization (ATC) has attracted the attention of the research community over the last decade as it frees organizations from the need of manually organized documents. The ensemble techniques, which combine the results of a number of individually trained base classifiers, always impr...

Full description

Saved in:
Bibliographic Details
Main Authors: Alshalabi, Hamood Ali, Sabrina Tiun,, Nazlia Omar,
Format: Article
Language:English
Published: Penerbit Universiti Kebangsaan Malaysia 2017
Online Access:http://journalarticle.ukm.my/11854/1/19180-65874-1-PB.pdf
http://journalarticle.ukm.my/11854/
http://ejournals.ukm.my/apjitm/issue/view/1050
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-ukm.journal.11854
record_format eprints
spelling my-ukm.journal.118542018-07-10T00:05:41Z http://journalarticle.ukm.my/11854/ A comparative study of the ensemble and base classifiers performance in Malay text categorization Alshalabi, Hamood Ali Sabrina Tiun, Nazlia Omar, Automatic text categorization (ATC) has attracted the attention of the research community over the last decade as it frees organizations from the need of manually organized documents. The ensemble techniques, which combine the results of a number of individually trained base classifiers, always improve classification performance better than base classifiers. This paper intends to compare the effectiveness of ensemble with that of base classifiers for Malay text classification. Two feature selection methods (the Gini Index (GI) and Chi-square) with the ensemble methods are applied to examine Malay text classification, with the intention to efficiently integrate base classifiers algorithms into a more accurate classification procedure. Two types of ensemble methods, namely the voting combination and meta-classifier combination, are evaluated. A wide range of comparative experiments are conducted to assess classified Malay dataset. The applied experiments reveal that meta-classifier ensemble framework performed better than the best individual classifiers on the tested datasets. Penerbit Universiti Kebangsaan Malaysia 2017-12 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/11854/1/19180-65874-1-PB.pdf Alshalabi, Hamood Ali and Sabrina Tiun, and Nazlia Omar, (2017) A comparative study of the ensemble and base classifiers performance in Malay text categorization. Asia-Pacific Journal of Information Technology and Multimedia, 6 (2). pp. 53-64. ISSN 2289-2192 http://ejournals.ukm.my/apjitm/issue/view/1050
institution Universiti Kebangsaan Malaysia
building Perpustakaan Tun Sri Lanang Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Kebangsaan Malaysia
content_source UKM Journal Article Repository
url_provider http://journalarticle.ukm.my/
language English
description Automatic text categorization (ATC) has attracted the attention of the research community over the last decade as it frees organizations from the need of manually organized documents. The ensemble techniques, which combine the results of a number of individually trained base classifiers, always improve classification performance better than base classifiers. This paper intends to compare the effectiveness of ensemble with that of base classifiers for Malay text classification. Two feature selection methods (the Gini Index (GI) and Chi-square) with the ensemble methods are applied to examine Malay text classification, with the intention to efficiently integrate base classifiers algorithms into a more accurate classification procedure. Two types of ensemble methods, namely the voting combination and meta-classifier combination, are evaluated. A wide range of comparative experiments are conducted to assess classified Malay dataset. The applied experiments reveal that meta-classifier ensemble framework performed better than the best individual classifiers on the tested datasets.
format Article
author Alshalabi, Hamood Ali
Sabrina Tiun,
Nazlia Omar,
spellingShingle Alshalabi, Hamood Ali
Sabrina Tiun,
Nazlia Omar,
A comparative study of the ensemble and base classifiers performance in Malay text categorization
author_facet Alshalabi, Hamood Ali
Sabrina Tiun,
Nazlia Omar,
author_sort Alshalabi, Hamood Ali
title A comparative study of the ensemble and base classifiers performance in Malay text categorization
title_short A comparative study of the ensemble and base classifiers performance in Malay text categorization
title_full A comparative study of the ensemble and base classifiers performance in Malay text categorization
title_fullStr A comparative study of the ensemble and base classifiers performance in Malay text categorization
title_full_unstemmed A comparative study of the ensemble and base classifiers performance in Malay text categorization
title_sort comparative study of the ensemble and base classifiers performance in malay text categorization
publisher Penerbit Universiti Kebangsaan Malaysia
publishDate 2017
url http://journalarticle.ukm.my/11854/1/19180-65874-1-PB.pdf
http://journalarticle.ukm.my/11854/
http://ejournals.ukm.my/apjitm/issue/view/1050
_version_ 1643738622052007936
score 13.214268