Categorical term descriptor: a proposed term weighting scheme for feature selection

This paper proposes a term weighting scheme, categorical term descriptor (CTD), for feature selection in automated text categorization. CTD is an adaptation of the term frequency inverse document frequency (TFIDF). We compared the performance of the proposed method against classical methods such as...

Full description

Saved in:
Bibliographic Details
Main Authors: Bong, Chih How, Kulathuramaiyer, Narayanan, Wong, Ting Kiong
Format: Conference or Workshop Item
Language:English
Published: IEEE 2005
Subjects:
Online Access:http://ir.unimas.my/id/eprint/1196/1/categorical%2Bterm%2Bdescriptor%2B%2BA%2Bproposed%2Bterm%2Bweighting%2Bscheme%2Bfor%2Bfeature%2Bselection%2528abstract%2529.pdf
http://ir.unimas.my/id/eprint/1196/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper proposes a term weighting scheme, categorical term descriptor (CTD), for feature selection in automated text categorization. CTD is an adaptation of the term frequency inverse document frequency (TFIDF). We compared the performance of the proposed method against classical methods such as correlation coefficient, chi-square and information gain using the multinomial naive Bayes and the support vector machine (SVKD) classifiers on the Reuters (10) and Reuters (115) variants of Reuters-21578 dataset.