Text categorization using naive bayes algorithm

As the volume of information available on the internet and corporate intranet continues to increase, there is a growing interest in helping people better find, filter, and manage all these resources. Text categorization is one of the techniques that can be applied in this situation. This paper prese...

Full description

Saved in:
Bibliographic Details
Main Authors: Wan lsmail @ W. Abdullah, Wan Hazimah, Kamaruddin, Siti Sakira, Sainin, Mohd Shamrie
Format: Conference or Workshop Item
Language:English
Published: 2006
Subjects:
Online Access:http://repo.uum.edu.my/82/1/text.pdf
http://repo.uum.edu.my/82/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:As the volume of information available on the internet and corporate intranet continues to increase, there is a growing interest in helping people better find, filter, and manage all these resources. Text categorization is one of the techniques that can be applied in this situation. This paper presents text categorization system based on naive Bayes algorithm. This algorithm has long been used for text categorization tasks. Naive Bayes classifier is based on probability model that integrate strong independence assumptions which often have no bearing in reality. The aims of this project are to categorize the textual document using naive Bayes algorithm and to measure the correctness of the chosen technique for the categorization process. This paper also discusses the experiment in categorizing articles using naive Bayes.