AUTOMATED NEWS CLASSIFICATION USING MACHINE LEARNING

The way of news consumption has changed drastically since the past as 87% of the respondent of a survey has cited online as their primary news source which makes the news article more accessible as ever for the readers, but the accessibility may overwhelm the readers due to the vast amount of...

Full description

Saved in:
Bibliographic Details
Main Author: ASOGAN, ARVIN KUMAR
Format: Final Year Project
Language:English
Published: IRC 2020
Subjects:
Online Access:http://utpedia.utp.edu.my/21687/1/17411_Arun%20Kumar.pdf
http://utpedia.utp.edu.my/21687/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The way of news consumption has changed drastically since the past as 87% of the respondent of a survey has cited online as their primary news source which makes the news article more accessible as ever for the readers, but the accessibility may overwhelm the readers due to the vast amount of news that is published online every day. Thus, having news classification is important but with the amount of the news published, we need the help of the machine learning to classify the news article in which this project was set to do. The objective of this project is to find the best machine learning model that can be used for the news classification, developing process for online news extraction and finally developing an automated system to extract and classify the news articles. Support Vector Machine (SVM) with TF-IDF Vectorization method was found to be the best machine learning model for news article classification and online news article extraction process was done using Python script. After that, extracted articles are used to develop the machine learning model with an accuracy of 89.92%. The developed model was then used for the classification process in the automated news classification program which was build in Python as well. At the end, this project has helped to develop an automated news classification system will be helpful for the readers as they are able to view the news articles that are interesting to them.