WEB-BASED ARTICLE SUMMARIZATION WITH MACHINE LEARNING TECHNIQUES

The motivation behind this project is the increasing amount of information available on the internet, which makes it difficult for people to sift through and find the relevant information they need. Text summarization can help to address this problem by condensing lengthy texts into shorter su...

Full description

Saved in:
Bibliographic Details
Main Author: Lim, Wu Tong
Format: Final Year Project Report
Language:English
English
Published: Universiti Malaysia Sarawak, (UNIMAS) 2023
Subjects:
Online Access:http://ir.unimas.my/id/eprint/44163/1/Lim%20Wu%20Tong%20%2824%20pgs%29.pdf
http://ir.unimas.my/id/eprint/44163/2/Lim%20Wu%20Tong%20ft.pdf
http://ir.unimas.my/id/eprint/44163/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The motivation behind this project is the increasing amount of information available on the internet, which makes it difficult for people to sift through and find the relevant information they need. Text summarization can help to address this problem by condensing lengthy texts into shorter summaries that convey the main points and ideas of the original text. However, traditional text summarization methods often produce summaries that are too short or lack coherence, which can make them difficult to understand. Machine learning techniques have the potential to overcome these limitations and produce more accurate and coherent summaries. In order to develop the web-based article summarization system, various machine learning techniques were studied and compared. The Naive Bayes, Neural Network, and decision tree techniques were chosen for their ability to handle both numerical and categorical data, and their robustness to noise and missing values. These techniques were implemented using the Python programming language and the scikit-learn library. The front-end of the system was developed using the Django framework, along with HTML, CSS and JavaScript for styling and interactive elements. The performance of the system was evaluated using a dataset of articles and their corresponding summaries. The quality of the summaries was assessed using metrics such as ROUGE and expert evaluation, while the preferredness were evaluated through user surveys and time efficiency observed from the system. The results showed that the system was able to produce summaries that were of good quality, preferred by users, and efficient in terms of time. Overall, the web-based article summarization system with machine learning techniques demonstrated the potential to be a useful tool for condensing and summarizing texts in a more accurate and coherent manner