Genetic algorithm based sentence extraction for text summarization

The goal of text summarization is to generate summary of the original text that helps the user to quickly understand large volumes of information available in that text. This paper focuses on text summarization based on sentence extraction. One of the methods to obtain suitable sentences is to assig...

Full description

Saved in:
Bibliographic Details
Main Authors: Suanmali, Ladda, Salim, Naomie, Binwahlan, Mohammed Salem
Format: Article
Language:English
Published: Penerbit UTM Press 2011
Subjects:
Online Access:http://eprints.utm.my/id/eprint/39945/1/NaomieSalim2011_GeneticAlgorithmbasedSentenceExtraction.pdf
http://eprints.utm.my/id/eprint/39945/
http://se.fc.utm.my/ijic/index.php/ijic/article/view/6
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.39945
record_format eprints
spelling my.utm.399452019-03-05T01:38:18Z http://eprints.utm.my/id/eprint/39945/ Genetic algorithm based sentence extraction for text summarization Suanmali, Ladda Salim, Naomie Binwahlan, Mohammed Salem QA75 Electronic computers. Computer science The goal of text summarization is to generate summary of the original text that helps the user to quickly understand large volumes of information available in that text. This paper focuses on text summarization based on sentence extraction. One of the methods to obtain suitable sentences is to assign some numerical measure for sentences called sentence weighting and then select the best ones. The first step in summarization by extraction is the identification of important features. In this paper, we consider the effectiveness of the features selected using Genetic Algorithm (GA). GA is used for the training of 100 documents in DUC 2002 data set to learn the weight of each feature, which is evaluated using recall measurement generated by ROUGE for a fitness function. The weights obtained by GA were used to adjust the important features score. We compare our results with Microsoft Word 2007 summarizer and Copernic summarizer both for 100 documents and 62 unseen documents. The results show that the best average precision, recall, and f-measure for the summaries were obtained by GA.Â. Penerbit UTM Press 2011 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/39945/1/NaomieSalim2011_GeneticAlgorithmbasedSentenceExtraction.pdf Suanmali, Ladda and Salim, Naomie and Binwahlan, Mohammed Salem (2011) Genetic algorithm based sentence extraction for text summarization. International Journal of Innovative Computing, 1 (1). ISSN 2180-4370 http://se.fc.utm.my/ijic/index.php/ijic/article/view/6
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Suanmali, Ladda
Salim, Naomie
Binwahlan, Mohammed Salem
Genetic algorithm based sentence extraction for text summarization
description The goal of text summarization is to generate summary of the original text that helps the user to quickly understand large volumes of information available in that text. This paper focuses on text summarization based on sentence extraction. One of the methods to obtain suitable sentences is to assign some numerical measure for sentences called sentence weighting and then select the best ones. The first step in summarization by extraction is the identification of important features. In this paper, we consider the effectiveness of the features selected using Genetic Algorithm (GA). GA is used for the training of 100 documents in DUC 2002 data set to learn the weight of each feature, which is evaluated using recall measurement generated by ROUGE for a fitness function. The weights obtained by GA were used to adjust the important features score. We compare our results with Microsoft Word 2007 summarizer and Copernic summarizer both for 100 documents and 62 unseen documents. The results show that the best average precision, recall, and f-measure for the summaries were obtained by GA.Â.
format Article
author Suanmali, Ladda
Salim, Naomie
Binwahlan, Mohammed Salem
author_facet Suanmali, Ladda
Salim, Naomie
Binwahlan, Mohammed Salem
author_sort Suanmali, Ladda
title Genetic algorithm based sentence extraction for text summarization
title_short Genetic algorithm based sentence extraction for text summarization
title_full Genetic algorithm based sentence extraction for text summarization
title_fullStr Genetic algorithm based sentence extraction for text summarization
title_full_unstemmed Genetic algorithm based sentence extraction for text summarization
title_sort genetic algorithm based sentence extraction for text summarization
publisher Penerbit UTM Press
publishDate 2011
url http://eprints.utm.my/id/eprint/39945/1/NaomieSalim2011_GeneticAlgorithmbasedSentenceExtraction.pdf
http://eprints.utm.my/id/eprint/39945/
http://se.fc.utm.my/ijic/index.php/ijic/article/view/6
_version_ 1643650400982663168
score 13.15806