Graph-based extractive text summarization method for Hausa text

Automatic text summarization is one of the most promising solutions to the ever-growing challenges of textual data as it produces a shorter version of the original document with fewer bytes, but the same information as the original document. Despite the advancements in automatic text summarization r...

Full description

Saved in:
Bibliographic Details
Main Authors: Abubakar Bichi, Abdulkadir, Samsudin, Ruhaidah, Hassan, Rohayanti, Abdallah Hasan, Layla Rasheed, Ado Rogo, Abubakar
Format: Article
Language:English
Published: Public Library of Science 2023
Subjects:
Online Access:http://eprints.utm.my/106400/1/AbdulkadirAbubakarBichi2023_GraphbasedExtractiveTextSummarizationMethod.pdf
http://eprints.utm.my/106400/
http://dx.doi.org/10.1371/journal.pone.0285376
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.106400
record_format eprints
spelling my.utm.1064002024-06-29T07:18:45Z http://eprints.utm.my/106400/ Graph-based extractive text summarization method for Hausa text Abubakar Bichi, Abdulkadir Samsudin, Ruhaidah Hassan, Rohayanti Abdallah Hasan, Layla Rasheed Ado Rogo, Abubakar QA75 Electronic computers. Computer science Automatic text summarization is one of the most promising solutions to the ever-growing challenges of textual data as it produces a shorter version of the original document with fewer bytes, but the same information as the original document. Despite the advancements in automatic text summarization research, research involving the development of automatic text summarization methods for documents written in Hausa, a Chadic language widely spoken in West Africa by approximately 150,000,000 people as either their first or second language, is still in early stages of development. This study proposes a novel graph-based extractive single-document summarization method for Hausa text by modifying the existing PageRank algorithm using the normalized common bigrams count between adjacent sentences as the initial vertex score. The proposed method is evaluated using a primarily collected Hausa summarization evaluation dataset comprising of 113 Hausa news articles on ROUGE evaluation toolkits. The proposed approach outperformed the standard methods using the same datasets. It outperformed the TextRank method by 2.1%, LexRank by 12.3%, centroid-based method by 19.5%, and BM25 method by 17.4% Public Library of Science 2023-05 Article PeerReviewed application/pdf en http://eprints.utm.my/106400/1/AbdulkadirAbubakarBichi2023_GraphbasedExtractiveTextSummarizationMethod.pdf Abubakar Bichi, Abdulkadir and Samsudin, Ruhaidah and Hassan, Rohayanti and Abdallah Hasan, Layla Rasheed and Ado Rogo, Abubakar (2023) Graph-based extractive text summarization method for Hausa text. PLoS ONE, 18 (5). pp. 1-15. ISSN 1932-6203 http://dx.doi.org/10.1371/journal.pone.0285376 DOI:10.1371/journal.pone.0285376
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Abubakar Bichi, Abdulkadir
Samsudin, Ruhaidah
Hassan, Rohayanti
Abdallah Hasan, Layla Rasheed
Ado Rogo, Abubakar
Graph-based extractive text summarization method for Hausa text
description Automatic text summarization is one of the most promising solutions to the ever-growing challenges of textual data as it produces a shorter version of the original document with fewer bytes, but the same information as the original document. Despite the advancements in automatic text summarization research, research involving the development of automatic text summarization methods for documents written in Hausa, a Chadic language widely spoken in West Africa by approximately 150,000,000 people as either their first or second language, is still in early stages of development. This study proposes a novel graph-based extractive single-document summarization method for Hausa text by modifying the existing PageRank algorithm using the normalized common bigrams count between adjacent sentences as the initial vertex score. The proposed method is evaluated using a primarily collected Hausa summarization evaluation dataset comprising of 113 Hausa news articles on ROUGE evaluation toolkits. The proposed approach outperformed the standard methods using the same datasets. It outperformed the TextRank method by 2.1%, LexRank by 12.3%, centroid-based method by 19.5%, and BM25 method by 17.4%
format Article
author Abubakar Bichi, Abdulkadir
Samsudin, Ruhaidah
Hassan, Rohayanti
Abdallah Hasan, Layla Rasheed
Ado Rogo, Abubakar
author_facet Abubakar Bichi, Abdulkadir
Samsudin, Ruhaidah
Hassan, Rohayanti
Abdallah Hasan, Layla Rasheed
Ado Rogo, Abubakar
author_sort Abubakar Bichi, Abdulkadir
title Graph-based extractive text summarization method for Hausa text
title_short Graph-based extractive text summarization method for Hausa text
title_full Graph-based extractive text summarization method for Hausa text
title_fullStr Graph-based extractive text summarization method for Hausa text
title_full_unstemmed Graph-based extractive text summarization method for Hausa text
title_sort graph-based extractive text summarization method for hausa text
publisher Public Library of Science
publishDate 2023
url http://eprints.utm.my/106400/1/AbdulkadirAbubakarBichi2023_GraphbasedExtractiveTextSummarizationMethod.pdf
http://eprints.utm.my/106400/
http://dx.doi.org/10.1371/journal.pone.0285376
_version_ 1803335002983759872
score 13.159267