A Text Mining Algorithm Optimising the Determination of Relevant Studies
In this paper, we develop a text mining algorithm that influences the identification of relevant literature studies. The algorithm consists of three processes, detection process; preparation process; and mining process. The detection process includes the determination of document language and abstra...
Saved in:
Main Authors: | , , |
---|---|
Format: | Conference Paper |
Language: | English |
Published: |
2019
|
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.uniten.dspace-11757 |
---|---|
record_format |
dspace |
spelling |
my.uniten.dspace-117572020-07-07T08:50:09Z A Text Mining Algorithm Optimising the Determination of Relevant Studies Khashfeh, M. Mahmoud, M.A. Ahmad, M.S. In this paper, we develop a text mining algorithm that influences the identification of relevant literature studies. The algorithm consists of three processes, detection process; preparation process; and mining process. The detection process includes the determination of document language and abstract and keywords. The Preparation includes the processes, split content to paragraphs; paragraph length determination; converting text to lower case; text typography factor; content tokenization, removing stop words. Finally, the mining includes the processes, regular expression; normalization; grouping and computing frequency. The proposed algorithm would be useful in providing an alternative means of searching highly relevant content from large databases. © 2018 IEEE. 2019-03-06T07:36:39Z 2019-03-06T07:36:39Z 2018 Conference Paper 10.1109/ISAMSR.2018.8540553 en International Symposium on Agents, Multi-Agent Systems and Robotics 2018, ISAMSR 2018 19 November 2018, Article number 8540553 2018 International Symposium on Agents, Multi-Agent Systems and Robotics, ISAMSR 2018; The Everly PutrajayaPutrajaya; Malaysia; 27 August 2018 through ; Category numberCFP18C71-ART; Code 143006 |
institution |
Universiti Tenaga Nasional |
building |
UNITEN Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Tenaga Nasional |
content_source |
UNITEN Institutional Repository |
url_provider |
http://dspace.uniten.edu.my/ |
language |
English |
description |
In this paper, we develop a text mining algorithm that influences the identification of relevant literature studies. The algorithm consists of three processes, detection process; preparation process; and mining process. The detection process includes the determination of document language and abstract and keywords. The Preparation includes the processes, split content to paragraphs; paragraph length determination; converting text to lower case; text typography factor; content tokenization, removing stop words. Finally, the mining includes the processes, regular expression; normalization; grouping and computing frequency. The proposed algorithm would be useful in providing an alternative means of searching highly relevant content from large databases. © 2018 IEEE. |
format |
Conference Paper |
author |
Khashfeh, M. Mahmoud, M.A. Ahmad, M.S. |
spellingShingle |
Khashfeh, M. Mahmoud, M.A. Ahmad, M.S. A Text Mining Algorithm Optimising the Determination of Relevant Studies |
author_facet |
Khashfeh, M. Mahmoud, M.A. Ahmad, M.S. |
author_sort |
Khashfeh, M. |
title |
A Text Mining Algorithm Optimising the Determination of Relevant Studies |
title_short |
A Text Mining Algorithm Optimising the Determination of Relevant Studies |
title_full |
A Text Mining Algorithm Optimising the Determination of Relevant Studies |
title_fullStr |
A Text Mining Algorithm Optimising the Determination of Relevant Studies |
title_full_unstemmed |
A Text Mining Algorithm Optimising the Determination of Relevant Studies |
title_sort |
text mining algorithm optimising the determination of relevant studies |
publishDate |
2019 |
_version_ |
1672614169059262464 |
score |
13.160551 |