Multi-Label Text Classification for Indonesian Language IT Journal with K-Nearest Neighbors (KNN)

Classification is the process of finding a model or function that explains or distinguishes concepts or data classes, intending to estimate the category of an object whose label is unknown, and various types of classification, one of which is the classification of text documents. Document text class...

全面介绍

Saved in:
书目详细资料
Main Authors: Redho Aidil, Iqrom, Tri Basuki, Kurniawan
格式: Article
语言:English
出版: INTI International University 2023
主题:
在线阅读:http://eprints.intimal.edu.my/1779/1/jods2023_05.pdf
http://eprints.intimal.edu.my/1779/
http://ipublishing.intimal.edu.my/jods.html
标签: 添加标签
没有标签, 成为第一个标记此记录!
id my-inti-eprints.1779
record_format eprints
spelling my-inti-eprints.17792023-08-18T08:55:29Z http://eprints.intimal.edu.my/1779/ Multi-Label Text Classification for Indonesian Language IT Journal with K-Nearest Neighbors (KNN) Redho Aidil, Iqrom Tri Basuki, Kurniawan Q Science (General) QA75 Electronic computers. Computer science Classification is the process of finding a model or function that explains or distinguishes concepts or data classes, intending to estimate the category of an object whose label is unknown, and various types of classification, one of which is the classification of text documents. Document text classification based on label category is one of the mandatory components in the retrieval system to provide better and more accurate information. Based on existing research, only single-label Classification of text documents is carried out, and it is infrequent for multi-label Classification of IT journals, especially in the Indonesian language. Therefore, this research is aimed at multi-label text classification using the K-Nearest Neighbors (KNN) method, and the OnevsRest Classifier approach model, where the classification process will be determined by the closest k = n value in the category of documents that are similar and the multi-labels are in prediction with One vs. Rest Classifier. Training and testing are done with a dataset of 500 Indonesian IT journals. The test results are sufficient to give good results with an accuracy of 84% and a hamming loss of 0.076. INTI International University 2023-08 Article PeerReviewed text en cc_by_4 http://eprints.intimal.edu.my/1779/1/jods2023_05.pdf Redho Aidil, Iqrom and Tri Basuki, Kurniawan (2023) Multi-Label Text Classification for Indonesian Language IT Journal with K-Nearest Neighbors (KNN). Journal of Data Science, 2023 (05). pp. 1-9. ISSN 2805-5160 http://ipublishing.intimal.edu.my/jods.html
institution INTI International University
building INTI Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider INTI International University
content_source INTI Institutional Repository
url_provider http://eprints.intimal.edu.my
language English
topic Q Science (General)
QA75 Electronic computers. Computer science
spellingShingle Q Science (General)
QA75 Electronic computers. Computer science
Redho Aidil, Iqrom
Tri Basuki, Kurniawan
Multi-Label Text Classification for Indonesian Language IT Journal with K-Nearest Neighbors (KNN)
description Classification is the process of finding a model or function that explains or distinguishes concepts or data classes, intending to estimate the category of an object whose label is unknown, and various types of classification, one of which is the classification of text documents. Document text classification based on label category is one of the mandatory components in the retrieval system to provide better and more accurate information. Based on existing research, only single-label Classification of text documents is carried out, and it is infrequent for multi-label Classification of IT journals, especially in the Indonesian language. Therefore, this research is aimed at multi-label text classification using the K-Nearest Neighbors (KNN) method, and the OnevsRest Classifier approach model, where the classification process will be determined by the closest k = n value in the category of documents that are similar and the multi-labels are in prediction with One vs. Rest Classifier. Training and testing are done with a dataset of 500 Indonesian IT journals. The test results are sufficient to give good results with an accuracy of 84% and a hamming loss of 0.076.
format Article
author Redho Aidil, Iqrom
Tri Basuki, Kurniawan
author_facet Redho Aidil, Iqrom
Tri Basuki, Kurniawan
author_sort Redho Aidil, Iqrom
title Multi-Label Text Classification for Indonesian Language IT Journal with K-Nearest Neighbors (KNN)
title_short Multi-Label Text Classification for Indonesian Language IT Journal with K-Nearest Neighbors (KNN)
title_full Multi-Label Text Classification for Indonesian Language IT Journal with K-Nearest Neighbors (KNN)
title_fullStr Multi-Label Text Classification for Indonesian Language IT Journal with K-Nearest Neighbors (KNN)
title_full_unstemmed Multi-Label Text Classification for Indonesian Language IT Journal with K-Nearest Neighbors (KNN)
title_sort multi-label text classification for indonesian language it journal with k-nearest neighbors (knn)
publisher INTI International University
publishDate 2023
url http://eprints.intimal.edu.my/1779/1/jods2023_05.pdf
http://eprints.intimal.edu.my/1779/
http://ipublishing.intimal.edu.my/jods.html
_version_ 1775628425223995392
score 13.154949