Automatic classification using concept knowledge of web documents

To classify web documents, we propose a method that uses concept knowledge for each category. In our study, concept relations between keywords are extracted using hyperlink information; the extracted keywords are then classified into categories and used as an index. Then TFIDF for eac...

Bibliographic Details
Main Authors: Choi, Sang-Ho, Park, Sa-Joon, Hwang, Su-Cheol, Kim, Ki-Tae
Format: Conference or Workshop Item
Language: English
Published: 2004
Subjects: QA76 Computer software
Online Access: http://repo.uum.edu.my/13843/1/KM112.pdf
http://repo.uum.edu.my/13843/
http://www.kmice.cms.net.my
id my.uum.repo.13843
record_format eprints
spelling my.uum.repo.13843 2015-04-13T08:56:36Z http://repo.uum.edu.my/13843/ Automatic classification using concept knowledge of web documents Choi, Sang-Ho Park, Sa-Joon Hwang, Su-Cheol Kim, Ki-Tae QA76 Computer software To classify web documents, we propose a method that uses concept knowledge for each category. In our study, concept relations between keywords are extracted using hyperlink information; the extracted keywords are then classified into categories and used as an index. TFIDF is then extended for each category to determine index weight values. A system was built for experimentation and evaluation, consisting of a web robot, an indexer, a concept knowledge database for each category, and a document classifier. With the extended TFIDF method applied, our system shows an accuracy of 88% in automatic classification of web documents. 2004-02-14 Conference or Workshop Item PeerReviewed application/pdf en http://repo.uum.edu.my/13843/1/KM112.pdf Choi, Sang-Ho and Park, Sa-Joon and Hwang, Su-Cheol and Kim, Ki-Tae (2004) Automatic classification using concept knowledge of web documents. In: Knowledge Management International Conference and Exhibition 2004 (KMICE 2004), 14-15 February 2004, Evergreen Laurel Hotel, Penang. http://www.kmice.cms.net.my
institution Universiti Utara Malaysia
building UUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Utara Malaysia
content_source UUM Institutional Repository
url_provider http://repo.uum.edu.my/
language English
topic QA76 Computer software
spellingShingle QA76 Computer software
Choi, Sang-Ho
Park, Sa-Joon
Hwang, Su-Cheol
Kim, Ki-Tae
Automatic classification using concept knowledge of web documents
description To classify web documents, we propose a method that uses concept knowledge for each category. In our study, concept relations between keywords are extracted using hyperlink information; the extracted keywords are then classified into categories and used as an index. TFIDF is then extended for each category to determine index weight values. A system was built for experimentation and evaluation, consisting of a web robot, an indexer, a concept knowledge database for each category, and a document classifier. With the extended TFIDF method applied, our system shows an accuracy of 88% in automatic classification of web documents.
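The record does not give the formula for the extended TFIDF, so the following is only a minimal sketch of one common category-level variant, not the authors' method. It assumes that each category's training pages are pooled into one bag of keywords, that a keyword's weight is its frequency in the category times the log of the inverse category frequency, and that a document is assigned to the category with the highest summed weight. The names category_tfidf, classify, and the toy data are hypothetical, and hyperlink-based concept relations are not modeled.

```python
import math
from collections import Counter


def category_tfidf(category_docs):
    """Compute a per-category keyword weight table.

    category_docs maps a category name to a list of tokenized pages.
    Assumption: weight = (term frequency within the category)
                         * log(number of categories / categories containing the term).
    """
    # Pool every page of a category into one bag of keywords.
    tf = {
        cat: Counter(tok for page in pages for tok in page)
        for cat, pages in category_docs.items()
    }
    n_categories = len(tf)
    # Category frequency: in how many categories each term occurs.
    cf = Counter(term for counts in tf.values() for term in counts)

    weights = {}
    for cat, counts in tf.items():
        total = sum(counts.values()) or 1  # avoid division by zero for empty categories
        weights[cat] = {
            term: (count / total) * math.log(n_categories / cf[term])
            for term, count in counts.items()
        }
    return weights


def classify(tokens, weights):
    """Return the category whose keyword weights sum highest over the tokens."""
    scores = {
        cat: sum(table.get(tok, 0.0) for tok in tokens)
        for cat, table in weights.items()
    }
    return max(scores, key=scores.get)


# Hypothetical toy data standing in for indexed web pages.
category_docs = {
    "sports": [["football", "goal", "team"], ["team", "match", "goal"]],
    "computing": [["software", "code", "team"], ["code", "compiler", "software"]],
}
weights = category_tfidf(category_docs)

print(classify(["goal", "match", "team"], weights))  # sports
print(classify(["code", "compiler"], weights))       # computing
```

In this sketch a keyword that occurs in every category (here "team") receives weight zero, so only category-specific keywords influence the decision.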
format Conference or Workshop Item
author Choi, Sang-Ho
Park, Sa-Joon
Hwang, Su-Cheol
Kim, Ki-Tae
author_sort Choi, Sang-Ho
title Automatic classification using concept knowledge of web documents
publishDate 2004
url http://repo.uum.edu.my/13843/1/KM112.pdf
http://repo.uum.edu.my/13843/
http://www.kmice.cms.net.my