The Analysis Of Metadata Based Classification For Classifying Educational Websites
Initially, websites could be easily categorized by their domain extensions. However, with the explosion of the internet, domain name restrictions are no longer adhered to. Web classification can help categorize websites, especially educational websites, which are the focus of this research....
Main Author: | Zaraini, Mohd Nazrien |
---|---|
Format: | Thesis |
Language: | English |
Published: | 2016 |
Subjects: | Z665 Library Science. Information Science |
Online Access: | http://eprints.utem.edu.my/id/eprint/18198/1/The%20Analysis%20Of%20Metadata%20Based%20Classification%20For%20Classifying%20Educational%20Websites%2024%20Pages.pdf http://eprints.utem.edu.my/id/eprint/18198/2/The%20Analysis%20Of%20Metadata%20Based%20Classification%20For%20Classifying%20Educational%20Websites.pdf http://eprints.utem.edu.my/id/eprint/18198/ https://plh.utem.edu.my/cgi-bin/koha/opac-detail.pl?biblionumber=100103 |
id |
my.utem.eprints.18198 |
---|---|
record_format |
eprints |
spelling |
my.utem.eprints.18198 2021-10-08T07:46:31Z http://eprints.utem.edu.my/id/eprint/18198/ The Analysis Of Metadata Based Classification For Classifying Educational Websites Zaraini, Mohd Nazrien Z665 Library Science. Information Science 2016 Thesis NonPeerReviewed text en http://eprints.utem.edu.my/id/eprint/18198/1/The%20Analysis%20Of%20Metadata%20Based%20Classification%20For%20Classifying%20Educational%20Websites%2024%20Pages.pdf text en http://eprints.utem.edu.my/id/eprint/18198/2/The%20Analysis%20Of%20Metadata%20Based%20Classification%20For%20Classifying%20Educational%20Websites.pdf Zaraini, Mohd Nazrien (2016) The Analysis Of Metadata Based Classification For Classifying Educational Websites. Masters thesis, Universiti Teknikal Malaysia Melaka. https://plh.utem.edu.my/cgi-bin/koha/opac-detail.pl?biblionumber=100103 |
institution |
Universiti Teknikal Malaysia Melaka |
building |
UTEM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknikal Malaysia Melaka |
content_source |
UTEM Institutional Repository |
url_provider |
http://eprints.utem.edu.my/ |
language |
English |
topic |
Z665 Library Science. Information Science |
spellingShingle |
Z665 Library Science. Information Science Zaraini, Mohd Nazrien The Analysis Of Metadata Based Classification For Classifying Educational Websites |
description |
Initially, websites could be easily categorized by their domain extensions. However, with the explosion of the internet, domain name restrictions are no longer adhered to. Web classification can help categorize websites, especially educational websites, which are the focus of this research. Classification was performed on both content and metadata in order to measure the impact of metadata implementation on classification accuracy. Three sets of 200 predetermined educational websites taken from the DMOZ directory were used as training data. This is the total number of educational websites with metadata information available in that directory. For content-based classification, keywords were extracted from the page contents and ranked by TF-IDF to obtain the top educational keywords. These keywords were used as the attributes of the training dataset for educational web classification. Metadata-based classification followed the same method, except that the keywords were taken from each site's meta description. A one-class support vector machine was used because this research focuses on single-class classification only. Cross-validation and two sets of test data, one containing only educational websites and one containing websites from various categories, were used to validate the research. The results show that content-based classification is more accurate than metadata-based classification. The research also yields the top-ranking educational keywords and an analysis of metadata implementation, derived from the information retrieval and web classification process. |
format |
Thesis |
author |
Zaraini, Mohd Nazrien |
author_facet |
Zaraini, Mohd Nazrien |
author_sort |
Zaraini, Mohd Nazrien |
title |
The Analysis Of Metadata Based Classification For Classifying Educational Websites |
title_short |
The Analysis Of Metadata Based Classification For Classifying Educational Websites |
title_full |
The Analysis Of Metadata Based Classification For Classifying Educational Websites |
title_fullStr |
The Analysis Of Metadata Based Classification For Classifying Educational Websites |
title_full_unstemmed |
The Analysis Of Metadata Based Classification For Classifying Educational Websites |
title_sort |
analysis of metadata based classification for classifying educational websites |
publishDate |
2016 |
url |
http://eprints.utem.edu.my/id/eprint/18198/1/The%20Analysis%20Of%20Metadata%20Based%20Classification%20For%20Classifying%20Educational%20Websites%2024%20Pages.pdf http://eprints.utem.edu.my/id/eprint/18198/2/The%20Analysis%20Of%20Metadata%20Based%20Classification%20For%20Classifying%20Educational%20Websites.pdf http://eprints.utem.edu.my/id/eprint/18198/ https://plh.utem.edu.my/cgi-bin/koha/opac-detail.pl?biblionumber=100103 |
_version_ |
1715193889835450368 |
score |
13.160551 |
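
The description above mentions ranking keywords by TF-IDF to obtain the top educational keywords. The record does not state which TF-IDF variant or tooling the thesis used, so the following is only a minimal sketch under stated assumptions: a smoothed IDF of log((1 + N) / (1 + df)) + 1, scores summed across documents, and a tiny hypothetical stop-word list. The function names and the sample meta descriptions are invented for illustration and do not come from the thesis.

```python
import math
import re
from collections import Counter

# Hypothetical stop-word list; a real pipeline would use a fuller one.
STOP_WORDS = {"a", "and", "for", "in", "of", "the", "to"}


def tokenize(text):
    """Lowercase the text and return alphabetic tokens minus stop words."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]


def top_keywords(documents, k=5):
    """Rank terms by TF-IDF summed over all documents (assumed aggregation)."""
    token_lists = [tokenize(doc) for doc in documents]
    n_docs = len(token_lists)

    # Document frequency: how many documents contain each term.
    doc_freq = Counter()
    for tokens in token_lists:
        doc_freq.update(set(tokens))

    scores = Counter()
    for tokens in token_lists:
        if not tokens:
            continue  # skip documents with no usable tokens
        for term, count in Counter(tokens).items():
            tf = count / len(tokens)
            # Smoothed IDF (an assumption; other variants are common).
            idf = math.log((1 + n_docs) / (1 + doc_freq[term])) + 1
            scores[term] += tf * idf

    # Highest score first; ties are broken alphabetically.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:k]


# Hypothetical meta descriptions standing in for real website metadata.
sample_descriptions = [
    "University courses and degrees",
    "Online courses for students",
    "School admissions and courses",
]

if __name__ == "__main__":
    for term, score in top_keywords(sample_descriptions, k=3):
        print(f"{term}: {score:.3f}")
```

In a setup like the one described, the top-ranked terms would serve as the attributes of the training dataset passed to a one-class classifier; the same function could be applied either to page contents or to meta descriptions to compare the two approaches.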