Document ranking using information quality criteria in weblog search engine

Social media has revolutionized the Web industry. The weblog medium is, fundamentally, an innovation in personal publishing. It has also come to engender a new form of social interaction on the web. Because much firsthand information is recorded in blog posts, more and more people tend to search their w...

Bibliographic Details
Main Author: Azimzadeh, Fatemeh
Format: Thesis
Language: English
Published: 2013
Online Access: http://psasir.upm.edu.my/id/eprint/38937/1/FK%202013%204R.pdf
http://psasir.upm.edu.my/id/eprint/38937/
id my.upm.eprints.38937
record_format eprints
spelling my.upm.eprints.38937 2016-01-18T08:50:24Z http://psasir.upm.edu.my/id/eprint/38937/ Document ranking using information quality criteria in weblog search engine Azimzadeh, Fatemeh 2013-01 Thesis NonPeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/38937/1/FK%202013%204R.pdf Azimzadeh, Fatemeh (2013) Document ranking using information quality criteria in weblog search engine. PhD thesis, Universiti Putra Malaysia.
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
language English
description Social media has revolutionized the Web industry. The weblog medium is, fundamentally, an innovation in personal publishing. It has also come to engender a new form of social interaction on the web. Because much firsthand information is recorded in blog posts, more and more people search blog sites for the information they want. A major problem is that a weblog includes features not found in traditional Web pages, such as posts, links, tags, and comments. Thus, traditional ranking algorithms used in general search engines, such as PageRank and HITS, are not appropriate for evaluating weblog posts because they do not consider blog-specific features. On the other hand, information quality criteria are important factors for users. Weblogs contain unfiltered information without expert peer review, yet users expect search engines to deliver quality information for their queries. Few frameworks consider information quality criteria in weblog search engines. This thesis establishes an integrated framework that incorporates information quality criteria into the ranking function of a search engine for Persian weblogs. The framework ranks weblogs and posts based on selected information quality criteria. The ranking scores are then merged with relevancy in the search engine. A ranking method is developed for the weblog search engine in which the post is treated as the retrieved document. This thesis proposes two ranking functions that combine relevancy with the information quality criteria, and compares them with a PageRank-based ranking function. The results reveal that combining quality criteria with relevancy, without a suitable weight for each, does not lead to user satisfaction. Instead, applying proper weights to both the information quality factors and relevancy noticeably improves the search engine's results and consequently leads to user satisfaction.
format Thesis
author Azimzadeh, Fatemeh
title Document ranking using information quality criteria in weblog search engine
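
The abstract reports that merging quality scores with relevancy only helps when each component is suitably weighted. This record does not give the thesis's actual ranking functions, so the following is only a minimal sketch of one common way to do such a merge: a normalized weighted linear combination. The names `ScoredPost`, `combined_score`, and `rank_posts`, the default weights, and the assumption that both scores are already scaled to [0, 1] are all hypothetical and are not taken from the thesis.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class ScoredPost:
    """A retrieved weblog post with two precomputed scores (hypothetical type)."""
    post_id: str
    relevance: float  # query relevance, assumed already scaled to [0, 1]
    quality: float    # aggregated information-quality score, assumed in [0, 1]


def combined_score(post: ScoredPost, w_relevance: float, w_quality: float) -> float:
    """Weighted linear merge of relevance and quality (an assumed form)."""
    if w_relevance < 0 or w_quality < 0:
        raise ValueError("weights must be non-negative")
    total = w_relevance + w_quality
    if total == 0:
        raise ValueError("at least one weight must be positive")
    # Dividing by the weight sum keeps the result in [0, 1].
    return (w_relevance * post.relevance + w_quality * post.quality) / total


def rank_posts(posts: Iterable[ScoredPost],
               w_relevance: float = 0.7,
               w_quality: float = 0.3) -> List[ScoredPost]:
    """Return posts ordered by combined score, best first.

    The default weights are illustrative placeholders, not tuned values.
    """
    return sorted(
        posts,
        key=lambda p: combined_score(p, w_relevance, w_quality),
        reverse=True,
    )
```

Changing the two weights changes the ordering, which is why an untuned combination can rank worse than either signal alone. In practice the weights would be chosen against user judgments rather than fixed by hand.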