Document ranking using information quality criteria in weblog search engine
Social media has revolutionized the Web industry. The weblog medium is, fundamentally, an innovation in personal publishing. It has also come to engender a new form of social interaction on the web. Because much firsthand information is recorded in blog posts, more and more people tend to search their w...
Main Author: | Azimzadeh, Fatemeh |
---|---|
Format: | Thesis |
Language: | English |
Published: | 2013 |
Online Access: | http://psasir.upm.edu.my/id/eprint/38937/1/FK%202013%204R.pdf http://psasir.upm.edu.my/id/eprint/38937/ |
id |
my.upm.eprints.38937 |
---|---|
record_format |
eprints |
spelling |
my.upm.eprints.38937 2016-01-18T08:50:24Z http://psasir.upm.edu.my/id/eprint/38937/ Document ranking using information quality criteria in weblog search engine Azimzadeh, Fatemeh 2013-01 Thesis NonPeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/38937/1/FK%202013%204R.pdf Azimzadeh, Fatemeh (2013) Document ranking using information quality criteria in weblog search engine. PhD thesis, Universiti Putra Malaysia. |
institution |
Universiti Putra Malaysia |
building |
UPM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Putra Malaysia |
content_source |
UPM Institutional Repository |
url_provider |
http://psasir.upm.edu.my/ |
language |
English |
description |
Social media has revolutionized the Web industry. The weblog medium is, fundamentally, an innovation in personal publishing. It has also come to engender a new form of social interaction on the web. Because much firsthand information is recorded in blog posts, more and more people search blog sites for the information they want. A major problem is that a weblog includes features not found on traditional Web pages, such as posts, links, tags, and comments. Thus, traditional ranking algorithms used in general search engines, such as PageRank and HITS, are not appropriate for evaluating weblog posts, because they do not consider blog-specific features. At the same time, information quality criteria are important to users. Weblogs contain unfiltered information that has not undergone expert peer review, so users expect search engines to deliver quality information for their queries. Few frameworks consider information quality criteria in weblog search engines. This thesis establishes an integrated framework that incorporates information quality criteria into the ranking function of a search engine for Persian weblogs. The framework ranks weblogs and posts based on selected information quality criteria, and the ranking scores are then merged with relevancy in the search engine. A ranking method is developed for the weblog search engine in which the post is treated as the retrieved document. The thesis proposes two ranking functions that are combined with the information quality criteria and compares them with a PageRank-based ranking function. The results reveal that combining quality criteria with relevancy, without a suitable weight for each, does not lead to user satisfaction. Instead, applying proper weights to both the information quality factors and relevancy clearly improves the search engine's results and consequently leads to user satisfaction. |
format |
Thesis |
author |
Azimzadeh, Fatemeh |
title |
Document ranking using information quality criteria in weblog search engine |
publishDate |
2013 |
url |
http://psasir.upm.edu.my/id/eprint/38937/1/FK%202013%204R.pdf http://psasir.upm.edu.my/id/eprint/38937/ |
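The abstract's central finding is that merging relevancy with information quality scores only helps when each is given a suitable weight. The record does not give the thesis's actual ranking functions, so the following is only a minimal sketch of one common way to do such a merge: a normalized weighted linear combination. The `Post` type, the field names, the `[0, 1]` score ranges, and the example weights are all assumptions made for illustration, not values from the thesis.

```python
from dataclasses import dataclass


@dataclass
class Post:
    """A retrieved weblog post (hypothetical structure for illustration)."""
    post_id: str
    relevance: float  # assumed query relevancy score, normalized to [0, 1]
    quality: float    # assumed aggregated information quality score in [0, 1]


def combined_score(post: Post, w_relevance: float, w_quality: float) -> float:
    """Weighted linear combination of relevancy and quality (an assumed form)."""
    total = w_relevance + w_quality
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return (w_relevance * post.relevance + w_quality * post.quality) / total


def rank_posts(posts, w_relevance=0.7, w_quality=0.3):
    """Return posts sorted by combined score, highest first."""
    return sorted(
        posts,
        key=lambda p: combined_score(p, w_relevance, w_quality),
        reverse=True,
    )


# Illustrative data only; these scores are made up.
posts = [
    Post("a", relevance=0.9, quality=0.2),
    Post("b", relevance=0.6, quality=0.8),
    Post("c", relevance=0.4, quality=0.5),
]

relevance_heavy = [p.post_id for p in rank_posts(posts, 0.7, 0.3)]
equal_weights = [p.post_id for p in rank_posts(posts, 0.5, 0.5)]
print(relevance_heavy, equal_weights)
```

In this made-up example, changing the weights alone reorders the top two posts, which is the kind of sensitivity the abstract points to when it says unweighted combination does not satisfy users.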