Multi-agent crawling systems (MACS) architecture for effective web retrieval

Recently, many web search engines used for information gathering in World Wide Web (WWW). For instance, Google, Yahoo, AltaVista and others. Web crawler is a program or automated script which browses the WWW in a methodically, automated manner that mainly used to create a copy of all the visited pag...

Full description

Saved in:
Bibliographic Details
Main Authors: Ibrahim, Siti Nurkhadijah Aishah, Selamat, Ali
Format: Conference or Workshop Item
Published: 2007
Subjects:
Online Access:http://eprints.utm.my/id/eprint/14234/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.14234
record_format eprints
spelling my.utm.142342017-08-02T06:10:21Z http://eprints.utm.my/id/eprint/14234/ Multi-agent crawling systems (MACS) architecture for effective web retrieval Ibrahim, Siti Nurkhadijah Aishah Selamat, Ali QA75 Electronic computers. Computer science Recently, many web search engines used for information gathering in World Wide Web (WWW). For instance, Google, Yahoo, AltaVista and others. Web crawler is a program or automated script which browses the WWW in a methodically, automated manner that mainly used to create a copy of all the visited pages for later processing by a search engine that will index the downloaded pages to provide fast searches. From the study, we found that web pages crawled by crawlers will slow down the server. Thus, it makes users refuses to allow crawlers exploring web pages and even worst if they block the crawler’s IP address during entering the web pages. In order to achieve higher accuracy rate, we propose the architecture of multi-agent system in web crawling known as Multi-Agent Crawling System (MACS). Since Java Agent Development Framework (JADE) is one of the most used and promising agent development framework, MACS will be model in Java based on JADE architecture. We expected this model will enhance the network interaction between the web agents and servers. 2007 Conference or Workshop Item PeerReviewed Ibrahim, Siti Nurkhadijah Aishah and Selamat, Ali (2007) Multi-agent crawling systems (MACS) architecture for effective web retrieval. In: Postgraduate Annual Research Seminar (PARS’ 07), 2007, UTM.
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Ibrahim, Siti Nurkhadijah Aishah
Selamat, Ali
Multi-agent crawling systems (MACS) architecture for effective web retrieval
description Recently, many web search engines used for information gathering in World Wide Web (WWW). For instance, Google, Yahoo, AltaVista and others. Web crawler is a program or automated script which browses the WWW in a methodically, automated manner that mainly used to create a copy of all the visited pages for later processing by a search engine that will index the downloaded pages to provide fast searches. From the study, we found that web pages crawled by crawlers will slow down the server. Thus, it makes users refuses to allow crawlers exploring web pages and even worst if they block the crawler’s IP address during entering the web pages. In order to achieve higher accuracy rate, we propose the architecture of multi-agent system in web crawling known as Multi-Agent Crawling System (MACS). Since Java Agent Development Framework (JADE) is one of the most used and promising agent development framework, MACS will be model in Java based on JADE architecture. We expected this model will enhance the network interaction between the web agents and servers.
format Conference or Workshop Item
author Ibrahim, Siti Nurkhadijah Aishah
Selamat, Ali
author_facet Ibrahim, Siti Nurkhadijah Aishah
Selamat, Ali
author_sort Ibrahim, Siti Nurkhadijah Aishah
title Multi-agent crawling systems (MACS) architecture for effective web retrieval
title_short Multi-agent crawling systems (MACS) architecture for effective web retrieval
title_full Multi-agent crawling systems (MACS) architecture for effective web retrieval
title_fullStr Multi-agent crawling systems (MACS) architecture for effective web retrieval
title_full_unstemmed Multi-agent crawling systems (MACS) architecture for effective web retrieval
title_sort multi-agent crawling systems (macs) architecture for effective web retrieval
publishDate 2007
url http://eprints.utm.my/id/eprint/14234/
_version_ 1643646355937165312
score 13.15806