Buchcover von Design of A Priority Based Frequency Regulated Incremental Crawler
Buchtitel:

Design of A Priority Based Frequency Regulated Incremental Crawler

LAP LAMBERT Academic Publishing (02.07.2014 )

Books loader

Omni badge gutscheinfähig
ISBN-13:

978-3-659-57001-8

ISBN-10:
365957001X
EAN:
9783659570018
Buchsprache:
Englisch
Klappentext:
People are likely to surf the web using search engines. Crawler (a part of search engine) continuously crawls the web and keeps its collection as fresh as possible. An efficient crawler should address issues like unnecessary burden on the web crawler, parallelism of the crawling process, freshness of the web contents discovered and revisiting frequency of web pages. When pages are changing very fast then these crawlers need to visit the pages as frequently as possible. Today when web size has become very large; these revisits not only engage the network traffic for a longer time but the crawler will also not be able to crawl the complete web in feasible time. In this work, an alternate approach for optimizing the frequency of visits to sites, and a mechanism for computing the dynamic priority for any site has been developed. It adjusts the frequency of visit by dynamically assigning a priority to a site. It employs an ecology of crawl workers to crawl the web sites. The architecture designed is not only incremental but also scalable that can be parallelized at URL queues and crawl workers level, responsible for downloading documents from the web.
Verlag:
LAP LAMBERT Academic Publishing
Webseite:
https://www.lap-publishing.com/
von (Autor):
Niraj Singhal
Seitenanzahl:
104
Veröffentlicht am:
02.07.2014
Lagerbestand:
Lieferbar
Kategorie:
Informatik, EDV
Preis:
4.938,26 руб
Stichworte:
freshness, frequency, web Information Retrieval

Books loader

Newsletter

Adyen::diners Adyen::jcb Adyen::discover Adyen::amex Adyen::mc Adyen::visa Adyen::cup Adyen::unionpay Paypal CryptoWallet

  0 Produkte im Warenkorb
Warenkorb bearbeiten
Loading frontend
LOADING