In this paper we study how we can design an effective parallel crawler. As the size of the Web grows, it becomes imperative to parallelize a crawling process, in order to finish d...
The design and reification of Web Information Systems is a complex task, for which many integrated development methods have been proposed. While all these methods ultimately lead ...
Abstract—Cache pre-filling is emerging as a new concept for increasing the availability of popular web items in cache servers. According to this concept, web items are sent by a...
The MAPA system provides improved navigation facility for large web sites. It extracts a hierarchical structure from an arbitrary web site, with some minimal user assistance, and ...
The Web frequently suffers from failures which affect the performance and consistency of applications run over it. An important fault-tolerance technique is the use of atomic tran...