Web Congress, Latin American (2003)
Santiago, Chile
Nov. 10, 2003 to Nov. 12, 2003
ISBN: 0-7695-2058-8
pp: 212
Carlos Castillo, Universidad de Chile
<p>Search engines provide search results based on a large repository of pages downloaded by a web crawler from several servers. To provide the best results, this repository must be kept as fresh as possible, but this is difficult because of the large volume of pages involved and because polling is the only method for detecting changes.</p> <p>In this paper, we explore and compare several alternatives for keeping repositories fresh that involve some degree of cooperation from servers.</p>
