String Processing and Information Retrieval, International Symposium on (1999)
Cancun, Mexico
Sept. 21, 1999 to Sept. 24, 1999
ISBN: 0-7695-0268-7
pp: 184
Altigran S. da Silva , Federal University of Minas Gerais
Eveline A. Veloso , Federal University of Minas Gerais
Paulo B. Golghe , Federal University of Minas Gerais
Berthier Ribeiro-Neto , Federal University of Minas Gerais
Alberto H. F. Laender , Federal University of Minas Gerais
Nivio Ziviani , Federal University of Minas Gerais
ABSTRACT
One of the key components of current Web search engines is the document collector. This paper describes CoBWeb, an automatic document collector whose architecture is distributed and highly scalable. CoBWeb aims to collect a large number of documents per unit of time while observing operational and ethical limits in the crawling process. CoBWeb is part of the SIAM (Information Systems in Mobile Computing Environments) search engine, which is being implemented to support the Brazilian Web. Accordingly, several results related to the Brazilian Web are presented.
INDEX TERMS
Crawling, Search Engine, Web
CITATION

A. H. Laender, N. Ziviani, P. B. Golghe, B. Ribeiro-Neto, A. S. Silva and E. A. Veloso, "CoBWeb - A Crawler for the Brazilian Web," String Processing and Information Retrieval, International Symposium on (SPIRE), Cancun, Mexico, 1999, pp. 184.
doi:10.1109/SPIRE.1999.796594