21st International Conference on Data Engineering Workshops (ICDEW'05)
Simulation Study of Language Specific Web Crawling
Tokyo, Japan
April 05-April 08
ISBN: 0-7695-2657-8
The Web has been recognized as an important part of our cultural heritage. Many nations started archiving national web spaces for future generations. A key technology for data acquisition employed by these archiving projects is web crawling. Crawling cultural and/or linguistic specific resources from the borderless Web raises many challenging issues. In this paper, we propose the language specific web crawling and evaluate the language specific crawling strategies on the web crawling simulator.
Citation:
Kulwadee Somboonviwat, Masaru Kitsuregawa, Takayuki Tamura , "Simulation Study of Language Specific Web Crawling," icdew, pp.1254, 21st International Conference on Data Engineering Workshops (ICDEW'05), 2005