loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
22nd International Conference on Data Engineering Workshops (ICDEW'06)
Finding Thai Web Pages in Foreign Web Spaces
Atlanta, Georgia
April 03-April 07
ISBN: 0-7695-2571-7
Kulwadee Somboonviwat, The University of Tokyo, Japan
Takayuki Tamura, Mitsubishi Electric Corporation
Masaru Kitsuregawa, The University of Tokyo, Japan
This paper proposes language specific web crawling (LSWC) as a method of creating large-scale language specific Web archives for countries with linguistic identities such as Thailand. The LSWC strategy for selectively gathering Thai web pages from virtually anywhere on the Web is derived based on the results of static analyses of the Thai Web graph. We evaluated the performance of the LSWC strategy using a web crawling simulator.
Citation:
Kulwadee Somboonviwat, Takayuki Tamura, Masaru Kitsuregawa, "Finding Thai Web Pages in Foreign Web Spaces," icdew, pp.x135, 22nd International Conference on Data Engineering Workshops (ICDEW'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.