loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2008 Ninth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing
A Hierarchical Cache Scheme for the Large-scale Web Search Engine
August 06-August 08
ISBN: 978-0-7695-3263-9
Over the past decade, much research has been done to solve technical challenges regarding the web search engine, such as crawling web documents, high performance indexes, and ranking systems using hyperlink analysis. However, implementation details of its query processing system are rarely dealt with in the literature. In this paper we present a distributed architecture for the query processing system and its hierarchal cache scheme. Our paper is based on the development experience of a commercial web search engine designed to answer 5 million user queries against over 6.5 million web pages per day. Using the hierarchal cache scheme, we keep a portion of query results in multi-level caches so that excessive I/O or CPU time is not used for query processing. With that scheme, it is possible to reduce around 70% of the server costs.
Index Terms:
searche engine, large-scale cache
Citation:
Sungchae Lim, Joonseon Ahn, "A Hierarchical Cache Scheme for the Large-scale Web Search Engine," snpd, pp.925-930, 2008 Ninth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2008
Usage of this product signifies your acceptance of the Terms of Use.