loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
International Workshop on Data Engineering Issues in E-Commerce
Using Semantic Information to Improve Transparent Query Caching for Dynamic Content Web Sites
Tokyo, Japan
April 09-April 09
ISBN: 0-7695-2401-X
Gokul Soundararajan, Department of Electrical and Computer Engineering, University of Toronto
Cristiana Amza, Department of Electrical and Computer Engineering, University of Toronto

In this paper, we study the use of semantic information to improve performance of transparent query caching for dynamic content web sites. We observe that in dynamic content web applications, the most recently inserted items are also the ones that register the highest activity. For example, the newest books in a bookstore are also the ones more frequently browsed and bought. Hence, assuming repeatable queries, a particular read-only query response is likely to incrementally change as new rows are added to the query?s tables. We avoid the cached query response invalidations that would otherwise occur due to the addition of new items by keeping the newly inserted rows in small temporary tables. This allows us to reuse cached responses for partial coverage of query results. A query result is then obtained from merging an existing cached response with one or more lightweight residual query results that involve the temporary tables. In addition, we enhance our cache with other partial coverage techniques based on per-query semantic information such as sub-range queries for all queries that match a specific template.

We implement semantic query caching on top of an existing template-based cache with column-based invalidations. Our evaluation is based on a dynamic content site using the Apache web server with Tomcat Java servlets and the MySQL relational database. We use the industry-standard TPC-W e-commerce benchmark as our benchmark application. We conclude that augmenting transparent query caching with the ability to retrieve partial results from the cache improves performance substantially in terms of latency and to a lesser extent in terms of hit-rate and throughput.

Citation:
Gokul Soundararajan, Cristiana Amza, "Using Semantic Information to Improve Transparent Query Caching for Dynamic Content Web Sites," deec, pp.132-138, International Workshop on Data Engineering Issues in E-Commerce, 2005
Usage of this product signifies your acceptance of the Terms of Use.