loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
13th International Conference on Data Engineering (ICDE'97)
Index selection for OLAP
University of Birmingham, Birmingham, U.K.
April 07-April 11
ISBN: 0-8186-7807-0
H. Gupta, Dept. of Comput. Sci., Stanford Univ., CA, USA
V. Harinarayan, Dept. of Comput. Sci., Stanford Univ., CA, USA
A. Rajaraman, Dept. of Comput. Sci., Stanford Univ., CA, USA
J.D. Ullman, Dept. of Comput. Sci., Stanford Univ., CA, USA
On-line analytical processing (OLAP) is a recent and important application of database systems. Typically, OLAP data is presented as a multidimensional "data cube." OLAP queries are complex and can take many hours or even days to run, if executed directly on the raw data. The most common method of reducing execution time is to precompute some of the queries into summary tables (subcubes of the data cube) and then to build indexes on these summary tables. In most commercial OLAP systems today, the summary tables that are to be precomputed are picked first, followed by the selection of the appropriate indexes on them. A trial-and-error approach is used to divide the space available between the summary tables and the indexes. This two-step process can perform very poorly. Since both summary tables and indexes consume the same resource-space-their selection should be done together for the most efficient use of space. The authors give algorithms that automate the selection of summary tables and indexes. In particular, they present a family of algorithms of increasing time complexities, and prove strong performance bounds for them. The algorithms with higher complexities have better performance bounds. However, the increase in the performance bound is diminishing, and they show that an algorithm of moderate complexity can perform fairly close to the optimal.
Index Terms:
decision support systems; on-line analytical processing; OLAP; database systems; multidimensional data cube; OLAP queries; execution time reduction; query precomputation; summary tables; trial-and-error approach; efficient space use; algorithms; automated summary table selection; automated index selection; time complexity; performance bounds
Citation:
H. Gupta, V. Harinarayan, A. Rajaraman, J.D. Ullman, "Index selection for OLAP," icde, pp.208, 13th International Conference on Data Engineering (ICDE'97), 1997
Usage of this product signifies your acceptance of the Terms of Use.