This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
19th International Conference on Data Engineering (ICDE'03)
Automating Layout of Relational Databases
Bangalore, India
March 05-March 08
ISBN: 0-7803-7665-X
Sanjay Agrawal, Microsoft Research
Surajit Chaudhuri, Microsoft Research
Abhinandan Das, Cornell University
Vivek Narasayya, Microsoft Research
The choice of database layout, i.e., how database objects such as tables and indexes are assigned to disk drives can significantly impact the I/O performance of the system. Today, DBAs typically rely on fully striping objects across all available disk drives as the basic mechanism for optimizing I/O performance. While full striping maximizes I/O parallelism, when query execution involves co-access of two or more large objects, e.g., a merge join of two tables, the above strategy may be sub-optimal due to the increased number of random I/O accesses on each disk drive. In this paper, we propose a framework for automating the choice of database layout for a given database that also takes into account the effects of co-accessed objects in the workload faced by the system. We formulate the above as an optimization problem and present an efficient solution to the problem that judiciously takes into account the trade-off between I/O parallelism and random I/O accesses. Our experiments on Microsoft SQL Server show the superior I/O performance of our techniques compared to the traditional approach of fully striping each database object across all disk drives.
Citation:
Sanjay Agrawal, Surajit Chaudhuri, Abhinandan Das, Vivek Narasayya, "Automating Layout of Relational Databases," icde, pp.607, 19th International Conference on Data Engineering (ICDE'03), 2003
Usage of this product signifies your acceptance of the Terms of Use.