This Article 
 Bibliographic References 
 Add to: 
The sqlLoader Data-Loading Pipeline
January/February 2008 (vol. 10 no. 1)
pp. 38-48
Alex Szalay, Johns Hopkins University
Ani R. Thakar, Johns Hopkins University
Jim Gray, Microsoft Research
Using a database management system (DBMS) is essential to ensure the data integrity and reliability of large, multidimensional data sets. However, loading multiterabyte data into a DBMS is a time-consuming and error-prone task that the authors have tried to automate by developing the sqlLoader pipeline—a distributed workflow system for data loading.

1. A. Szalay, "The National Virtual Observatory," Proc. Conf. Astronomical Data Analysis Software and Systems (ADASS) X, vol. 238, F.R. Harnden Jr., F.A. Primini, and H.E. Payne, eds., Astronomical Soc. of the Pacific, 2001., p.3.
2. A. Thakar, S. Szalay, and J. Gray, "From FITS to SQL - Loading and Publishing the SDSS Data," Astronomical Data Analysis Software and Systems (ADASS) XIII, vol. 314, F. Ochsenbein, M.G. Allen, and D. Egret, eds., Astronomical Soc. of the Pacific, 2004., p. 38.

Index Terms:
Sloan Digital Sky Survey Science Archive, SDSS, astronomy, large-scale databases, database management systems
Alex Szalay, Ani R. Thakar, Jim Gray, "The sqlLoader Data-Loading Pipeline," Computing in Science and Engineering, vol. 10, no. 1, pp. 38-48, Jan.-Feb. 2008, doi:10.1109/MCSE.2008.18
Usage of this product signifies your acceptance of the Terms of Use.