DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/MCSE.2008.18
Using a database management system (DBMS) is essential to ensure the data integrity and reliability of large, multidimensional data sets. However, loading multiterabyte data into a DBMS is a time-consuming and error-prone task that the authors have tried to automate by developing the sqlLoader pipeline—a distributed workflow system for data loading. 1. A. Szalay, "The National Virtual Observatory," Proc. Conf. Astronomical Data Analysis Software and Systems (ADASS) X, vol. 238, F.R. Harnden Jr., F.A. Primini, and H.E. Payne, eds., Astronomical Soc. of the Pacific, 2001., p.3.
Index Terms:
Sloan Digital Sky Survey Science Archive, SDSS, astronomy, large-scale databases, database management systems
Citation:
Alex Szalay, Ani R. Thakar, Jim Gray, "The sqlLoader Data-Loading Pipeline," Computing in Science and Engineering, vol. 10, no. 1, pp. 38-48, Jan./Feb. 2008, doi:10.1109/MCSE.2008.18 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||