Issue No. 01 - January/February (2008 vol. 10)
pp: 38-48
Ani R. Thakar, Johns Hopkins University
Alex Szalay, Johns Hopkins University
ABSTRACT
Using a database management system (DBMS) is essential to ensure the data integrity and reliability of large, multidimensional data sets. However, loading multiterabyte data into a DBMS is a time-consuming and error-prone task that the authors have tried to automate by developing the sqlLoader pipeline—a distributed workflow system for data loading.
INDEX TERMS
Sloan Digital Sky Survey Science Archive, SDSS, astronomy, large-scale databases, database management systems
CITATION
Ani R. Thakar, Alex Szalay, "The sqlLoader Data-Loading Pipeline", Computing in Science & Engineering, vol. 10, no. 1, pp. 38-48, January/February 2008, doi:10.1109/MCSE.2008.18