The Community for Technology Leaders
Green Image
Issue No. 01 - January/February (2008 vol. 10)
ISSN: 1521-9615
pp: 38-48
Alex Szalay , Johns Hopkins University
Ani R. Thakar , Johns Hopkins University
Jim Gray , Microsoft Research
Using a database management system (DBMS) is essential to ensure the data integrity and reliability of large, multidimensional data sets. However, loading multiterabyte data into a DBMS is a time-consuming and error-prone task that the authors have tried to automate by developing the sqlLoader pipeline—a distributed workflow system for data loading.
Sloan Digital Sky Survey Science Archive, SDSS, astronomy, large-scale databases, database management systems

A. R. Thakar, J. Gray and A. Szalay, "The sqlLoader Data-Loading Pipeline," in Computing in Science & Engineering, vol. 10, no. , pp. 38-48, 2008.
86 ms
(Ver 3.3 (11022016))