The Community for Technology Leaders
Green Image
Issue No. 01 - January/February (2008 vol. 10)
ISSN: 1521-9615
pp: 38-48
Ani R. Thakar , Johns Hopkins University
Jim Gray , Microsoft Research
Alex Szalay , Johns Hopkins University
ABSTRACT
Using a database management system (DBMS) is essential to ensure the data integrity and reliability of large, multidimensional data sets. However, loading multiterabyte data into a DBMS is a time-consuming and error-prone task that the authors have tried to automate by developing the sqlLoader pipeline—a distributed workflow system for data loading.
INDEX TERMS
Sloan Digital Sky Survey Science Archive, SDSS, astronomy, large-scale databases, database management systems
CITATION
Ani R. Thakar, Jim Gray, Alex Szalay, "The sqlLoader Data-Loading Pipeline", Computing in Science & Engineering, vol. 10, no. , pp. 38-48, January/February 2008, doi:10.1109/MCSE.2008.18
173 ms
(Ver 3.3 (11022016))