Distributing the Sloan Digital Sky Survey Using UDT and Sector
Found in: e-Science and Grid Computing, International Conference on
By Yunhong Gu, Robert L. Grossman, Alex Szalay, Ani Thakar
Issue Date: December 2006
pp. 56
In this paper, we describe a peer-to-peer storage system called Sector that is designed to access and transport large data sets over wide area high performance networks. We also describe our recent experience using Sector to distribute the Sloan Digital Sk...
Batch is Back: CasJobs, Serving Multi-TB Data on the Web
Found in: Web Services, IEEE International Conference on
By William O'Mullane, Nolan Li, María Nieto-Santisteban, Alex Szalay, Ani Thakar
Issue Date: July 2005
pp. 33-40
The Sloan Digital Sky Survey (SDSS) science database describes over 230 million objects and is over 1.6 TB in size. The SDSS Catalog Archive Server (CAS) provides several levels of query interface to the SDSS data via the SkyServer website. Most queries ex...
Migrating a Multiterabyte Archive from Object to Relational Databases
Found in: Computing in Science and Engineering
By Ani Thakar, Alex Szalay, Peter Kunszt, Jim Gray
Issue Date: September 2003
pp. 16-29
A commercial, object-oriented database engine with custom tools for data-mining the multiterabyte Sloan Digital Sky Survey archive did not meet its performance objectives. We describe the problems, technical issues, and process of migrating this l...
GrayWulf: Scalable Clustered Architecture for Data Intensive Computing
Found in: Hawaii International Conference on System Sciences
By Alexander S. Szalay, Gordon Bell, Jan Vandenberg, Alainna Wonders, Randal Burns, Dan Fay, Jim Heasley, Tony Hey, Maria Nieto-Santisteban, Ani Thakar, Catharine van Ingen, Richard Wilton
Issue Date: January 2009
pp. 1-10
Data intensive computing presents a significant challenge for traditional supercomputing architectures that maximize FLOPS since CPU speed has surpassed IO capabilities of HPC systems and BeoWulf clusters. We present the architecture for a three tier commo...
Ten Years of SkyServer I: Tracking Web and SQL e-Science Usage
Found in: Computing in Science & Engineering
By M. Raddick, Ani Thakar, Rafael Santos, Alexander S. Szalay
Issue Date: March 2014
pp. 1
We describe how we collect and analyze the complete record of more than ten years of web hits and SQL queries to SkyServer, the primary catalog data portal of the Sloan Digital Sky Survey (SDSS) that makes multiple Terabytes of astronomy data available to ...
Ten Years of SkyServer II: How Astronomers and the Public have Embraced e-Science
Found in: Computing in Science & Engineering
By M. Raddick, Rafael Santos, Ani Thakar, Alexander S. Szalay
Issue Date: March 2014
pp. 1
A comprehensive analysis of ten years of Web and SQL traffic on SkyServer, the online portal to the multi-terabyte Sloan Digital Sky Survey (SDSS) archive, shows the impressive reach of the SDSS to the research community and the public, and provides insig...
Migrating a (large) science database to the cloud
Found in: Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing (HPDC '10)
By Alex Szalay, Ani Thakar
Issue Date: June 2010
pp. 430-434
We report on attempts to put an existing scientific (astronomical) database, the Sloan Digital Sky Survey (SDSS) science archive [1], in the cloud. Based on our experience, it is either very frustrating or impossible at this time to migrate an existing,...
Designing and mining multi-terabyte astronomy archives: the Sloan Digital Sky Survey
Found in: Proceedings of the 2000 ACM SIGMOD international conference on Management of data (SIGMOD '00)
By Alexander S. Szalay, Ani Thakar, Don Slutz, Jim Gray, Peter Z. Kunszt, Robert J. Brunner
Issue Date: May 2000
pp. 245-253
The next-generation astronomy digital archives will cover most of the sky at fine resolution in many wavelengths, from X-rays, through ultraviolet, optical, and infrared. The archives will be stored at diverse geographical locations. One of the first of th...