VisWeek Capstone Address
Found in: IEEE Transactions on Visualization and Computer Graphics
By Alexander S. Szalay
Issue Date:November 2010
pp. xxv-xxvi
No summary available.
   
Just-in-Time Analytics on Large File Systems
Found in: IEEE Transactions on Computers
By H. Howie Huang, Nan Zhang, Wei Wang, Gautam Das, Alexander S. Szalay
Issue Date:November 2012
pp. 1651-1664
As file systems reach the petabytes scale, users and administrators are increasingly interested in acquiring high-level analytical information for file management and analysis. Two particularly important tasks are the processing of aggregate and top-k quer...
 
GrayWulf: Scalable Software Architecture for Data Intensive Computing
Found in: Hawaii International Conference on System Sciences
By Yogesh Simmhan, Roger Barga, Catharine van Ingen, Maria Nieto-Santisteban, Laszlo Dobos, Nolan Li, Michael Shipway, Alexander S. Szalay, Sue Werner, Jim Heasley
Issue Date:January 2009
pp. 1-10
Big data presents new challenges to both cluster infrastructure software and parallel application design. We present a set of software services and design principles for data intensive computing with petabyte data sets, named GrayWulf. These services are i...
   
The Claremont report on database research
Found in: Communications of the ACM
By Alexander S. Szalay, Alon Y. Halevy, Anastasia Ailamaki, Anhai Doan, Beng Chin Ooi, Daniela Florescu, Donald Kossmann, Eric A. Brewer, Gerhard Weikum, Hank F. Korth, Hector Garcia-Molina, Johannes Gehrke, Joseph M. Hellerstein, Laura M. Haas, Le Gruenwald, Michael J. Carey, Michael J. Franklin, Michael Stonebraker, Philip A. Bernstein, Raghu Ramakrishnan, Rakesh Agrawal, Roger Magoulas, Samuel Madden, Sunita Sarawagi, Surajit Chaudhuri, Tim O'Reilly, Yannis E. Ioannidis
Issue Date:June 2009
pp. 101-104
Database research is expanding, with major efforts in system architecture, new languages, cloud services, mobile and virtual worlds, and interplay between structure and text.
     
SkyQuery: Federating Astronomy Archives
Found in: Computing in Science & Engineering
By Tamas Budavari, Laszlo Dobos, Alexander S. Szalay
Issue Date:May 2013
pp. 12-20
Astronomical discoveries often happen at the edge of our observational capabilities. To fully analyze telescopic images, researchers must combine data from separate telescopes, but large volumes of data with intrinsic differences make this difficult. SkyQu...
 
GrayWulf: Scalable Clustered Architecture for Data Intensive Computing
Found in: Hawaii International Conference on System Sciences
By Alexander S. Szalay, Gordon Bell, Jan Vandenberg, Alainna Wonders, Randal Burns, Dan Fay, Jim Heasley, Tony Hey, Maria Nieto-Santisteban, Ani Thakar, Catharine van Ingen, Richard Wilton
Issue Date:January 2009
pp. 1-10
Data intensive computing presents a significant challenge for traditional supercomputing architectures that maximize FLOPS since CPU speed has surpassed IO capabilities of HPC systems and BeoWulf clusters. We present the architecture for a three tier commo...
   
The Sloan Digital Sky Survey
Found in: Computing in Science and Engineering
By Alexander S. Szalay
Issue Date:March 1999
pp. 54-62
Astronomy is about to undergo a major paradigm shift, with data sets becoming larger and more homogeneous, designed for the first time in a top-down fashion. In a few years, it might be much easier for astronomers to...
 
Inverted indices for particle tracking in petascale cosmological simulations
Found in: Proceedings of the 25th International Conference on Scientific and Statistical Database Management (SSDBM)
By Alexander S. Szalay, Bridget Falck, Daniel Crankshaw, Jie Wang, Randal Burns, Tamás Budavári
Issue Date:July 2013
pp. 1-10
We describe the challenges arising from tracking dark matter particles in state of the art cosmological simulations. We are in the process of running the Indra suite of simulations, with an aggregate count of more than 35 trillion particles and 1.1PB of to...
     
Graywulf: a platform for federated scientific databases and services
Found in: Proceedings of the 25th International Conference on Scientific and Statistical Database Management (SSDBM)
By Alexander S. Szalay, István Csabai, László Dobos, Nolan Li, Tamás Budavári
Issue Date:July 2013
pp. 1-12
Many fields of science rely on relational database management systems to analyze, publish and share data. Since RDBMS are originally designed for, and their development directions are primarily driven by, business use cases they often lack features very im...
     
Toward millions of file system IOPS on low-cost, commodity hardware
Found in: Proceedings of SC13: International Conference for High Performance Computing, Networking, Storage and Analysis (SC '13)
By Alexander S. Szalay, Da Zheng, Randal Burns
Issue Date:November 2013
pp. 1-12
We describe a storage system that removes I/O bottlenecks to achieve more than one million IOPS based on a userspace file abstraction for arrays of commodity SSDs. The file abstraction refactors I/O scheduling and placement for extreme parallelism and non-...
     
The open connectome project data cluster: scalable analysis and vision for high-throughput neuroscience
Found in: Proceedings of the 25th International Conference on Scientific and Statistical Database Management (SSDBM)
By Alexander S. Szalay, Daniel R. Berger, Davi D. Bock, Dean Kleissas, Eric Perlman, Jeff Lichtman, Joshua T. Vogelstein, Karl Deisseroth, Kunal Lillaney, Kwanghun Chung, Logan Grosenick, Michael Kazhdan, Narayanan Kasthuri, Nicholas C. Weiler, Priya Manavalan, R. Clay Reid, R. Jacob Vogelstein, Randal Burns, Stephen J. Smith, William Gray Roncal
Issue Date:July 2013
pp. 1-11
We describe a scalable database cluster for the spatial analysis and annotation of high-throughput brain imaging data, initially for 3-d electron microscopy image stacks, but for time-series and multi-channel data as well. The system was designed primarily...
     
Jim Gray, astronomer
Found in: Communications of the ACM
By Alexander S. Szalay
Issue Date:November 2008
pp. 486-487
How he helped develop the SkyServer, delivering computation directly to terabytes of astronomical data.
     
Scientific publishing in the era of petabyte data
Found in: Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries (JCDL '08)
By Alexander S. Szalay
Issue Date:June 2008
pp. 597-617
Today's scientific datasets are growing into Petabytes. A similar transition is happening in industry and society. Web search companies have to deal routinely with tens of Petabytes, a substantial fraction of the world's computers go into data warehouses o...
     
The SDSS SkyServer: public access to the Sloan Digital Sky Server data
Found in: Proceedings of the 2002 ACM SIGMOD international conference on Management of data (SIGMOD '02)
By Alexander S. Szalay, Ani R. Thakar, Christopher Stoughton, Jan vandenBerg, Jim Gray, Jordan Raddick, Peter Z. Kunszt, Tanu Malik
Issue Date:June 2002
pp. 570-581
The SkyServer provides Internet access to the public Sloan Digital Sky Survey (SDSS) data for both astronomers and for science education. This paper describes the SkyServer goals and architecture. It also describes our experience operating the SkyServer on...
     
Designing and mining multi-terabyte astronomy archives: the Sloan Digital Sky Survey
Found in: Proceedings of the 2000 ACM SIGMOD international conference on Management of data (SIGMOD '00)
By Alexander S. Szalay, Ani Thakar, Don Slutz, Jim Gray, Peter Z. Kunszt, Robert J. Brunner
Issue Date:May 2000
pp. 245-253
The next-generation astronomy digital archives will cover most of the sky at fine resolution in many wavelengths, from X-rays, through ultraviolet, optical, and infrared. The archives will be stored at diverse geographical locations. One of the first of th...
     