This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Extreme Data-Intensive Scientific Computing
Nov.-Dec. 2011 (vol. 13 no. 6)
pp. 34-41
Alex Szalay, The Johns Hopkins University

Scientific computing increasingly involves massive data; in astronomy, observations and numerical simulations are on the verge of generating petabytes. This new, data-centric computing requires a new look at computing architectures and strategies. Using Amdahl's law to characterize architectures and workloads, it's possible to use existing commodity parts to build systems that approach an ideal Amdahl machine.

1. A.S. Szalay and J. Gray, "The World-Wide Telescope," Science, vol. 293, no. 5537, 2011, pp. 2037–2040.
2. G. Bell, A. Hey, and A.S. Szalay, "Beyond the Data Deluge," Science, vol. 323, no. 5919, 2009, pp. 1297–1298.
3. A.R. Thakar et al., "The Catalog Archive Server Database Management System," Computing in Science & Eng., vol. 10, no. 1, 2008, pp. 30–37.
4. A.S. Szalay et al., "Low-Power Amdahl-Balanced Blades for Data Intensive Computing," SIGOPS Operating Systems Rev., vol. 44, no. 1, 2010, pp. 71–75.
5. V. Springel et al., "Simulations of the Formation, Evolution and Clustering of Galaxies and Quasars," Nature, vol. 435, 2005, pp. 629–636.
6. G. Lemson and the Virgo Consortium, "Halo and Galaxy Formation Histories from the Millennium Simulation: Public Release of a VO-Oriented Database," arxiv.org, 2006; arXiv:astro-ph/0608019v2.
7. Y. Li et al., "A Public Turbulence Database Cluster and Applications to Study Lagrangian Evolution of Velocity Increments in Turbulence," J. Turbulence, vol. 9, no. 31, 2008, pp. 1–29.
8. V. Singh et al., SkyServer Traffic Report—The First Five Years, tech. report, MSR-TR-2006-190, Microsoft Research, 2006.
9. A.S. Szalay and J. Gray, "Science in an Exponential World," Nature, vol. 440, 2006, pp. 23–24.
10. G.M. Amdahl, "Computer Architecture and Amdahl's Law," IEEE Solid State Circuits Society News, vol. 12, no. 3, 2007, pp. 4–9.
11. G. Bell, J. Gray, and A.S. Szalay, "Petascale Computational Systems: Balanced Cyber-Infrastructure in a Data-Centric World," Computer, vol. 39, no. 1, 2006, pp. 110–113.
12. A.S. Szalay et al., "GrayWulf: Scalable Clustered Architecture for Data Intensive Computing," Proc. Hawaii Int'l Conf. System Sciences, IEEE Press, 2009, pp. 1–10.
13. J. Dean and S. Ghemawat, "MapReduce: Simplified Data Processing on Large Clusters," Proc. 6th Symp. Operating System Design and Implementations, Usenix Assoc., 2004, pp. 137–150.
14. M. Stonebraker et al., "Requirements for Science Databases and SCIDB," Proc. Conf. Innovative Data Systems Research, 2009; www.cidrdb.org/cidr2009cidr2009.zip.

Index Terms:
Scientific computing, massive datasets, astronomy, Amdahl machine, simulations
Citation:
Alex Szalay, "Extreme Data-Intensive Scientific Computing," Computing in Science and Engineering, vol. 13, no. 6, pp. 34-41, Nov.-Dec. 2011, doi:10.1109/MCSE.2011.74
Usage of this product signifies your acceptance of the Terms of Use.