The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.03 - May-June (2013 vol.15)
pp: 54-62
Michael Stonebraker , Massachusetts Institute of Technology
Paul Brown , Paradigm4
Donghui Zhang , Paradigm4
Jacek Becla , SLAC National Accelerator Laboratory
ABSTRACT
A description and discussion of the SciDB database management system focuses on lessons learned, application areas, performance comparisons against other solutions, and additional approaches to managing data and complex analytics.
INDEX TERMS
Arrays, Data models, Parallel processing, File systems, Database languages, Analytical models, Large Hadron Collider, scientific computing, scientific databases, array databases, complex analytics, parallel databases, large-scale matrix operations
CITATION
Michael Stonebraker, Paul Brown, Donghui Zhang, Jacek Becla, "SciDB: A Database Management System for Applications with Complex Analytics", Computing in Science & Engineering, vol.15, no. 3, pp. 54-62, May-June 2013, doi:10.1109/MCSE.2013.19
REFERENCES
1. J. Becla and K.-T. Lim, “Report from the First Workshop on Extremely Large Databases,” Data Science J., vol. 7, 2008, pp. 1–13.
2. M. Stonebraker et al., “The Architecture of SciDB,” Proc. Scientific and Statistical Data Management Conf., Springer-Verlag, 2011, pp. 1–16.
3. A.S. Szalay et al., “Designing and Mining Multi-Terabyte Astronomy Archives: The Sloan Digital Sky Survey,” Proc. Sigmod Conf., ACM, 2000, pp. 451–462.
4. A. Pavlo and et al., “A Comparison of Approaches to Large-scale Data Analysis,” Proc. Sigmod Conf., ACM, 2009, pp. 165–178.
5. P.A. Boncz, S. Manegold, and M.L. Kersten, “Database Architecture Evolution: Mammals Flourished Long Before Dinosaurs Became Extinct,” Proc. Conf. Very Large Data Bases (VLDB), VLDB Endowment, 2009, pp. 1648–1653.
6. J. Duggan, “Compression and Execution in SciDB,” in preparation.
7. J. Dongarra et al., “A Composite Data Management and Linear Algebra Benchmark,” in preparation.
8. M. Stonebraker:, “The Design of the Postgres Storage System,” Proc. Conf. Very Large Data Bases, Morgan Kaufmann, 1987, pp. 289–300.
9. P. Leyshock, “Agrios: A Hybrid Approach to Scalable Data Analysis Systems,” Extremely Large Data Base Workshop, presentation, 2012; www-conf.slac.stanford.edu/xldb2012/talks xldb2012_tue_LT06_Leyshock.pdf.
10. G. Planthaber et al., “EarthDB: Scalable Analysis of MODIS Data Using SciDB,” Proc. 1st ACM SIGSPATIAL Int'l Workshop on Analytics for Big Geospatial Data, ACM, 2012, pp. 11–19.
11. A.V. Mironov et al., “The Multicolor 'Lyra' Photometric System for Variable stars and Halo Studies,” 2010; http://arxiv.org/abs1002.4644.
12. A. Bhattacharjee et al., “Classification of Human Lung Carcinomas by mRNA Expression Profiling Reveals Distinct Adenocarcinoma Subclasses,” supplementary material, Proc. National Academy Science, vol. 98, no. 24, 2001, pp. 13790–13795; www.pnas.org/content/98/24/13790/supplDC1 .
13. E. Soroush et al., “ArrayStore: A Storage Manager for Complex Parallel Array Processing,” Proc. Sigmod Conf., ACM, 2011, pp. 253–264.
14. P. Cudre-Mauroux et al., “SS-DB: A Standard Science DBMS Benchmark,” submitted for publication.
15. A. Seering et al., “Efficient Versioning for Scientific Array Databases,” Proc. Int'l Conf. Data Eng., IEEE, 2012, pp. 1013–1024.
16. E. Wu et al., “A Demonstration of DBWipes: Clean as You Query,” Proc. Conf. Very Large Data Bases, VLDB Endowment, 2012, pp. 1894–1897.
37 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool