DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/MS.2012.73
This article describes lessons from examining a sample of several hundred support tickets for the Hadoop ecosystem, a widely used group of "big data" storage and processing systems. We give a taxonomy of errors and describe how supporters address them today. We show that misconfigurations are the dominant cause of failures, and we describe these misconfigurations in detail. Using these failure reports, we identify some of the design "anti-patterns" and missing platform features that contribute to the problems we observed. We offer advice to developers on how to build more robust distributed systems, and we advise users and administrators on how to avoid some of the rough edges we found.
Citation:
Ariel Rabkin, Randy Katz, "How Hadoop Clusters Break," IEEE Software, 07 June 2012. IEEE Computer Society Digital Library. IEEE Computer Society, <http://doi.ieeecomputersociety.org/10.1109/MS.2012.73>

