Ian Gorton , CMU, Pittsburgh
John Klein , Software Engineering Institute, Pittsburgh
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/MS.2014.51
Exponential data growth from the Internet, low cost sensors, and high fidelity instruments has fueled the development of advanced analytics operating on vast data repositories. These analytics bring business benefits ranging from web content personalization to predictive maintenance of aircraft components. To construct the data repositories that underpin these systems, there has been rapid innovation in distributed data management technologies, employing schema-less data models and relaxing consistency guarantees to satisfy scalability and availability requirements. This paper describes the challenges of these "big data" systems that confront software architects. We show how distributed software architecture quality attributes are tightly linked to the both the data and deployment architectures. This causes a consolidation of concerns, and designs must be closely harmonized across these three structures to satisfy quality requirements.
Ian Gorton, John Klein, "Distribution, Data, Deployment: Software Architecture Convergence in Big Data Systems", IEEE Software, , no. 1, pp. 1, PrePrints PrePrints, doi:10.1109/MS.2014.51