2016 Joint Conference of the International Workshop on Software Measurement and the International Conference on Software Process and Product Measurement (2016)
Oct. 5, 2016 to Oct. 7, 2016
High-quality data is a prerequisite for most types of analysis provided by software systems. However, since data quality does not come for free, it has to be assessed and managed continuously. The increasing quantity, diversity, and velocity that characterize big data today make these tasks even more challenging. We identified challenges that are specific for big data quality assessments with particular emphasis on their usage in smart ecosystems and make a proposal for a scalable cross-organizational approach that addresses these challenges. We developed an initial prototype to investigate scalability in a multi-node test environment using big data technologies. Based on the observed horizontal scalability behavior, there is an indication that the proposed approach also allows dealing with increasing volumes of heterogeneous data.
Big data, Quality assessment, Metadata, Instruments, Ecosystems, Companies
M. Klas, W. Putz and T. Lutz, "Quality Evaluation for Big Data: A Scalable Assessment Approach and First Evaluation Results," 2016 Joint Conference of the International Workshop on Software Measurement and the International Conference on Software Process and Product Measurement(IWSM Mensura), Berlin, Germany, 2016, pp. 115-124.