Utility and Cloud Computing, IEEE Internatonal Conference on (2011)
Melbourne, Victoria Australia
Dec. 5, 2011 to Dec. 8, 2011
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/UCC.2011.13
We describe the design of a high-throughput storage system, Galileo, for data streams generated in observational settings. The shared-nothing architecture in Galileo supports incremental assimilation of nodes, while accounting for heterogeneity in their capabilities, to cope with data volumes. To achieve efficient storage and retrievals of data, Galileo accounts for the geospatial and chronological characteristics of such time-series observational data streams. Our benchmarks demonstrate that Galileo supports high-throughput storage and efficient retrievals of specific portions of large datasets while supporting different types of queries.
data storage, commodity clusters, distributed systems, scale-out architectures, observational streams, query evaluations
S. Pallickara, M. Malensek and S. L. Pallickara, "Galileo: A Framework for Distributed Storage of High-Throughput Data Streams," 2011 IEEE 4th International Conference on Utility and Cloud Computing (UCC 2011)(UCC), Victoria, NSW, 2011, pp. 17-24.