The Community for Technology Leaders
International Symposium on Parallel and Distributed Processing with Applications (2011)
Busan, Korea
May 26, 2011 to May 28, 2011
ISBN: 978-0-7695-4428-1
pp: 329-334
ABSTRACT
Big data analysis is a main challenge we meet recently. Cloud computing is attracting more and more big data analysis applications, due to its well scalability and fault-tolerance. Some aggregation functions, like SUM, can be computed in parallel, because they satisfy distributive law of addition. Unfortunately, some of statistical functions are not naturally parallelizable. That means they do not satisfy distributive law of addition. In this paper, we focus on percentile computing problem. We proposed an iterative-style prediction-based parallel algorithm in a distributed system. Prediction is done through a sampling technique. Experiment results verify the efficiency of our algorithm.
INDEX TERMS
Hierarchical Encoding, Percentile, Iterative
CITATION
Shan Wang, Xiongpai Qin, Xiaoyong Du, Huiju Wang, "Parallel Aggregation Queries over Star Schema: A Hierarchical Encoding Scheme and Efficient Percentile Computing as a Case", International Symposium on Parallel and Distributed Processing with Applications, vol. 00, no. , pp. 329-334, 2011, doi:10.1109/ISPA.2011.34
120 ms
(Ver 3.3 (11022016))