loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2004 International Conference on Parallel Processing (ICPP'04)
Adaptive Data Partition for Sorting Using Probability Distribution
Montreal, Quebec, Canada
August 15-August 18
ISBN: 0-7695-2197-5
Xipeng Shen, University of Rochester
Chen Ding, University of Rochester
Many computing problems benefit from dynamic partition of data into smaller chunks with better parallelism and locality. However, it is difficult to partition all types of inputs with the same high efficiency. This paper presents a new partition method in sorting scenario based on probability distribution, an idea first studied by Janus and Lamagna in early 1980?s on a mainframe computer. The new technique makes three improvements. The first is a rigorous sampling technique that ensures accurate estimate of the probability distribution. The second is an efficient implementation on modern, cache-based machines. The last is the use of probability distribution in parallel sorting. Experiments show 10-30% improvement in partition balance and 20-70% reduction in partition overhead, compared to two commonly used techniques. The new method reduces the parallel sorting time by 33-50% and outperforms the previous fastest sequential sorting technique by up to 30%.
Citation:
Xipeng Shen, Chen Ding, "Adaptive Data Partition for Sorting Using Probability Distribution," icpp, pp.250-257, 2004 International Conference on Parallel Processing (ICPP'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.