loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2006 First International Multi-Symposiums on Computer and Computational Sciences
On The Even-Out Effect of Probabilistic Sampling
Hangzhou, Zhejiang, China
June 20-June 24
ISBN: 0-7695-2581-4
Ziqian Liu, Beijing Jiaotong University, China
Changjia Chen, Beijing Jiaotong University, China
Sampling is widely used in social investigations and network measurements since it can significantly reduce the expense of data storage and processing. However, sampling will inevitably miss or even distort the original data characteristics to some extent. This paper studies the effect of probabilistic sampling on a set of data with unbalanced size distribution. We introduce the Lorenz curve, widely used in economics, associated with the crossover split, a recently proposed quantifier, to measure the deviation of size distribution before and after sampling. By using simulation and real Internet data, we observe that as the sampling probability decreases, the size distribution becomes less unbalanced. We call this phenomenon the even-out effect. The relations among the probability sampling, the crossover split and Pareto distribution are also revealed.
Citation:
Ziqian Liu, Changjia Chen, "On The Even-Out Effect of Probabilistic Sampling," imsccs, vol. 2, pp.692-698, 2006 First International Multi-Symposiums on Computer and Computational Sciences, 2006
Usage of this product signifies your acceptance of the Terms of Use.