loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
19th International Conference on Scientific and Statistical Database Management (SSDBM 2007)
MAMCost: Global and Local Estimates leading to Robust Cost Estimation of Similarity Queries
Banff, Alberta, Canada
July 09-July 11
ISBN: 0-7695-2868-6
Gisele Busichia Baioco, University of Sao Paulo at S. Carlos, Brazil
Agma J. M. Traina, University of Sao Paulo at S. Carlos, Brazil
Caetano Traina Jr., University of Sao Paulo at S. Carlos, Brazil
This paper presents an effective cost model to estimate the number of disk accesses (I/O cost) and the number of distance calculations (CPU cost) to process similarity queries over data indexed by metric access methods. Two types of similarity queries were taken into consideration: range and k-nearest neighbor queries. The main point of the cost model is considering not only global parameters of the data set but also the local data distribution. The model takes advantage of the intrinsic dimension of the data set, estimated by its correlation fractal dimension. Experiments were performed on real and synthetic data sets, with different sizes and dimensions, in order to validate the proposed model. They confirmed that the estimations are accurate, within the range achieved by real queries.
Citation:
Gisele Busichia Baioco, Agma J. M. Traina, Caetano Traina Jr., "MAMCost: Global and Local Estimates leading to Robust Cost Estimation of Similarity Queries," ssdbm, pp.6, 19th International Conference on Scientific and Statistical Database Management (SSDBM 2007), 2007
Usage of this product signifies your acceptance of the Terms of Use.