loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06)
Assessing Data Virtualization for Irregularly Replicated Large Datasets
Singapore
May 16-May 19
ISBN: 0-7695-2585-7
Bruno Diniz, Federal University of Minas Gerais, Brazil
Diego L. Nogueira, Federal University of Minas Gerais, Brazil
Andre Cardoso, Federal University of Minas Gerais, Brazil
Renato A. Ferreira, Federal University of Minas Gerais, Brazil
Dorgival Guedes, Federal University of Minas Gerais, Brazil
Wagner Meira Jr., Federal University of Minas Gerais, Brazil
Large volumes of data are generated every day by experiments, simulations and all sorts of applications. It is common to observe situations where portions of data are irregularly replicated and distributed in different data sources. It would be desirable to be able to handle these several pieces of irregular data (replicated or not) as a unique large dataset. This is called data virtualization and is the focus of this paper. In this paper, we present a system which is capable of dealing with irregularly replicated data and is able to create a virtual view of the union of the individual irregular portions of data hosted by each data source. Our system indexes the data intervals from each data source and allows clients to submit queries against the virtual dataset created. In order to select what server will be responsible for each data interval of a query, we use and compare three algorithms, namely Random, Round-Robin and Weighted Round-Robin. The comparison is driven by simulation and the parameters for the simulation are all taken from a real data-centered application (the Virtual Microscope).
Citation:
Bruno Diniz, Diego L. Nogueira, Andre Cardoso, Renato A. Ferreira, Dorgival Guedes, Wagner Meira Jr., "Assessing Data Virtualization for Irregularly Replicated Large Datasets," ccgrid, pp.505-512, Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.