Parallel and Distributed Processing Symposium, International (2001)
San Francisco, California, USA
Apr. 23, 2001 to Apr. 27, 2001
Many scientific applications are I/O intensive and generate large data sets, spanning hundreds or thousands of "files." Management, storage, efficient access, and analysis of this data present an extremely challenging task. We have developed a software system, called Scientific Data Manager (SDM), that uses a combination of parallel file I/O and database support for high-performance scientific data management. SDM provides a high-level API to the user and, internally, uses a parallel file system to store real data and a database to store application-related metadata. In this paper, we describe how we designed and implemented SDM to support irregular applications. SDM can efficiently handle the reading and writing of data in an irregular mesh, as well as the distribution of index values. We describe the SDM user interface and how we have implemented it to achieve high performance. SDM makes extensive use of MPI-IO's noncontiguous collective I/O functions. SDM also uses the concept of a history file to optimize the cost of the index distribution using the metadata stored in the database. We present performance results with two irregular applications, a CFD code called FUN3D and a Rayleigh-Taylor instability code, on the SGI Origin2000 at Argonne National Laboratory.
Jaechun No, Rajeev Thakur, Dinesh Kaushik, Lori Freitag, Alok Choudhary, "A Scientific Data Management System for Irregular Applications", Parallel and Distributed Processing Symposium, International, vol. 03, pp. 30120a, 2001, doi:10.1109/IPDPS.2001.925096