This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Many Task Computing for Real-Time Uncertainty Prediction and Data Assimilation in the Ocean
June 2011 (vol. 22 no. 6)
pp. 1012-1024
Constantinos Evangelinos, Massachusetts Institute of Technology, Cambridge
Pierre F.J. Lermusiaux, Massachusetts Institute of Technology, Cambridge
Jinshan Xu, Massachusetts Institute of Technology, Cambridge
Patrick J. Haley, Massachusetts Institute of Technology, Cambridge
Chris N. Hill, Massachusetts Institute of Technology, Cambridge
Uncertainty prediction for ocean and climate predictions is essential for multiple applications today. Many-Task Computing can play a significant role in making such predictions feasible. In this manuscript, we focus on ocean uncertainty prediction using the Error Subspace Statistical Estimation (ESSE) approach. In ESSE, uncertainties are represented by an error subspace of variable size. To predict these uncertainties, we perturb an initial state based on the initial error subspace and integrate the corresponding ensemble of initial conditions forward in time, including stochastic forcing during each simulation. The dominant error covariance (generated via SVD of the ensemble) is used for data assimilation. The resulting ocean fields are used as inputs for predictions of underwater sound propagation. ESSE is a classic case of Many Task Computing: It uses dynamic heterogeneous workflows and ESSE ensembles are data intensive applications. We first study the execution characteristics of a distributed ESSE workflow on a medium size dedicated cluster, examine in more detail the I/O patterns exhibited and throughputs achieved by its components as well as the overall ensemble performance seen in practice. We then study the performance/usability challenges of employing Amazon EC2 and the Teragrid to augment our ESSE ensembles and provide better solutions faster.

[1] I. Raicu, I. Foster, and Y. Zhao, “Many-Task Computing for Grids and Supercomputers,” Proc. IEEE Workshop Many-Task Computing on Grids and Supercomputers (MTAGS '08), 2008.
[2] C.L. Keppenne, “Data Assimilation into a Primitive-Equation Model with a Parallel Ensemble Kalman Filter,” Monthly Weather Rev., vol. 128, pp. 1971-1981, June 2000.
[3] C.L. Keppenne and M.M. Rienecker, “Initial Testing of a Massively Parallel Ensemble Kalman Filter with the Poseidon Isopycnal Ocean General Circulation Model,” Monthly Weather Rev., vol. 130, pp. 2951-2965, 2002.
[4] J. Anderson and N. Collins, “Scalable Implementations of Ensemble Filter Algorithms for Data Assimilation,” J. Atmospheric and Oceanic Technology, vol. 24, no. 8, pp. 1452-1463, Aug. 2007.
[5] P.J.F. Lermusiaux, “Uncertainty Estimation and Prediction for Interdisciplinary Ocean Dynamics,” J. Computational Physics, Special Issue on Uncertainty Quantification, vol. 217, pp. 176-199, 2006.
[6] P.F.J. Lermusiaux, “Adaptive Modeling, Adaptive Data Assimilation and Adaptive Sampling,” Physica D, Special Issue on Mathematical Issues and Challenges in Data Assimilation for Geophysical Systems: Interdisciplinary Perspectives, vol. 230, pp. 172-196, 2007.
[7] Y. Gu and R. Grossman, “Exploring Data Parallelism and Locality in Wide Area Networks,” Proc. First Workshop Many-Task Computing on Grids and Supercomputers, 2008.
[8] D. Abramson, B. Bethwaite, C. Enticott, S. Garic, and T. Peachley, “Robust Workflows for Science and Engineering,” Proc. Second Workshop Many-Task Computing on Grids and Supercomputers, 2009.
[9] E. Ogasawara, D. De Oliveira, F. Seabra, C. Barbosa, R. Elias, A. Coutinho, and M. Mattoso, “Exploring Many Task Computing in Scientific Workflows,” Proc. Second Workshop Many-Task Computing on Grids and Supercomputers, 2009.
[10] X.S. Liang and A.R. Robinson, “A Study of the Iceland-Faeroe Frontal Variability with the Multiscale Energy and Vorticity Analysis,” J. Physical Oceanography, vol. 34, pp. 2571-2591, 2004.
[11] X.S. Liang and D.G. Anderson, “Multiscale Window Transform,” SIAM J. Multiscale Modeling and Simulation, vol. 6, pp. 437-467, 2007.
[12] P.F.J. Lermusiaux, C.-S. Chiu, G.G. Gawarkiewicz, P. Abbot, A.R. Robinson, R.N. Miller, P.J. HaleyJr., W.G. Leslie, S.J. Majumdar, A. Pang, and F. Lekien, “Quantifying Uncertainties in Ocean Predictions,” Oceanography, Special Issue on Advances in Computational Oceanography, vol. 19, no. 1, pp. 92-105, 2006.
[13] P.F.J. Lermusiaux and C.-S. Chiu, “Four-Dimensional Data Assimilation for Coupled Physical-Acoustical Fields,” Acoustic Variability, N. Pace and F. Jensen, eds., pp. 417-424, Kluwer Academic Press, 2002.
[14] J. Xu, P.F.J. Lermusiaux, P.J. HaleyJr., W.G. Leslie, and O.G. Logutov, “Spatial and Temporal Variations in Acoustic Propagation during the Plusnet07 Exercise in Dabob Bay,” Proc. 155th Meeting on Acoustics (POMA), vol. 4, 2008.
[15] A.H. Jazwinski, Stochastic Processes and Filtering Theory. Academic Press, 1970.
[16] I.K., P. Courtier, M. Ghil, and A.C. Lorenc, “Unified Notation for Data Assimilation: Operational, Sequential and Variational,” J. Meteorological Soc. of Japan, vol. 75, no. 1B, pp. 181-189, 1997.
[17] Applied Optimal Estimation, A. Gelb, ed. MIT Press. 1974.
[18] C. Gardiner, Handbook of Stochastic Methods for Physics, Chemistry and the Natural Sciences. Springer-Verlag, 1983.
[19] P.F.J. Lermusiaux and A.R. Robinson, “Data Assimilation via Error Subspace Statistical Estimation, Part i: Theory and Schemes,” Monthly Weather Rev., vol. 127, no. 7, pp. 1385-1407, 1999.
[20] P.F.J. Lermusiaux, A. Robinson, P.J. HaleyJr., and W.G. Leslie, “Advanced Interdisciplinary Data Assimilation: Filtering and Smoothing via Error Subspace Statistical Estimation,” Proc. OCEANS '02 MTS/IEEE Conf., pp. 795-802, 2002.
[21] P.F.J. Lermusiaux, D.G.M. Anderson, and C.J. Lozano, “On the Mapping of Multivariate Geophysical Fields: Error and Variability Subspace Estimates,” Quarterly J. Royal Meteorological Soc., vol. 126, pp. 1387-1429, 2000.
[22] P.F.J. Lermusiaux, “On the Mapping of Multivariate Geophysical Fields: Sensitivity to Size, Scales and Dynamics,” J. Atmospheric and Oceanic Technology, vol. 19, pp. 1602-1637, 2002.
[23] R.A. Horn and C.R. Johnson, Matrix Analysis. Cambridge Univ. Press, 1985.
[24] R.A. Horn and C.R. Johnson, Topics in Matrix Analysis. Cambridge Univ. Press, 1985.
[25] W. Gentzsch, “Sun Grid Engine: Towards Creating a Compute Power Grid,” Proc. First IEEE Int'l Symp. Cluster Computing and the Grid (CCGrid), pp. 35-36, May 2001.
[26] D. Thain, T. Tannenbaum, and M. Linvy, “Distributed Computing in Practice: the Condor Experience,” Concurrency and Computation: Practice and Experience-Grid Performance, vol. 17, nos. 2-4, pp. 323-356, Feb. 2005.
[27] H.L. Jenter and R.P. Signell, “Netcdf: A Public-Domain-Software Solution to Data-Access Problems for Numerical Modelers,” 1992.
[28] J. Layton, “Strace Analyzer,” http://clusterbuffer.wetpaint.com/pageStrace+Analyzer , 2011.
[29] R. Ross and R. Latham, “Pvfs: A Parallel File System,” Proc. SC '06: ACM/IEEE Conf. Supercomputing, p. 34, 2006.
[30] G. Developers, “The Gluster Web Site,” http:/www.gluster.org, 2008.
[31] E. Walker, “A Distributed File System for a Wide-Area High Performance Computing Infrastructure,” Proc. Third Conf. USENIX Workshop Real, Large Distributed Systems, vol. 3. pp. 9++, 2006.
[32] M.E. Hoskins, “Sshfs: Super Easy File Access over Ssh,” Linux J., vol. 146, p. 4, June 2006.
[33] E. Walker, P.J. Gardner, V. Litvin, and E. Turner, “Personal Adaptive Clusters as Containers for Scientific Jobs,” Cluster Computing, vol. 10, no. 3, Sept. 2007.
[34] P.J. HaleyJr. and P.F.J. Lermusiaux, “Multiscale Two-Way Embedding Schemes for Free-Surface Primitive-Equations in the Multidisciplinary Simulation, Estimation and Assimilation System (MSEAS),” Ocean Dynamics, doi:10.1007/s10236-010-0349-4, vol. 60, pp. 1497-1537, 2010.
[35] M.P. Ueckermann and P.F.J. Lermusiaux, “High Order Schemes for 2D Unsteady Biogeochemical Ocean Models,” Ocean Dynamics, doi:10.1007/s10236-010-0351-x, vol. 60, pp. 1415-1445, 2010.
[36] C. Evangelinos, P.F.J. Lermusiaux, S. Geiger, R.C. Chang, and N.M. Patrikalakis, “Web-Enabled Configuration and Control of Legacy Codes: An Application to Ocean Modeling,” Ocean Modeling, vol. 13, pp. 197-220, 2006.
[37] P.F.J. Lermusiaux, P.J. HaleyJr., and N.K. Yilmaz, “Environmental Prediction, Path Planning and Adaptive Sampling: Sensing and Modeling for Efficient Ocean Monitoring, Management and Pollution Control,” Sea Technology, vol. 48, no. 9, pp. 35-38, 2007.
[38] K.D. Heaney, G. Gawarkiewicz, T.F. Duda, and P.F.J. Lermusiaux, “Non-Linear Optimization of Autonomous Undersea Vehicle Sampling Strategies for Oceanographic Data-Assimilation,” J. Field Robotics, Special Issue on Underwater Robotics, vol. 24, no. 6, pp. 437-448, 2007.
[39] N.K. Yilmaz, C. Evangelinos, P.F.J. Lermusiaux, and N.M. Patrikalakis, “Path Planning of Autonomous Underwater Vehicles for Adaptive Sampling Using Mixed Integer Linear Programming,” IEEE J. Oceanic Eng., vol. 33, no. 4, pp. 522-537, Oct. 2008.
[40] D. Wang, P.F.J. Lermusiaux, P.J. HaleyJr., D. Eickstedt, W.G. Leslie, and H. Schmidt, “Acoustically Focused Adaptive Sampling and On-Board Routing for Marine Rapid Environmental Assessment,” J. Marine Systems, Special Issue on Coastal Processes: Challenges for Monitoring and Prediction, vol. 78, pp. S393-S407, 2009.
[41] F.P. Lam, P.J. HaleyJr., J. Janmaat, P.F.J. Lermusiaux, W.G. Leslie, M.W. Schouten, L.A. te Raa, and M. Rixen, “At-Sea Real-Time Coupled Four-Dimensional Oceanographic and Acoustic Forecasts during Battlespace Preparation 2007,” J. Marine Systems, Special Issue on Coastal processes: Challenges for Monitoring and Prediction, vol. 78, pp. S306-S320, 2009.
[42] T.P. Sapsis and P.F.J. Lermusiaux, “Dynamically Orthogonal Field Equations for Continuous Stochastic Dynamical Systems,” Physica D, vol. 238, pp. 2347-2360, 2009.
[43] A. Gangopadhyay, A.R. Robinson, P.J. HaleyJr., W.G. Leslie, C.J. Lozano, J.J. Bisagni, and Y. Zhitao, “Feature-Oriented Regional Modeling and Simulations in the Gulf of Maine and Georges Bank,” Continental Shelf Research, vol. 23, nos. 3/4, pp. 317-353, 2003.

Index Terms:
MTC, assimilation, data-intensive, ensemble.
Citation:
Constantinos Evangelinos, Pierre F.J. Lermusiaux, Jinshan Xu, Patrick J. Haley, Chris N. Hill, "Many Task Computing for Real-Time Uncertainty Prediction and Data Assimilation in the Ocean," IEEE Transactions on Parallel and Distributed Systems, vol. 22, no. 6, pp. 1012-1024, June 2011, doi:10.1109/TPDS.2011.64
Usage of this product signifies your acceptance of the Terms of Use.