2010 IEEE 26th International Conference on Data Engineering (ICDE 2010) (2010)
Long Beach, CA, USA
Mar. 1, 2010 to Mar. 6, 2010
Daniel Zinn, Department of Computer Science, University of California at Davis, 1 Shields Ave, USA
Shawn Bowers, Department of Computer Science, Gonzaga University, 502 East Boone Avenue, Spokane, WA, USA
Bertram Ludascher, Department of Computer Science, University of California at Davis, 1 Shields Ave, USA
Scientific workflows are increasingly used to integrate existing algorithms rapidly into larger and more complex programs. However, designing workflows with purely dataflow-oriented computation models introduces a number of challenges, including the need for low-level components that mediate and transform data (so-called shims) and for large numbers of additional “wires” that route data to components within a workflow. To address these problems, we employ Virtual Data Assembly Lines (VDAL), a modeling paradigm that can eliminate most shims and reduce wiring complexity. We show how a VDAL design can be implemented using existing XML technologies and how static analysis can provide significant help to scientists during workflow design and evolution, e.g., by displaying actor dependencies or by detecting so-called unproductive actors.
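The assembly-line idea and the two static checks named in the abstract can be illustrated with a small sketch. This is not the authors' VDAL implementation: it assumes, purely for illustration, that each actor reads elements with a single tag and appends its result as a sibling element, and that plain tag matching stands in for whatever XML selection mechanism the paper uses. The actor names, tags, and function names are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical actor descriptions (not from the paper): each actor reads
# elements with one tag and writes a new sibling element with another tag.
ACTORS = [
    {"name": "align", "reads": "seq", "writes": "alignment"},
    {"name": "tree", "reads": "alignment", "writes": "phylogeny"},
    {"name": "render", "reads": "image", "writes": "thumbnail"},
]


def run_pipeline(root, actors):
    """Pass the whole document through every actor in order.

    Each actor picks out only the elements it reads, so no shim or
    extra wire is needed to route data to it.
    """
    for actor in actors:
        # Snapshot parents and children so the tree is not mutated
        # while it is being iterated.
        for parent in list(root.iter()):
            for child in list(parent):
                if child.tag == actor["reads"]:
                    out = ET.SubElement(parent, actor["writes"])
                    out.text = f"{actor['name']}({child.text})"
    return root


def unproductive_actors(input_tags, actors):
    """Return actors that can never find input, judged from tags alone."""
    available = set(input_tags)
    idle = []
    for actor in actors:
        if actor["reads"] in available:
            available.add(actor["writes"])
        else:
            idle.append(actor["name"])
    return idle


def dependencies(actors):
    """Return (producer, consumer) pairs where a later actor reads
    what an earlier actor writes."""
    return [
        (a["name"], b["name"])
        for i, a in enumerate(actors)
        for b in actors[i + 1:]
        if b["reads"] == a["writes"]
    ]
```

Under these assumptions, running the three actors over a document containing a single `seq` element adds an `alignment` and a `phylogeny` element, while the tag-level check reports `render` as unproductive without executing anything.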
D. Zinn, S. Bowers and B. Ludascher, "XML-based computation for scientific workflows," 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010), Long Beach, CA, USA, 2010, pp. 812-815.