A Reference Architecture for Scientific Workflow Management Systems and the VIEW SOA Solution
January-March 2009 (vol. 2 no. 1)
pp. 79-92
Cui Lin, Wayne State University, Detroit
Shiyong Lu, Wayne State University, Detroit
Xubo Fei, Wayne State University, Detroit
Artem Chebotko, University of Texas - Pan American, Edinburg
Darshan Pai, Wayne State University, Detroit
Zhaoqiang Lai, Wayne State University, Detroit
Farshad Fotouhi, Wayne State University, Detroit
Jing Hua, Wayne State University, Detroit
Scientific workflows have recently emerged as a new paradigm for scientists to formalize and structure complex, distributed scientific processes, enabling and accelerating scientific discovery. In contrast to business workflows, which are typically control-flow oriented, scientific workflows tend to be dataflow oriented, which introduces a new set of requirements for system development. These requirements demand a new architectural design for scientific workflow management systems (SWFMSs). Although several SWFMSs have been developed and offer valuable experience for future research and development, a study from an architectural perspective is still missing. The main contributions of this paper are: 1) based on a comprehensive survey of the literature and the identification of key requirements for SWFMSs, we propose the first reference architecture for SWFMSs; 2) following the reference architecture, we propose a service-oriented architecture for VIEW (a VIsual sciEntific Workflow management system); 3) we implement VIEW to validate the feasibility of the proposed architectures; and 4) we present a VIEW-based scientific workflow application system (SWFAS), called FiberFlow, to showcase the application of the VIEW system.

Index Terms:
Reference architecture, scientific workflows, scientific workflow management system, SOA, VIEW.
Cui Lin, Shiyong Lu, Xubo Fei, Artem Chebotko, Darshan Pai, Zhaoqiang Lai, Farshad Fotouhi, Jing Hua, "A Reference Architecture for Scientific Workflow Management Systems and the VIEW SOA Solution," IEEE Transactions on Services Computing, vol. 2, no. 1, pp. 79-92, Jan.-March 2009, doi:10.1109/TSC.2009.4