Issue No.01 - Jan.-March (2014 vol.7)
pp: 2-17
Jia Zhang, Northern Illinois University, DeKalb
Daniel Kuc, Northern Illinois University, DeKalb
Shiyong Lu, Wayne State University, Detroit
Modern scientific data management and analysis usually rely on multiple scientists with diverse expertise. In recent years, such collaborative efforts have often been structured and automated by a data flow-oriented process called a scientific workflow. However, such workflows may have to be designed and revised by multiple scientists over a long period of time. Existing workbenches are single user-oriented and do not support scientific workflow application development in a "collaborative fashion." In this paper, we report our research on enabling techniques for collaboration provenance management and reproducibility. Based on a scientific collaboration ontology, we propose a service-oriented collaboration model supported by a set of composable collaboration primitives and patterns. The collaboration protocols are then applied to support effective concurrency control in the process of collaborative workflow composition. We also report the design and development of Confucius, a service-oriented collaborative scientific workflow composition tool that extends an open-source, single-user development environment.
Collaboration, Ontologies, Business, Indexes, Collaborative work, Service oriented architecture, Concurrency control, computer-supported cooperative work, Workflow management, service-oriented architecture
Jia Zhang, Daniel Kuc, Shiyong Lu, "Confucius: A Tool Supporting Collaborative Scientific Workflow Composition", IEEE Transactions on Services Computing, vol.7, no. 1, pp. 2-17, Jan.-March 2014, doi:10.1109/TSC.2012.13