Subscribe
Issue No.04 - Oct.-Dec. (2013 vol.6)
pp: 470-483
David Chiu , Washington State University, Vancouver
Gagan Agrawal , Ohio State University, Columbus
ABSTRACT
Large-scale scientific data analysis projects have catalyzed service-based workflow management systems. We present an approach for integrating user preferences on completion time and workflow accuracy in a workflow composition system. The relationship between workflow execution time and the accuracy of results is exploited by our workflow system. Specifically, our system is equipped with a way for users to define cost models on service completion time and error propagation (prevalent in many scientific and data analysis applications). Together with these models and an ontology for describing web service and data dependences, our system plans service-based workflows to answer high-level queries. Our system was evaluated under a real service-based environment against user constraints on time, accuracy, and network bandwidth variations. In the worst case in our experiments, we observed an average deviation of 14.3 percent below the desired time constraints, which suggests that our system is time-conservative. Within varying network bandwidth environments, we can also meet time constraints through sampling, and only a 12.4 percent deviation below time expectations are observed on average. We further show that, though negotiating with services' error models, our system is capable of planning data reduction measures (e.g., sampling) directly within workflow plans to achieve the desired accuracy.
INDEX TERMS
Ontologies, Accuracy, Web services, Databases, Time factors, Mathematical model, Registers,scientific workflows, Workflow management, web service composition
CITATION
David Chiu, Gagan Agrawal, "Cost and Accuracy Aware Scientific Workflow Composition for Service-Oriented Environments", IEEE Transactions on Services Computing, vol.6, no. 4, pp. 470-483, Oct.-Dec. 2013, doi:10.1109/TSC.2012.19
REFERENCES
[1] I. Foster, "Service-Oriented Science," Science, vol. 308, no. 5723, pp. 814-817, May 2005.
[2] T. Oinn, M. Addis, J. Ferris, D. Marvin, M. Senger, M. Greenwood, T. Carver, K. Glover, M.R. Pocock, A. Wipat, and P. Li, "Taverna: A Tool for the Composition and Enactment of Bioinformatics Workflows," Bioinformatics, vol. 20, no. 17, pp. 3045-3054, 2004.
[3] C. Lin, S. Lu, X. Fei, A. Chebotko, D. Pai, Z. Lai, F. Fotouhi, and J. Hua, "A Reference Architecture for Scientific Workflow Management Systems and the VIEW SOA Solution," IEEE Trans. Services Computing, vol. 2, no. 1, pp. 79-92, Jan. 2009.
[4] S. Dustdar and W. Schreiner, "A Survey on Web Services Composition," Int'l J. Web and Grid Services, vol. 1, no. 1, pp. 1-30, 2005.
[5] J. Rao and X. Su, "A Survey of Automated Web Service Composition Methods," Proc. First Int'l Conf. Semantic Web Services and Web Process Composition (SWSWPC '04), pp. 43-54, 2004.
[6] J. Qin and T. Fahringer, "A Novel Domain Oriented Approach for Scientific Grid Workflow Composition," Proc. ACM/IEEE Conf. Supercomputing (SC '08), pp. 1-12, 2008.
[7] L. Zeng, B. Benatallah, A.H. Ngu, M. Dumas, J. Kalagnanam, and H. Chang, "QoS-Aware Middleware for Web Services Composition," IEEE Trans. Software Eng., vol. 30, no. 5, pp. 311-327, May 2004.
[8] A. Afzal, J. Darlington, and A.S. McGough, "QoS-Constrained Stochastic Workflow Scheduling in Enterprise and Scientific Grids," Proc. IEEE/ACM Seventh Int'l Conf. Grid Computing (Grid '06), pp. 1-8, 2006.
[9] C. Fellbaum, WordNet: An Electronic Lexical Database. MIT Press, 1998.
[10] N. Alameh, "Chaining Geographic Information Web Services," IEEE Internet Computing, vol. 7, no. 5, pp. 22-29, Sept./Oct. 2003.
[11] D. Chiu, S. Deshpande, G. Agrawal, and R. Li, "Cost and Accuracy Sensitive Dynamic Workflow Composition over Grid Environments," Proc. IEEE/ACM Ninth Int'l Conf. Grid Computing (Grid '08), 2008.
[12] Y.-K. Kwok and I. Ahmad, "Static Scheduling Algorithms for Allocating Directed Task Graphs to Multiprocessors," ACM Computing Surveys, vol. 31, pp. 406-471, Dec. 1999.
[13] D. Chiu, S. Deshpande, G. Agrawal, and R. Li, "Composing Geoinformatics Workflows with User Preferences," Proc. 16th ACM SIGSPATIAL Int'l Conf. Advances in Geographic Information Systems (GIS '08), 2008.
[14] S.-C. Oh, D. Lee, and S.R.T. Kumara, "A Comparative Illustration of AI Planning-Based Web Services Composition," SIGecom Exchanges, vol. 5, no. 5, pp. 1-10, 2006.
[15] Web Services Business Process Execution Language (WS-BPEL) 2.0, OASIS Standard, 2011.
[16] L. Li and I. Horrocks, "A Software Framework for Matchmaking Based on Semantic Web Technology," Proc. 12th Int'l Conf. World Wide Web (WWW '03), pp. 331-339, 2003.
[17] Q. Sheng, B. Benatallah, M. Dumas, and E. Mak, "SELF-SERV: A Platform for Rapid Composition of Web Services in a Peer-to-Peer Environment," Proc. Demo Session of the 28th Int'l Conf. Very Large Databases, 2002.
[18] B. Benatallah, M. Dumas, Q.Z. Sheng, and A.H. Ngu, "Declarative Composition and Peer-to-Peer Provisioning of Dynamic Web Services," Proc. 18th Int'l Conf. Data Eng. (ICDE '02), 2002.
[19] P. Muth, D. Wodtke, J. Weissenfels, A.K. Dittrich, and G. Weikum, "From Centralized Workflow Specification to Distributed Workflow Execution," J. Intelligent Information Systems, vol. 10, no. 2, pp. 159-184, 1998.
[20] P. Traverso and M. Pistore, "Automated Composition of Semantic Web Services into Executable Processes," Proc. Third Int'l Semantic Web Conf., 2004.
[21] D. Wu, E. Sirin, J. Hendler, D. Nau, and B. Parsia, "Automatic Web Services Composition Using Shop2," Proc. Int'l Conf. Automated Planning and Scheduling (ICAPS '03), 2003.
[22] D. Martin et al., "OWL-S: Semantic Markup for Web Services. W3C Submission," http://www.w3.org/SubmissionOWL-S/, 2004.
[23] S.R. Ponnekanti and A. Fox, "SWORD: A Developer Toolkit for Web Service Composition," Proc. 11th Int'l Conf. World Wide Web (WWW '02), 2002.
[24] K. Fujii and T. Suda, "Semantics-Based Dynamic Service Composition," IEEE J. Selected Areas in Comm., vol. 23, no. 12, pp. 2361-2372, Dec. 2005.
[25] B. Medjahed, A. Bouguettaya, and A.K. Elmagarmid, "Composing Web Services on the Semantic Web," VLDB J., vol. 12, no. 4, pp. 333-351, 2003.
[26] I. Altintas, C. Berkley, E. Jaeger, M. Jones, B. Ludscher, and S. Mock, "Kepler: An Extensible System for Design and Execution of Scientific Workflows," Proc. 16th Int'l Conf. Scientific and Statistical Database Management, 2004.
[27] C. Brooks, E.A. Lee, X. Liu, S. Neuendorffer, Y. Zhao, and H. Zheng, "Heterogeneous Concurrent Modeling and Design in Java (Volume 2: Ptolemy II Software Architecture)," technical report, Dept. of Electrical Eng. and Computer Science, Univ. California, Berkeley, July 2005.
[28] S. Majithia, M.S. Shields, I.J. Taylor, and I. Wang, "Triana: A Graphical Web Service Composition and Execution Toolkit," Proc. IEEE Int'l Conf. Web Services (ICWS '04), pp. 514-524, 2004.
[29] M. Litzkow, M. Livny, and M. Mutka, "Condor—A Hunter of Idle Workstations," Proc. Eighth Int'l Conf. Distributed Computing Systems, June 1988.
[30] Condor Dagman, http://www.cs.wisc.edu/condordagman, 2013.
[31] E. Deelman, G. Singh, M.-H. Su, J. Blythe, Y. Gil, C. Kesselman, G. Mehta, K. Vahi, G.B. Berriman, J. Good, A.C. Laity, J.C. Jacob, and D.S. Katz, "Pegasus: A Framework for Mapping Complex Scientific Workflows onto Distributed Systems," Scientific Programming, vol. 13, no. 3, pp. 219-237, 2005.