Issue No. 04 - Oct.-Dec. (2013 vol. 6)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TSC.2012.19
David Chiu , Washington State University, Vancouver
Gagan Agrawal , Ohio State University, Columbus
Large-scale scientific data analysis projects have catalyzed service-based workflow management systems. We present an approach for integrating user preferences on completion time and workflow accuracy in a workflow composition system. The relationship between workflow execution time and the accuracy of results is exploited by our workflow system. Specifically, our system is equipped with a way for users to define cost models on service completion time and error propagation (prevalent in many scientific and data analysis applications). Together with these models and an ontology for describing web service and data dependences, our system plans service-based workflows to answer high-level queries. Our system was evaluated under a real service-based environment against user constraints on time, accuracy, and network bandwidth variations. In the worst case in our experiments, we observed an average deviation of 14.3 percent below the desired time constraints, which suggests that our system is time-conservative. Within varying network bandwidth environments, we can also meet time constraints through sampling, and only a 12.4 percent deviation below time expectations are observed on average. We further show that, though negotiating with services' error models, our system is capable of planning data reduction measures (e.g., sampling) directly within workflow plans to achieve the desired accuracy.
Ontologies, Accuracy, Web services, Databases, Time factors, Mathematical model, Registers
David Chiu, Gagan Agrawal, "Cost and Accuracy Aware Scientific Workflow Composition for Service-Oriented Environments", IEEE Transactions on Services Computing, vol. 6, no. , pp. 470-483, Oct.-Dec. 2013, doi:10.1109/TSC.2012.19