The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.04 - October-December (2009 vol.6)
pp: 241-254
Alain Girault , INRIA and Grenoble University, Saint-Ismier Cedex
Hamoudi Kalla , University of Batna, Batna
ABSTRACT
We propose a new framework for the (length and reliability) bicriteria static multiprocessor scheduling problem. Our first criterion remains the schedule's length, which is crucial to assess the system's real-time property. For our second criterion, we consider the global system failure rate, seen as if the whole system were a single task scheduled onto a single processor, instead of the usual reliability, because it does not depend on the schedule length like the reliability does (due to its computation in the classical exponential distribution model). Therefore, we control better the replication factor of each individual task of the dependency task graph given as a specification, with respect to the desired failure rate. To solve this bicriteria optimization problem, we take the failure rate as a constraint, and we minimize the schedule length. We are thus able to produce, for a given dependency task graph and multiprocessor architecture, a Pareto curve of nondominated solutions, among which the user can choose the compromise that fits his or her requirements best. Compared to the other bicriteria (length and reliability) scheduling algorithms found in the literature, the algorithm we present here is the first able to improve significantly the reliability, by several orders of magnitude, making it suitable to safety-critical systems.
INDEX TERMS
Reliability, bicriteria optimization, Pareto optima, static multiprocessor scheduling, reliability block diagrams, safety-critical systems.
CITATION
Alain Girault, Hamoudi Kalla, "A Novel Bicriteria Scheduling Heuristics Providing a Guaranteed Global System Failure Rate", IEEE Transactions on Dependable and Secure Computing, vol.6, no. 4, pp. 241-254, October-December 2009, doi:10.1109/TDSC.2008.50
REFERENCES
[1] I. Assayad, A. Girault, and H. Kalla, “A Bi-Criteria Scheduling Heuristics for Distributed Embedded Systems under Reliability and Real-Time Constraints,” Proc. Int'l Conf. Dependable Systems and Networks (DSN '04), pp. 347-356, June 2004.
[2] A. Avizienis, J.-C. Laprie, B. Randell, and C. Landwehr, “Basic Concepts and Taxonomy of Dependable and Secure Computing,” IEEE Trans. Dependable and Secure Computing, vol. 1, no. 1, pp. 11-33, Jan.-Mar. 2004.
[3] H.S. Balaban, “Some Effects of Redundancy on System Reliability,” Proc. Nat'l Symp. Reliability and Quality Control, pp. 385-402, Jan. 1960.
[4] M.O. Ball, “Computational Complexity of Network Reliability Analysis: An Overview,” IEEE Trans. Reliability, vol. 35, pp.230-239, Aug. 1986.
[5] A. Colin and I. Puaut, “Worst Case Execution Time Analysis for a Processor with Branch Prediction,” Real-Time Systems, vol. 18, nos.2/3, pp. 249-274, 2000.
[6] J.-Y. Colin and P. Chretienne, “C.P.M. Scheduling with Small Computation Delays and Task Duplication,” Operations Research, vol. 39, no. 4, pp. 680-684, 1991.
[7] A. Dogan and F. Özgüner, “Matching and Scheduling Algorithms for Minimizing Execution Time and Failure Probability of Applications in Heterogeneous Computing,” IEEE Trans. Parallel and Distributed Systems, vol. 13, no. 3, pp. 308-323, Mar. 2002.
[8] A. Dogan and F. Özgüner, “Biobjective Scheduling Algorithms for Execution Time-Reliability Trade-Off in Heterogeneous Computing Systems,” Computer J., vol. 48, no. 3, pp. 300-314, 2005.
[9] J. Dongara, E. Jeannot, E. Saule, and Z. Shi, “Bi-Objective Scheduling Algorithms for Optimizing Makespan and Reliability on Heterogeneous Systems,” Proc. 19th ACM Symp. Parallelism in Algorithms and Architectures (SPAA '07), pp. 280-288, June 2007.
[10] C. Ferdinand, R. Heckmann, M. Langenbach, F. Martin, M. Schmidt, H. Theiling, S. Thesing, and R. Wilhelm, “Reliable and Precise WCET Determination for a Real-Life Processor,” Proc. Int'l Workshop Embedded Software (EMSOFT '01), vol. 2211, Oct. 2001.
[11] J. Gauthier, X. Leduc, and A. Rauzy, “Assessment of Large Automatically Generated Fault Trees by Means of Binary Decision Diagrams,” J. Risk and Reliability, vol. 221, no. 2, pp. 95-105, 2007.
[12] T. Grandpierre, C. Lavarenne, and Y. Sorel, “Optimized Rapid Prototyping for Real-Time Embedded Heterogeneous Multiprocessors,” Proc. Seventh Int'l Workshop Hardware/Software Co-Design (CODES '99), May 1999.
[13] M. Hakem and F. Butelle, “A Bi-Objective Algorithm for Scheduling Parallel Applications on Heterogeneous Systems Subject to Failures,” Rencontres Francophones du Parallélisme (RENPAR '06), Oct. 2006.
[14] C. Hirel, R. Sahner, X. Zang, and K.S. Trivedi, “Reliability and Performability Modeling Using Sharpe,” Proc. 11th Int'l Conf. Computer Performance Evaluation: Modelling Techniques and Tools (TOOLS '00), pp. 345-349, Mar. 2000.
[15] C.-C. Hsieh and Y.-C. Hsieh, “Reliability and Cost Optimization in Distributed Computing Systems,” Computers and Operations Research, vol. 30, no. 8, pp. 1103-1119, July 2003.
[16] M.A. Iverson, “Dynamic Mapping and Scheduling Algorithms for a Multi-User Heterogeneous Computing Environment,” PhD dissertation, Ohio State Univ., Columbus, 1999.
[17] P.A. Jensen and M. Bellmore, “An Algorithm to Determine the Reliability of a Complex System,” IEEE Trans. Reliability, vol. 18, pp. 169-174, Nov. 1969.
[18] S. Kartik and C.S.R. Murthy, “Improved Task Allocation Algorithms to Maximize Reliability of Redundant Distributed Computing Systems,” IEEE Trans. Reliability, vol. 44, no. 4, pp. 575-586, Dec. 1995.
[19] J.C. Knight and N.G. Leveson, “An Experimental Evaluation of the Assumption of Independence in Multi-Version Programming,” IEEE Trans. Software Eng., vol. 12, no. 1, pp. 96-109, 1986.
[20] D. Lloyd and M. Lipow, Reliability: Management, Methods, and Mathematics, chapter 9. Prentice Hall, 1962.
[21] P. Pop, K. Poulsen, and V. Izosimov, “Scheduling and Voltage Scaling for Energy/Reliability Trade-Offs in Fault-Tolerant Time-Triggered Embedded Systems,” Proc. Fifth IEEE/ACM Int'l Conf. Hardware/Software Codesign and System Synthesis (CODES-ISSS '07), Oct. 2007.
[22] X. Qin and H. Jiang, “A Novel Fault-Tolerant Scheduling Algorithm for Precedence Constrained Tasks in Real-Time Heterogeneous Systems,” Parallel Computing, vol. 32, nos. 5-6, pp. 331-356, June 2006.
[23] S.M. Shatz and J.-P. Wang, “Models and Algorithms for Reliability-Oriented Task-Allocation in Redundant Distributed-Computer Systems,” IEEE Trans. Reliability, vol. 38, no. 1, pp.16-26, Apr. 1989.
[24] G.C. Sih and E.A. Lee, “A Compile-Time Scheduling Heuristic for Interconnection Constraint Heterogeneous Processor Architectures,” IEEE Trans. Parallel and Distributed Systems, vol. 4, no. 2, pp.175-187, Feb. 1993.
[25] J. Souyris, E.L. Pavec, G. Himbert, V. Jégu, G. Borios, and R. Heckmann, “Computing the Worst Case Execution Time of an Avionics Program by Abstract Interpretation,” Proc. Fifth Int'l Workshop Worst-Case Execution Time (WCET '05), pp.21-24, July 2005.
[26] H. Theiling, C. Ferdinand, and R. Wilhelm, “Fast and Precise WCET Prediction by Separate Cache and Path Analyses,” Real-Time Systems, vol. 18, nos. 2/3, pp. 157-179, May 2000.
[27] V. T'kindt and J.-C. Billaut, Multicriteria Scheduling: Theory, Models and Algorithms. Springer, 2006.
[28] D. Zhu, R. Melhem, and D. Mossé, “The Effects of Energy Management on Reliability in Real-Time Embedded Systems,” Proc. Int'l Conf. Computer Aided Design (ICCAD '04), pp. 35-40, Nov. 2004.
24 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool