This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
The Timely Computing Base Model and Architecture
August 2002 (vol. 51 no. 8)
pp. 916-930

Current systems are very often based on large-scale, unpredictable, and unreliable infrastructures. However, users of these systems increasingly require services with timeliness properties. This creates a difficult-to-solve contradiction with regard to the adequate time model: synchronous, or asynchronous? In this paper, we propose an architectural construct and programming model which address this problem. We assume the existence of a component that is capable of executing timely functions, however asynchronous the rest of the system may be. We call this component the Timely Computing Base and it can be used by the other components to execute a set of simple but crucial time-related services. We also show how to use it to build dependable and timely applications exhibiting varying degrees of timeliness assurance, under several synchrony models.

[1] Information Security, M. Abrams, S. Jajodia, and H. Podell, eds. IEEE CS Press, 1995.
[2] P. Veríssimo and A. Casimiro, “The Timely Computing Base,” DI/FCUL TR 99-2, Dept. of Computer Science, Univ. of Lisboa, Apr. 1999.
[3] P. Veríssimo, A. Casimiro, and C. Fetzer, “The Timely Computing Base: Timely Actions in the Presence of Uncertain Timeliness,” Proc. Int'l Conf. Dependable Systems and Networks, pp. 533-542, June 2000.
[4] A. Casimiro and P. Veríssimo, “Using the Timely Computing Base for Dependable QoS Adaptation,” Proc. 20th IEEE Symp. Reliable Distributed Systems, pp. 208-217, Oct. 2001.
[5] A. Casimiro and P. Veríssimo, “Generic Timing Fault Tolerance Using a Timely Computing Base,” Proc. Int'l Conf. Dependable Systems and Networks, June 2002.
[6] A. Casimiro, P. Martins, and P. Veríssimo, “How to Build a Timely Computing Base Using Real-Time Linux,” Proc. 2000 IEEE Int'l Workshop Factory Comm. Systems, pp. 127-134, Sept. 2000.
[7] T.D. Chandra and S. Toueg, “Unreliable Failure Detectors for Reliable Distributed Systems,” J. ACM, vol. 43, no. 2, pp. 225–267, 1996.
[8] F. Cristian and C. Fetzer, “The Timed Asynchronous System Model,” Proc. 28th Ann. Int'l Symp. Fault-Tolerant Computing, pp. 140-149, June 1998.
[9] P. Veríssimo and C. Almeida, “Quasi-Synchronism: A Step Away from the Traditional Fault-Tolerant Real-Time System Models,” Bull. Technical Committee on Operating Systems and Application Environments (TCOS), vol. 7, no. 4, pp. 35-39, Winter 1995.
[10] D. Dolev, C. Dwork, and L. Stockmeyer, “On the Minimal Synchronism Needed for Distributed Consensus,” Proc. 24th Ann. Symp. Foundations of Computer Science, Nov. 1983.
[11] C. Dwork, N. Lynch, and L. Stockmeyer, “Consensus in the Presence of Partial Synchrony,” J. ACM. vol. 35, no. 2, pp. 288–323, Apr. 1988.
[12] S. Mishra, C. Fetzer, and F. Cristian, “The Timewheel Group Communication System,” IEEE Trans. Computers, vol. 51, no. 8, pp. xxx-xxx, Aug. 2002.
[13] J.-F. Hermant and G.L. Lann, “Asynchronous Uniform Consensus in Real-Time Distributed Systems,” IEEE Trans. Computers, vol. 51, no. 8, pp. , Aug. 2002.
[14] Z. Manna and A. Pnueli, The Temporal Logic of Reactive and Concurrent Systems. Springer-Verlag, 1991.
[15] R. Koymans, “Specifying Real-Time Properties with Metric Temporal Logic,” J. Real-Time-Systems, vol. 2, no. 4, pp. 255-299, Nov. 1990.
[16] A. Burns and A. Wellings, Real-Time Systems and Programming Languages, third ed. Addison-Wesley, 2001.
[17] A. Burns, “A Framework for Building Real-Time Responsive Systems,” Proc. First Int'l Workshop Responsive Computer Systems, pp. 6-9, Oct. 1991.
[18] F. Jahanian, “Fault Tolerance in Embedded Real-Time Systems,” Lecture Notes in Computer Science, vol. 774, pp. 237-249, 1994.
[19] E.D. Jensen and J.D. Northcutt, Alpha: A Non-Proprietary Operating System for Large, Complex, Distributed Real-Time Systems Proc. IEEE Workshop Experimental Distributed Systems, pp. 35-41, 1990.
[20] H. Kopetz, R. Zainlinger, G. Fohler, H. Kantz, P. Puschner, and W. Schutz, “An Engineering Approach towards Hard Real-Time System Design,” Lecture Notes in Computer Science, vol. 550, pp. 166-188, 1991.
[21] M. de Prycker, Asynchronous Transfer Mode: Solution for Broadband ISDN, third ed. Prentice Hall, 1995.
[22] R. Brand, “Iso-Ethernet: Bridging the Gap from WAN to LAN,” Data Comm., July 1995.
[23] L. Zhang, S. Berson, S. Herzog, and S. Jamin, “RFC 2205: Resource ReSerVation Protocol (RSVP)—version 1 Functional Specification,” R. Braden, ed., Sept. 1997.
[24] H. Schulzrinne, S. Casner, R. Frederick, and V. Jacobson, “RTP: A Transport Protocol for Real-Time Applications,” Technical Report RFC 1889, Audio-Video Transport Working Group, Jan. 1996.
[25] P. Veríssimo, “Ordering and Timeliness Requirements of Dependable Real-Time Programs,” J. Real-Time Systems, vol. 7, no. 2, pp. 105-128, Sept. 1994.
[26] W. Chen, S. Toueg, and M. Aguilera, “On the Quality of Service of Failure Detectors,” Proc. Int'l Conf. Dependable Systems and Networks, pp. 191-200, June 2000.
[27] T.D. Chandra, V. Hadzillacos, S. Toueg, and B. Charron-Bost, “On the Impossibility of Group Membership,” Proc. 15th ACM Symp. Principles of Distributed Computing, pp. 322–330, 1996.
[28] E. Anceaume, B. Charron-Bost, P. Minet, and S. Toueg, “On the Formal Specification of Group Membership Services,” Technical Report RR-2695, INRIA, Rocquencourt, France, Nov. 1995.
[29] A. Gopal and S. Toueg, “Inconsistency and Contamination (Preliminary Version),” Proc. 10th Ann. ACM Symp. Principles of Distributed Computing, pp. 257-272, Aug. 1991.
[30] D. Powell, Failure Mode Assumptions and Assumption Coverage Proc. 22nd Int'l Symp. Fault-Tolerant Computing (FTCS-22), pp. 386-395, 1992.
[31] Bibliography on Time Series and Stochastic Processes, H. Wold, ed. London: Oliver and Boyd, 1965.
[32] M. Lubaszewski and B. Courtois, “A Reliable Fail-Safe System,” IEEE Trans. Computers, vol. 47, no. 2, pp. 236-241, Feb. 1998.
[33] C. Fetzer and F. Cristian, “Fail-Awareness in Timed Asynchronous Systems,” Proc. 15th ACM Symp. Principles of Distributed Computing, pp. 314–321a, Philadelphia, May 1996. Also available as
[34] C. Fetzer and F. Cristian, “Fail-Awareness: An Approach to Construct Fail-Safe Applications,” Proc. 27th Ann. Int'l Fault-Tolerant Computing Symp., pp. 282-291, June 1997.
[35] T. Abdelzaher and K. Shin, "End-Host Architecture for QoS-Adaptive Communication," Proc. IEEE Real-Time Technology and Applications Symp., IEEE Press, Piscataway, N.J., June 1998, pp. 121-130.
[36] C. Almeida and P. Veríssimo, “Timing Failure Detection and Real-Time Group Communication in Quasi-Synchronous Systems,” Proc. Eighth Euromicro Workshop Real-Time Systems, June 1996.
[37] M. Cukier et al., AQuA: An Adaptive Architecture that Provides Dependable Distributed Objects Proc. IEEE Symp. Reliable Distributed Systems, pp. 245-253, Oct. 1998.
[38] B. Li and K. Nahrstedt, "A Control-Based Middleware Framework for Quality of Service Adaptations," IEEE J. Selected Areas in Comm., vol. 17, no. 9, Sept. 1999, pp. 1632-1650.
[39] I. Foster, V. Sander, and A. Roy, “A Quality of Service Architecture that Combines Resource Reservation and Application Adaptation,” Proc. Eighth Int'l Workshop Quality of Service, pp. 181-188, June 2000.
[40] P. Veríssimo, P. Barrett, P. Bond, A. Hilborne, L. Rodrigues, and D. Seaton, “The Extra Performance Architecture (XPA),” Delta-4—A Generic Architecture for Dependable Distributed Computing, D. Powell, ed., ESPRIT Research Reports, pp. 211-266, Springer Verlag, Nov. 1991.
[41] H. Zou and F. Jahanian, “Real-Time Primary-Backup (RTPB) Replication with Temporal Consistency Guarantees,” Technical Report CSE-TR-356-98, Dept. of Electrical Eng. and Computer Science, Univ. of Michigan, Feb. 1998.
[42] C. Almeida and P. Veríssimo, “Using Light-Weight Groups to Handle Timing Failures in Quasi-Synchronous Systems,” Proc. 19th IEEE Real-Time Systems Symp., Dec. 1998.
[43] A. Casimiro, F. Cristian, C. Fetzer, and P. Veríssimo, “Private Communications,” Sept. 1998.
[44] W. Feller, An Introduction to Probability Theory and Its Applications, second ed. John Wiley&Sons, 1971.

Index Terms:
Distributed systems, real-time systems, timely computing base, partial synchrony models.
Citation:
Paulo Veríssimo, António Casimiro, "The Timely Computing Base Model and Architecture," IEEE Transactions on Computers, vol. 51, no. 8, pp. 916-930, Aug. 2002, doi:10.1109/TC.2002.1024739
Usage of this product signifies your acceptance of the Terms of Use.