This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
A Hybrid Monitor for Behavior and Performance Analysis of Distributed Systems
February 1990 (vol. 16 no. 2)
pp. 197-211

The authors describe a hybrid monitor for measuring the performance and observing the behavior of distributed systems during execution. They emphasize data collection, analysis and presentation of execution data. A special hardware support, which consists of a test and measurement processor (TMP), was designed and has been implemented in the nodes of experimental multicomputer system consisting of eleven nodes. The operations of the TMP are completely transparent with a minimal, less than 0.1%, overhead to the measured system. In the experimental system, all the TMPs were connected with a central monitoring station, using an independent communication network, in order to provide a global view of the monitored system. The central monitoring station displayed the resulting information in easy-to-read charts and graphs. Experience with the TMP shows that it promotes an improved understanding of run-time behavior and performance measurements, which aids in deriving qualitative and quantitative assessments of distributed systems.

[1] B. Bates and J. C. Wileden, "High-level debugging of distributed systems: The behavioral abstraction approach,"J. Syst. Software, pp. 225-264, Mar. 1983.
[2] C. Brownet al., "Research with the butterfly multicomputer,"Comput. Sci. Comput. Eng. Res. Rev. 1984-1985, Univ. Rochester, 1985.
[3] D. Ferrari and V. Minetti, "A hybrid measurement tool for minicomputers," inExperimental Computer Performance and Evaluation, D. Ferrari and M. Spadoni, Eds. Amsterdam, The Netherlands: North-Holland, 1981.
[4] K. A. Frenkel, "Evaluating two massively parallel machines,"Commun. ACM, vol. 29, pp. 752-758, Aug. 1986.
[5] H. Frommet al., "Experiences with performance measurements and modeling of a processor array,"IEEE Trans. Comput., vol. C-32, no. 1, pp. 15-31, Jan. 1983.
[6] H. Garcia-Molinaet al., "Debugging a distributed system,"IEEE Trans. Software Eng., vol. SE-10, no. 2, pp. 210-219, Mar. 1984.
[7] R. Gusella and S. Zatti, "The accuracy of the clock synchronization achieved by TEMPO in Berkeley UNIX 4.3BSD,"IEEE Trans. Software Eng., vol. 15, no. 7, pp. 847-853, July 1989.
[8] D. Haban, "DTM--A distributed test methodology," inProc. 6th Symp. Reliability in Distributed Software and Database Systems, Mar. 1987, pp. 66-73.
[9] D. Haban, D. Wybranietz, and A. Barak, "Monitoring and management support of distributed systems," inProc. Workshop Progress in Distributed Operating Systems and Distributed Systems Management, Berlin, 1989; to appear asSpringer LNCS.
[10] D. Haban and K. Shin, "Application of Real-Time Monitoring to Scheduling Tasks with Random Execution Times,"Proc. Real-Time Systems Symp., IEEE Press, New York, 1989, pp. 232-241.
[11] D. Haban and W. Weigel, "Global Events and Global Breakpoints in Distributed Systems,"Proc. 21st Hawaii Int'l Conf. System Sciences, Vol. II, IEEE Computer Society Press, Order No. 842 (microfiche only), 1989, pp. 166-175.
[12] P. K. Harter, D. M. Heimbigner, and R. King, "IDD: An interactive distributed debugger," inProc. 5th Int. Conf. Distributed Computing Systems, May 1985, pp. 498-506.
[13] C. Hewitt, "Viewing control structures as patterns of passing messages,"Artificial Intell., vol. 8, pp. 323-364, 1977.
[14] S. H. Jones, R. H. Barkan, and L, D. Wittie, "Bugnet: A real time distributed debugging system," inProc. 6th Symp. Reliability in Distributed Software and Database Systems, Mar. 1987, pp. 56-65.
[15] J. Joyce, G. Lomow, K. Slind, and B. Unger, "Monitoring distributed systems,"ACM Trans. Comput. Syst., vol. 5, no. 2, pp. 121- 150, May 1987.
[16] L. Lamport, "Time, clocks, and the ordering of events in a distributed system,"Commun. ACM, vol. 21, no. 7, pp. 558-565, July 1978.
[17] J. E. Lambert and F. Halsall, "Program debugging and performance evaluation aids for a multimicroprocessor system,"Software Microsyst., vol. 3, no. 1, pp. 2-10, Feb. 1984.
[18] K. W. Kolence and P. J. Kiviat, "Software unit profiles and Kiviat figures,"ACM Sigmetrics Perform. Eval. Rev., June 1976.
[19] B. P. Miller and C.-Q. Yang, "IPS: An interactive and automatic performance measurement tool for parallel and distributed programs," inProc. 7th Int. Conf. Distributed Computing Systems, Sept. 1987, pp. 482-489.
[20] J. Nehmer et al., "Key Concepts of the Incas Multicomputer Project,"IEEE Trans. Software Eng., Aug. 1987, pp. 913-923.
[21] C. L. Seitz, "The Cosmic Cube,"Commun. ACM, pp. 22-33, Jan. 1985.
[22] K.G. Shin and P. Ramanathan, "Clock Synchronization of a Large Multiprocessor System in the Presence of Malicious Faults,"IEEE Trans. Computers, Vol. C-36, No. 1, Jan. 1987, pp. 2-12.
[23] R. Snodgrass, "A relational approach to monitoring complex systems,"ACM Trans. Comput. Syst., vol. 6, no. 2, pp. 157-196, May 1988.
[24] L. Svobodova, "Online system performance measurements with software and hybrid monitors,"Operat. Syst. Rev., vol. 7, no. 4, pp. 45-53, Oct. 1973.
[25] W. A. Wulfet al., Hydra/C.mmp: An Experimental Computer System. New York: McGraw-Hill, 1981.
[26] D. Wybranietz, "A simulation system for multicast communications with interactive facilities," inProc. 15th Simula Users Conf., St. Helier, Jersey, Channel Islands, England, Sept. 1987.
[27] D. Wybranietz and P. Buhler, "The LADY programming environment for distributed operating systems," inProc. Parallel Architectures and Languages Europe PARLE '89, Eindhoven, The Netherlands, June 1989.
[28] Dieter Wybranietz and Dieter Haban, "Monitoring and Performance Measuring Distributed Systems Under Operation,"Proc. ACM SIGMETRICS Conf Measurement and Modelling of Comp. Systems, Association of Computing Machinery, New York, 1988, pp. 197-206.

Index Terms:
behaviour analysis; qualitative assessments; performance analysis; distributed systems; hybrid monitor; data collection; execution data; hardware support; test and measurement processor; TMP; experimental multicomputer system; communication network; quantitative assessments; distributed processing; performance evaluation; program testing.
Citation:
D. Haban, D. Wybranietz, "A Hybrid Monitor for Behavior and Performance Analysis of Distributed Systems," IEEE Transactions on Software Engineering, vol. 16, no. 2, pp. 197-211, Feb. 1990, doi:10.1109/32.44382
Usage of this product signifies your acceptance of the Terms of Use.