Issue No. 07 - July (1993 vol. 4)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/71.238299
<p>Achieving high performance for parallel or distributed programs often requires substantial amounts of information about the programs themselves, about the systems on which they are executing, and about specific program runs. A monitoring system is presented that collects, analyzes, and makes application-dependent monitoring information available to the programmer and to the executing program. The system may be used for off-line program analysis, for on-line debugging, and for making on-line, dynamic changes to parallel or distributed programs to enhance their performance. The authors use a high-level, uniform data model for the representation of program information and monitoring data. They show how this model may be used for the specification of program views and attributes for monitoring, and demonstrate how such specifications can be translated into efficient, program-specific monitoring code that uses alternative mechanisms for the distributed analysis and collection to be performed for the specified views. The model's utility has been demonstrated on a wide variety of parallel machines.</p>
Index Terms: distributed systems; application dependent dynamic monitoring; parallel systems; distributed programs; off-line program analysis; on-line debugging; uniform data model; specification; attributes; program-specific monitoring code; distributed analysis; parallel machines; distributed processing; parallel programming; program debugging; program testing
R. Snodgrass, K. Schwan and D. Ogle, "Application-Dependent Dynamic Monitoring of Distributed and Parallel Systems," in IEEE Transactions on Parallel & Distributed Systems, vol. 4, no. 7, pp. 762-778, 1993.