2015 International Conference on Parallel Architecture and Compilation (PACT) (2015)
San Francisco, CA, USA
Oct. 18, 2015 to Oct. 21, 2015
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/PACT.2015.28
The availability of commercial hardware transactionalmemory (TM) systems has not yet been met with a rise in the numberof large-scale programs that use memory transactions explicitly. Asignificant impediment to the use of TM is the lack of tool support, specifically profilers that can identify and explain performance anomalies. In this paper, we introduce an end-to-end system that enables lowoverheadperformance profiling of large-scale transactional programs. We present algorithms and an implementation for Intel's Haswellprocessors. With our system, it is possible to record a transactionalprogram's execution with minimal overhead, and then replay it withina custom profiling tool to identify causes of contention and aborts, down to the granularity of individual memory accesses. Evaluationshows that our algorithms have low overhead, and our tools enableprogrammers to effectively explain performance anomalies.
Hardware, Clocks, Libraries, Synchronization, Instruction sets, Instruments,record and replay, transactional memory, profiling
Yujie Liu, Justin Gottschlich, Gilles Pokam, Michael Spear, "TSXProf: Profiling Hardware Transactions", 2015 International Conference on Parallel Architecture and Compilation (PACT), vol. 00, no. , pp. 75-86, 2015, doi:10.1109/PACT.2015.28