The Community for Technology Leaders
2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) (2016)
Haifa, Israel
Sept. 11, 2016 to Sept. 15, 2016
ISBN: 978-1-5090-5308-7
pp: 373-386
Sankaralingam Panneerselvam , University of Wisconsin-Madison, United States
Michael Swift , University of Wisconsin-Madison, United States
ABSTRACT
Current processors provide a variety of different processing units to improve performance and power efficiency. For example, ARM's big.LITTLE, AMD's APUs, and Oracle's M7 provide heterogeneous processors, on-die GPUs, and on-die accelerators. However, the performance experienced by programs using these processing units can vary widely due to contention from multiprogramming, thermal constraints and other issues. In these systems, the decision of where to execute a task must consider not only execution time of the task, but also current system conditions. We built Rinnegan, a Linux kernel extension and runtime library, to perform scheduling and handle task placement in heterogeneous systems. The Rinnegan kernel extension monitors and reports the utilization of all processing units to applications, which then makes placement decisions at user level. The Rinnegan runtime provides a performance model to predict the speedup and overhead of offloading a task. With this model and the current utilization of processing units, the runtime can select the task placement that best achieves an application's performance goals, such as low latency, high throughput, or real-time deadlines. When integrated with StarPU, a runtime system for heterogeneous architectures, Rinnegan improves StarPU by performing 1.5– 2× better than its native scheduling policies in a shared heterogeneous environment.
INDEX TERMS
Runtime, Kernel, Program processors, Monitoring, Resource management, Processor scheduling, Central Processing Unit
CITATION
Sankaralingam Panneerselvam, Michael Swift, "Rinnegan: Efficient resource use in heterogeneous architectures", 2016 International Conference on Parallel Architecture and Compilation Techniques (PACT), vol. 00, no. , pp. 373-386, 2016, doi:10.1145/2967938.2967964
94 ms
(Ver 3.3 (11022016))