The Community for Technology Leaders
Green Image
Issue No. 02 - Feb. (2018 vol. 67)
ISSN: 0018-9340
pp: 178-192
Xiaohang Wang , School of Software Engineering, South China University of Technology, Guangzhou, China
Amit Kumar Singh , School of Electronics and Computer Science, University of Southampton, Southampton, United Kingdom
Bing Li , School of Software Engineering, South China University of Technology, Guangzhou, China
Yang Yang , School of Data and Computer Science, Sun Yat-sen University, Guangzhou, China
Hong Li , School of Software Engineering, South China University of Technology, Guangzhou, China
Terrence Mak , School of Electronics and Computer Science, University of Southampton, Southampton, United Kingdom
ABSTRACT
All the cores of a many-core chip cannot be active at the same time, due to reasons like low CPU utilization in server systems and limited power budget in dark silicon era. These free cores (referred to as bubbles) can be placed near active cores for heat dissipation so that the active cores can run at a higher frequency level, boosting the performance of applications that run on active cores. Budgeting inactive cores (bubbles) to applications to boost performance has the following three challenges. First, the number of bubbles varies due to open workloads. Second, communication distance increases when a bubble is inserted between two communicating tasks (a task is a thread or process of a parallel application), leading to performance degradation. Third, budgeting too many bubbles as coolers to running applications leads to insufficient cores for future applications. In order to address these challenges, in this paper, a bubble budgeting scheme is proposed to budget free cores to each application so as to optimize the throughput of the whole system. Throughput of the system depends on the execution time of each application and the waiting time incurred for newly arrived applications. Essentially, the proposed algorithm determines the number and locations of bubbles to optimize the performance and waiting time of each application, followed by tasks of each application being mapped to a core region. A Rollout algorithm is used to budget power to the cores as the last step. Experiments show that our approach achieves 50 percent higher throughput when compared to state-of-the-art thermal-aware runtime task mapping approaches. The runtime overhead of the proposed algorithm is in the order of 1M cycles, making it an efficient runtime task management method for large-scale many-core systems.
INDEX TERMS
Throughput, Heating systems, Resource management, Silicon, Servers, Electronic mail, Runtime
CITATION

X. Wang, A. K. Singh, B. Li, Y. Yang, H. Li and T. Mak, "Bubble Budgeting: Throughput Optimization for Dynamic Workloads by Exploiting Dark Cores in Many Core Systems," in IEEE Transactions on Computers, vol. 67, no. 2, pp. 178-192, 2018.
doi:10.1109/TC.2017.2735967
319 ms
(Ver 3.3 (11022016))