2010 19th International Conference on Parallel Architectures and Compilation Techniques (PACT) (2010)
Vienna, Austria
Sept. 11, 2010 to Sept. 15, 2010
ISBN: 978-1-5090-5032-1
pp: 157-168
Zheng Li, INRIA Saclay, Orsay, France
Jose Duato, Polytechnic University of Valencia, Spain
Olivier Certner, ST Microelectronics & INRIA Saclay, Orsay, France
Olivier Temam, INRIA Saclay, Orsay, France
ABSTRACT
Parallel programming approaches based on task division/spawning are becoming increasingly popular because they provide a simple and elegant abstraction of parallelization while achieving good performance on workloads that are traditionally hard to parallelize because of the complex control flow and data structures involved. The ability to quickly distribute fine-grained tasks among many cores is key to the efficiency and scalability of such division-based approaches. For this reason, several hardware support schemes for work-stealing environments have already been proposed. However, they all rely on a central hardware structure for distributing tasks among cores, which hampers their scalability and efficiency. In this paper, we focus on conditional division, a division-based parallel approach that offers two additional benefits over work stealing: it releases the user from dealing with task granularity, and it does not clog hardware resources with an exceedingly large number of small tasks. For this type of division-based approach, we show that it is possible to design hardware support for speeding up task division that relies entirely on local information and thus exhibits good scalability properties.
INDEX TERMS
hardware support, multicore, conditional parallelization
CITATION
Zheng Li, Jose Duato, Olivier Certner, Olivier Temam, "Scalable hardware support for conditional parallelization", 2010 19th International Conference on Parallel Architectures and Compilation Techniques (PACT), pp. 157-168, 2010.