This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
A High-Utilization Scheduling Schemeof Stream Programs on ClusteredVLIW Stream Architectures
April 2014 (vol. 25 no. 4)
pp. 840-850
Shaojun Wei, Inst. of Microelectron., Tsinghua Univ., Beijing, China
Fang Wang, Res. Inst. of Inf. Technol., Tsinghua Univ., Beijing, China
Zhaolin Li, Res. Inst. of Inf. Technol., Tsinghua Univ., Beijing, China
Guoyue Jiang, Inst. of Microelectron., Tsinghua Univ., Beijing, China
Stream architectures have emerged as a mainstream solution for computation-intensive applications due to their rich arithmetic units. This paper proposes a multithreading technique based on a scheduling scheme of stream programs on clustered VLIW stream architecture, which aims at optimal arithmetic unit utilization without increasing energy consumption. Its principle is to exploit more kernel-level parallelism for further optimal compilation by constructing homogeneous multiple threads on stream programs. Three phases are proposed in the scheduling scheme. First, threads in stream programs are replicated for constructing homogeneous multiple threads. Second, time step assignment for homogeneous multithreaded stream programs is utilized to obtain efficient kernel combination. Third, stream segmentation is presented to optimize both memory transfers and startup overheads of kernels. A set of benchmarks are exploited to evaluate the effectiveness of the proposed technique. Experimental results show that, with exploiting kernel-level software pipeline, the proposed technique improves the performance by 20.9 percent averagely with the energy decreasing by 7.6 percent. Utilizations of adders and multipliers are up to average 77.4 and 75.8 percent, increasing 17.0 and 13.3 percent, respectively. Moreover, the proposed technique performs an average of 12.5 percent improvement over CSMT4 with the energy decreasing by 12.0 percent.
Index Terms:
Kernel,Streaming media,Instruction sets,Registers,Computer architecture,System-on-chip,VLIW,arithmetic unit utilization,Stream architecture,stream program,scheduling scheme,homogeneous multiple threads
Citation:
Shaojun Wei, Fang Wang, Zhaolin Li, Guoyue Jiang, "A High-Utilization Scheduling Schemeof Stream Programs on ClusteredVLIW Stream Architectures," IEEE Transactions on Parallel and Distributed Systems, vol. 25, no. 4, pp. 840-850, April 2014, doi:10.1109/TPDS.2013.80
Usage of this product signifies your acceptance of the Terms of Use.