loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
1997 Advances in Parallel and Distributed Computing Conference (APDC '97)
A Multithreaded Processor Designed for Distributed Shared Memory Systems
Shanghai, CHINA
March 19-March 21
ISBN: 0-8186-7876-3
Winfried Gruenewald, University of Karlsruhe
Theo Ungerer, University of Karlsruhe
The multithreaded processor - called Rhamma - uses a fast context switch to bridge latencies caused by memory accesses or by synchronization operations. Load/store, synchronization, and execution operations of different threads of control are executed simultaneously by appropriate functional units. A fast context switch is performed whenever a functional unit comes across an operation that is destined for another unit. The overall performance depends on the speed of the context switch. We present two techniques to reduce the context switch cost to at most one processor cycle: A context switch is explicitly coded in the opcode, and a context switch buffer is used. The load/store unit shows up as the principal bottleneck. We evaluate four implementation alternatives of the load/store unit to increase processor performance.
Index Terms:
multithreading, processor architecture, fast context switching
Citation:
Winfried Gruenewald, Theo Ungerer, "A Multithreaded Processor Designed for Distributed Shared Memory Systems," apdc, pp.206, 1997 Advances in Parallel and Distributed Computing Conference (APDC '97), 1997
Usage of this product signifies your acceptance of the Terms of Use.