6th Australasian Computer Systems Architecture Conference (AustCSAC'01) Implementing an efficient vector instruction set in a chip multi-processor using micro-threaded pipelines Gold Coast, Queensland, Australia January 29-January 30 ISBN: 0-7695-0954-1
This paper looks at a combination of two techniques, one of which, using a vector instruction set, has a long history dating back to pipelined vector supercomputers, such as the Cray 1 and its successors. The other technique, multi-threading, is also well understood. The novel approach proposed in this paper combines both vertical and horizontal micro-threading with vector instruction descriptors. It will be shown that a family of threads can represent a vector instruction with dependencies between the instances of that family, the iterations. This technique gives a very low overhead in implementing an n-way loop and is able to tolerate high memory latency. The use of micro-threading to handle dependencies between threads provides the ability to trade-off between instruction level parallelism and loop parallelism. The paper describes the means by which instruction classes may be instanced as independent parallel micro-threads and illustrates the speed-up that may be obtained compared to using a conventional loop.
Citation:
Chris Jesshope, "Implementing an efficient vector instruction set in a chip multi-processor using micro-threaded pipelines," austcsac, pp.80, 6th Australasian Computer Systems Architecture Conference (AustCSAC'01), 2001 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||