15th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP'07)
Design and Implementation of Floating Point Stack on General RISC Architecture
Naples, Italy
February 07-February 09
ISBN: 0-7695-2784-1
Xuehai Qian, Institute of Computing Technology, Chinese Academy of Sciences
He Huang, Institute of Computing Technology, Chinese Academy of Sciences
Hao Zhang, Institute of Computing Technology, Chinese Academy of Sciences
Guoping Long, Institute of Computing Technology, Chinese Academy of Sciences
Junchao Zhang, Institute of Computing Technology, Chinese Academy of Sciences
Dongrui Fan, Institute of Computing Technology, Chinese Academy of Sciences
This paper presents a framework for implementing the X86 FP stack used in an x86-compliant processor based on a general RISC architecture. Architectural supports are added to a typical RISC architecture to maintain the FP stack status. Some speculative techniques are applied to the decode stage to enable pipelined and efficient FP operations. An optimized register renaming scheme is proposed to eliminate redundant micro-ops in FP programs, resulting in an increased performance while mitigating the burden on register rename table. The simulation results show that on average more than 10% fmov micro-ops are removed. Elimination of micro-ops significantly speeds up the execution of programs. The IPC increases are as high as 30% for some programs, and near 10% on average.
Citation:
Xuehai Qian, He Huang, Hao Zhang, Guoping Long, Junchao Zhang, Dongrui Fan, "Design and Implementation of Floating Point Stack on General RISC Architecture," pdp, pp.238-245, 15th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP'07), 2007