
This Article  
 
Share  
Bibliographic References  
Add to:  
Digg Furl Spurl Blink Simpy Del.icio.us Y!MyWeb  
Search  
 
ASCII Text  x  
S. Vassiliadis, J. Phillips, B. Blaner, "Interlock Collapsing ALU's," IEEE Transactions on Computers, vol. 42, no. 7, pp. 825839, July, 1993.  
BibTex  x  
@article{ 10.1109/12.237723, author = {S. Vassiliadis and J. Phillips and B. Blaner}, title = {Interlock Collapsing ALU's}, journal ={IEEE Transactions on Computers}, volume = {42}, number = {7}, issn = {00189340}, year = {1993}, pages = {825839}, doi = {http://doi.ieeecomputersociety.org/10.1109/12.237723}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, }  
RefWorks Procite/RefMan/Endnote  x  
TY  JOUR JO  IEEE Transactions on Computers TI  Interlock Collapsing ALU's IS  7 SN  00189340 SP825 EP839 EPD  825839 A1  S. Vassiliadis, A1  J. Phillips, A1  B. Blaner, PY  1993 KW  interlocked fixed point arithmetic logic unit; two's complement; unsigned binary; binary logical operations; single instruction execution; carrysave adder; carrylookahead adder; CMOS technology; machine cycle; architectural compatibility; adders; CMOS integrated circuits; digital arithmetic; parallel processing. VL  42 JA  IEEE Transactions on Computers ER   
A device capable of executing interlocked fixed point arithmetic logic unit (ALU) instructions in parallel with other instructions causing the execution interlock is presented. The device incorporates the design of a 31 ALU and can execute two's complement, unsigned binary, and binary logical operations. It is shown that status for ALU operations using a 31 ALU can be determined in a parallel fashion, resulting in the compliance of the proposed device with predetermined architectural behavior of single instruction execution. The device requires no more logic stages than does a 31 binary adder using a carrysave adder (CSA) followed by a carrylookahead adder (CLA) design. Design considerations using a commonly available CMOS technology are also reported, indicating that the device will not increase the machine cycle of an implementation. It is suggested that the device can maintain full architectural compatibility.
[1] P. M. Kogge,The Architecture of Pipelined Computers. New York: McGrawHill, 1981.
[2] R. M. Tomasulo, "An efficient algorithm for exploiting multiple arithmetic units,"IBM J. Res. Develop., pp. 2533, Jan. 1967.
[3] R. D. Acosta, J. Kjelstrup, and H. C. Torng, "An instruction issuing approach to enhancing performance in multiple functional unit processors,"IEEE Trans. Comput., vol. C35, pp. 815828, Sept. 1986.
[4] H. S. Warren, "Instruction scheduling for the IBM RISC system/6000 processor,"IBM J. Res. Develop., vol. 34, no. 1, pp. 8592, Jan. 1990.
[5] N. P. Jouppi, "The nonuniform distribution of instructionlevel and machine parallelism and its effect on performance,"IEEE Trans. Comput., vol. 38, no. 12, pp. 16451658, Dec. 1989.
[6] N.P. Jouppi and D.W. Wall, "Available InstructionLevel Parallelism for Superpipelined and Superscalar Machines,"Third Int'l Conf. Architectural Support for Programming Languages and Operating Systems, IEEE CS Press, Los Alamitos, Calif., Order No. 1936, 1989, pp. 272282.
[7] R.R. Oehler and R.D. Groves, "IBM RISC System/6000 Processor Architecture,"IBM J. Research and Development, Vol. 34, No. 1, Jan. 1990, pp. 2336.
[8] A. Padegs, B. B. Moore, R. M. Smith, and W. Buchholz, "The IBM system/370 vector architecture: Design considerations,"IEEE Trans. Comput., vol. 37, no. 5, pp. 509520, May 1988.
[9] W. A. Wulf, "The WM computer architecture,"Comput. Architecture News, vol. 16, no. 1, pp. 7084, Mar. 1988.
[10] W. A. Wulf and C. Y. Hitchcock, III, "Apparatus for reading to and writing from memory streams of data while concurrently executing a plurality of data processing operations," U. S. Patent 4 819 155, Apr. 1989.
[11] D. W. Ruck, S. K. Rogers, M. Kabrinsky, M. E. Oxley, and B. W. Sutter, "The multilayer perceptron as an approximation to a Bayes optimal discriminant function,"IEEE Trans. Neural Networks, vol. 1, no. 4, pp. 296298, Dec. 1990.
[12] S. Vassiliadis, "Compound instruction set machines," Private Communication, May 1989.
[13] S. Vassiliadis, B. Blaner, and R. Eickemeyer, "On the attributes of the SCISM organization,"Comput. Architecture News, vol. 20, no. 4, pp. 4453, Sept. 1992.
[14] ESA/370 Principles of Operation, IBM Corp., SA2272000, 1989.
[15] S. Vassiliadis, J. Phillips, and B. Blaner, "ICU design considerations," IBM, Endicott, NY, Tech. Rep. TR01.C114, p. 22, Oct. 1991.
[16] S. Vassiliadis and J. Phillips, "Interlock collapsing SCISM ALU design," IBM, Endicott, NY, Tech. Rep. TR01.C115, p. 31, Oct. 1991.
[17] S. Vassiliadis, "Recursive equations for hardwired binary adders,"Int. J. Electron., vol. 67, no. 2, pp. 201213, Aug. 1989.
[18] M. J. Flynn and S. Waser,Introduction to Arithmetic for Digital Systems Designers. CBS College Publishing, 1982, pp. 215222.
[19] S. Vassiliadis, "A comparison between adders with new defined carries and traditional schemes for addition,"Int. J. Electron., vol. 64, no. 4, pp. 617626, Apr. 1988.
[20] H. Ling, "High speed binary adder,"IBM J. Res. Develop., vol. 25, no. 3, pp. 156166, May 1981.
[21] N. T. Quach and M. J. Flynn, "Highspeed addition in CMOS," Standford Univ., Tech. Rep. CSLTR90415, p. 13, Feb. 1990.
[22] B. Olsson, R. Montoye, P. Markstein, and M. Nguyen Phu, "IBM RISC system/6000 floatingpoint unit," IBM Corp., Pub. Order SA232619, pp. 3442, 1990.
[23] L. Dadda, "Some schemes for parallel multipliers,"Alta Frequenza, vol. 34, pp. 349356, May 1965.
[24] S. Vassiliadis, E. Schwarz, and D. J. Hanrahan, "A general proof for overlapped multiplebit scanning multiplications,"IEEE Trans. Comput., vol. 33, no. 2, pp. 172183, Feb. 1989.
[25] S. Vassiliadis, E. Schwarz, and B. M. Sung, "Hardwired multipliers with encoded partial products,"IEEE Trans. Comput., vol. 40, no. 11, pp. 11811197, Nov. 1991.
[26] S. Vassiliadis, D. S. Lemon, and M. Putrino, "S/370 signmagnitude floatingpoint adder,"IEEE J. SolidState Circuits, vol. 24, no. 4, pp. 10621070, Aug. 1989.
[27] N. Malik, R. Eickemeyer, and S. Vassiliadis, "Instructionlevel parallelism for execution interlock collapsing,"Comput. Architecture News, vol. 20, no. 4, pp. 3843, Sept. 1992.
[28] N. Malik, R. J. Eickenmeyer, and S. Vassiliadis, "Interlock collapsing ALU for increased instructionlevel parallelism," inConf Proc. MICRO 25, (Portland, OR), Dec. 1992, pp. 149157.