
This Article  
 
Share  
Bibliographic References  
Add to:  
Digg Furl Spurl Blink Simpy Del.icio.us Y!MyWeb  
Search  
 
ASCII Text  x  
R.K. Sitaraman, N.K. Jha, "Optimal Design of Checks for Error Detection and Location in FaultTolerant Multiprocessor Systems," IEEE Transactions on Computers, vol. 42, no. 7, pp. 780793, July, 1993.  
BibTex  x  
@article{ 10.1109/12.237719, author = {R.K. Sitaraman and N.K. Jha}, title = {Optimal Design of Checks for Error Detection and Location in FaultTolerant Multiprocessor Systems}, journal ={IEEE Transactions on Computers}, volume = {42}, number = {7}, issn = {00189340}, year = {1993}, pages = {780793}, doi = {http://doi.ieeecomputersociety.org/10.1109/12.237719}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, }  
RefWorks Procite/RefMan/Endnote  x  
TY  JOUR JO  IEEE Transactions on Computers TI  Optimal Design of Checks for Error Detection and Location in FaultTolerant Multiprocessor Systems IS  7 SN  00189340 SP780 EP793 EPD  780793 A1  R.K. Sitaraman, A1  N.K. Jha, PY  1993 KW  error location; optimal design of checks; error detection; faulttolerant multiprocessor systems; RANDGEN; arbitrary datacheck; algorithmbased fault tolerance; majority diagnosability; UNIFGEN; uniform checks; error detection; fault tolerant computing; multiprocessing systems. VL  42 JA  IEEE Transactions on Computers ER   
RANDGEN, a simple and efficient generalpurpose algorithm for generating arbitrary datacheck (DC) graphs with a small number of checks, which satisfy a variety of properties that have been found to be useful in algorithmbased fault tolerance (ABFT) designs, is proposed. The concept of majority diagnosability is introduced in an attempt to explicitly redesign DC graphs for easy diagnosis. UNIFGEN, a variation of RANDGEN that produces DC graphs with uniform checks is examined.
[1] J. A. Abrahamet al., "Fault tolerance techniques for systolic arrays,"IEEE Comput. Mag., vol. 20, pp. 6574, July 1987.
[2] B. Bollobas,Random Graphs. New York: Academic Press, 1985.
[3] P. Banerjeeet al., "An evaluation of systemlevel fault tolerance on the intel hypercube multiprocessor," inProc. 18th Int. Symp. FaultTolerant Comput., 1988, pp. 362367.
[4] P. Banerjee and J. A. Abraham, "Bounds on algorithmbased fault tolerance in multiple processor systems,"IEEE Trans. Comput., vol. C35, pp. 296306, Apr. 1986.
[5] P. Banerjee and J. A. Abraham, "A probabilistic model of algorithmbased fault tolerance in array processors for realtime systems," inProc. RealTime Systems Symp., 1986, pp. 7278.
[6] H. Chernoff, "A measure of asymptotic efficiency for tests of a hypothesis based on the sum of observations,"Annals of Math. Stat., vol. 23, pp. 493509, 1952.
[7] CY. Chen and J. A. Abraham, "Faulttolerant systems for the computation of eigenvalues and singular values," inProc. SPIE Adv. Alg. Arch. Signal Proc., vol. 696, pp. 228237, Aug. 1986.
[8] YH. Choi and M. Malek, "A fault tolerant FFT processor,"IEEE Trans. Comput., vol. C37, pp. 617621, May 1988.
[9] YH. Choi and M. Malek, "A fault tolerant systolic sorter,"IEEE Trans. Comput., vol. C37, pp. 621624, May 1988.
[10] P. Erdos and J. Spencer,The Probabilistic Method in Combinatorics. New York: Academic Press, 1974.
[11] W. Feller,An Introduction to Probability Theory and its Applications, vol. I. New York: John Wiley, 1968.
[12] D. Gu, J. Rosenkrantz, and S. S. Ravi, "Design and analysis of test schemes for algorithmbased fault tolerance," inProc. 20th Int. Symp. FaultTolerant Comput., Newcastle, England, June 2628, 1990, pp. 106113.
[13] KH. Huang and J. A. Abraham, "Algorithmbased fault tolerance for matrix operations,"IEEE Trans. Comput., vol. C33, pp. 518528, June 1984.
[14] JY. Jou and J. A. Abraham, "Fault tolerant matrix arithmetic and signal processing on highly concurrent computing structures,"Proc. IEEE, vol. 74, pp. 732741, May 1986.
[15] JY. Jou and J. A. Abraham, "Fault tolerant FFT networks,"IEEE Trans. Comput., vol. C37, pp. 548561, May 1988.
[16] F. T. Luk, "Algorithmbased fault tolerance for parallel matrix equations solvers," inProc. SPIE Real Time Signal Proc., vol. 564, Aug. 1985, pp. 4953.
[17] F. T. Luk and H. Park, "An analysis of algorithmbased fault tolerance techniques," inProc. SPIE Adv. Alg. Arch. Signal Proc., vol. 696, pp. 222228, Aug. 1986.
[18] F. T. Luk and H. Park, "Fault tolerant matrix triangularizations on systolic arrays,"IEEE Trans. Comput., vol. C37, pp. 14341438, Nov. 1988.
[19] V. S. S. Nair and J. A. Abraham, "A model for the analysis of fault tolerant signal processing architectures," inProc. 32nd Int. Tech. Symp. SPIE, San Diego, Aug. 1988, pp. 246257.
[20] V. S. S. Nair and J. A. Abraham, "A model for the analysis, design and comparison of faulttolerant WSI architectures," inProc. Workshop on Wafer Scale Integration, Como, Italy, June 1989.
[21] V. S. S. Nair and J. A. Abraham, "Hierarchical design and analysis of faulttolerant multiprocessor systems using concurrent error detection," inProc. 20th Int. Symp. FaultTolerant Comput., (FTCS20), Newcastle upon Tyne, June 1990, pp. 130137.
[22] P. Raghavan, Lecture notes on randomized algorithms, IBM Tech. Rep. RC15340, T. J. Watson Research Center, Yorktown Heights, NY, Jan. 1990, pp. 5155.
[23] A. L. N. Reddy and P. Banerjee, "Algorithmbased fault detection for signal processing applications,"IEEE Trans. Comput., vol. C39, pp. 13041308, Oct. 1990.
[24] D. J. Rosenkrantz and S. S. Ravi, "Improved upper bounds for algorithmbased fault tolerance," inProc. 26th Allerton Conf. Comm. Cont. Comput., Allerton, IL, Sept. 1988, pp. 388397.
[25] D. L. Tao, C. R. P. Hartmann, and Y. S. Chen, "A novel concurrent error detection scheme for FFT networks," inProc. Int. Symp. Fault Tolerant Comput., NewcastleuponTyne, U.K., June 1990, pp. 114121.
[26] B. Vinnakota and N. K. Jha, "Diagnosability and diagnosis of algorithmbased fault tolerant systems," accepted for publication inIEEE Trans. Comput.
[27] B. Vinnakota and N. K. Jha, "A dependence graphbased approach to the design of algorithmbased fault tolerant systems," inProc. Int. Symp. Fault Tolerant Comput., NewcastleuponTyne, U.K., June 1990, pp. 122129.
[28] B. Vinnakota and N. K. Jha, "Design of multiprocessor systems for concurrent error detection and fault diagnosis," inProc. Int. Symp. Fault Tolerant Comput., Montreal, June 1991.
[29] J. D. Russel and C. R. Kime, "System fault diagnosis: Closure and diagnosability with repair,"IEEE Trans. Comput., vol. C24, pp. 10781089, Nov. 1975.
[30] SJ. Wang and N. K. Jha, "Algorithmbased fault tolerance for FFT networks," inProc. Int. Symp. Circuits Systems, San Diego, May 1992.