loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Overcoming Early-Life Failure and Aging for Robust Systems
November/December 2009 (vol. 26 no. 6)
pp. 28-39
Yanjing Li, Stanford University
Young Moon Kim, Stanford University
Evelyn Mintarno, Stanford University
Subhasish Mitra, Stanford University

Editor's note:

The prospect of system failure has increased because of device- and chip-level effects in the late CMOS era, such as early-life failure and NBTI. In this article, the authors present novel system-level architecture and design innovations to cope with these lifetime reliability challenges.

—Pradip Bose, IBM Research

1. B. Schroeder and G. Gibson, "Understanding Failures in Petascale Computers," J. Physics: Conference Series (SciDAC 07), IOP Publishing, 2007, vol. 78, article 12022.
2. S. Shazli et al., "A Field Analysis of System-Level Effects of Soft Errors Occurring in Microprocessors Used in Information Systems," Proc. Int'l Test Conf. (ITC 08), IEEE CS Press, 2008, pp. 1-10.
3. S. Borkar, "Designing Reliable Systems from Unreliable Components," IEEE Micro, vol. 25, no. 6, 2005, pp. 10-16.
4. S. Mitra et al., "Robust System Design with Built-In Soft Error Resilience," Computer, vol. 38, no. 2, 2005, pp. 43-52.
5. J.M. Carulli Jr. and T.J. Anderson, "The Impact of Multiple Failure Modes on Estimating Product Field Reliability," IEEE Design &Test, vol. 23, no. 2, 2006, pp. 118-126.
6. R. Zheng et al., "Circuit Aging Prediction for Low Power Operation," Proc. IEEE Custom Integrated Circuits Conf. (CICC 09), IEEE Press, 2009.
7. S. Mitra and E.J. McCluskey, "Which Concurrent Error Detection Schemes to Choose?" Proc. Int'l Test Conf. (ITC 00), IEEE CS Press, 2000, pp. 985-994.
8. M. Agarwal et al., "Circuit Failure Prediction and Its Application to Transistor Aging," Proc. 25th IEEE VLSI Test Symp. (VTS 07), IEEE CS Press, 2009, pp. 277-286.
9. H. Baba and S. Mitra, "Testing for Transistor Aging," Proc. 27th IEEE VLSI Test Symp. (VTS 09), IEEE CS Press, 2009, pp. 215-220.
10. T.W. Chen et al., "Experimental Study of Gate-Oxide Early Life Failures," Proc. Int'l Reliability Physics Symp. (IRPS 09), IEEE Press, 2009, pp. 650-658.
11. Y. Li, S. Makar, and S. Mitra, "CASP: Concurrent Autonomous Chip Self-Test Using Stored Test Patterns," Proc. IEEE Design, Automation, and Test in Europe (DATE 08), IEEE CS Press, 2008, pp. 885-890.
12. H. Inoue, Y. Li, and S. Mitra, "VAST: Virtualization-Assisted Concurrent Autonomous Self-Test," Proc. Int'l Test Conf. (ITC 08), IEEE CS Press, 2008, pp. 1-10.
13. P. Meaney et al., "IBM z990 Soft Error Detection and Recovery," IEEE Trans. Device and Materials Reliability, vol. 5, no. 3, 2005, pp. 419-427.

Index Terms:
design and test, failure prediction, online self-test and diagnostics, reliability, robust-system design, self-healing
Citation:
Yanjing Li, Young Moon Kim, Evelyn Mintarno, Donald S. Gardner, Subhasish Mitra, "Overcoming Early-Life Failure and Aging for Robust Systems," IEEE Design and Test of Computers, vol. 26, no. 6, pp. 28-39, Nov./Dec. 2009, doi:10.1109/MDT.2009.152
Usage of this product signifies your acceptance of the Terms of Use.