Partitioning Techniques for Partially Protected Caches to Reduce Soft Error Induced Failures

pdf ppt

Kyoungwoo Lee, Aviral Shrivastava, Nikil Dutt, and Nalini Venkatasubramanian

DIPES 2008: IFIP Conference on Distributed and Parallel Embedded Systems

Abstract: Exponentially increasing with technology scaling, soft errors have become a serious design concern in the deep sub-micron embedded systems. Partially Protected Cache (PPC) is a promising microarchitectural feature to mitigate failures due to soft errors in embedded processors. A processor with PPC maintains two caches, one protected and the other unprotected, both at the same level of memory hierarchy. The intuition behind PPC is that some data in the application is more prone to soft errors than others. By finding out the data more prone to soft errors and mapping only that to the protected cache, the failure rate can be significantly improved at minimal power and performance penalty. While the effectiveness of PPCs has been demonstrated on multimedia applications, where the multimedia data is inherently resilient to soft errors, no such obvious data partitioning exists for applications in general. This severely restricts the applicability of PPCs. This paper proposes profile-based data partitioning schemes that are applicable to applications in general andeffectively reduce failures due to soft errors at minimal power and performance overheads. Our experimental results on HP iPAQ-like processor and memory configuration demonstrate that our algorithm efficiently reduces the failure rate by 47X on benchmarks from MiBench while incurring only 0.5% performance and 15% power overheads.


Compiler and Microarchitecture Lab,
Department of Computer Science and Engineering,
School of Computing and Informatics,
Arizona State University, Tempe, AZ 85281.
Center For Embedded Computer Systems,
Department of Information and Computer Science,
University of California, Irvine.