JOURNAL ARTICLE

Exploring Soft-Error Robust and Energy-Efficient Register File in GPGPUs using Resistive Memory

Jingweijia TanZhi LiMingsong ChenXin Fu

Year: 2016 Journal:   ACM Transactions on Design Automation of Electronic Systems Vol: 21 (2)Pages: 1-25   Publisher: Association for Computing Machinery

Abstract

The increasing adoption of graphics processing units (GPUs) for high-performance computing raises the reliability challenge, which is generally ignored in traditional GPUs. GPUs usually support thousands of parallel threads and require a sizable register file. Such large register file is highly susceptible to soft errors and power-hungry. Although ECC has been adopted to register file in modern GPUs, it causes considerable power overhead, which further increases the power stress. Thus, an energy-efficient soft-error protection mechanism is more desirable. Besides its extremely low leakage power consumption, resistive memory (e.g., spin-transfer torque RAM) is also immune to the radiation induced soft errors due to its magnetic field based storage. In this article, we propose to LEverage reSistive memory to enhance the Soft-error robustness and reduce the power consumption (LESS) of registers in the General-Purpose computing on GPUs (GPGPUs). Since resistive memory experiences longer write latency compared to SRAM, we explore the unique characteristics of GPGPU applications to obtain the win-win gains: achieving the near-full soft-error protection for the register file, and meanwhile substantially reducing the energy consumption with negligible performance degradation. Our experimental results show that LESS is able to mitigate the registers soft-error vulnerability by 86% and achieve 61% energy savings with negligible (e.g., 1%) performance degradation.

Keywords:
Register file Computer science Soft error Efficient energy use Energy consumption Parallel computing General-purpose computing on graphics processing units Graphics Instruction set Operating system Electronic engineering

Metrics

7
Cited By
0.58
FWCI (Field Weighted Citation Impact)
42
Refs
0.68
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Parallel Computing and Optimization Techniques
Physical Sciences →  Computer Science →  Hardware and Architecture
Advanced Memory and Neural Computing
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Radiation Effects in Electronics
Physical Sciences →  Engineering →  Electrical and Electronic Engineering

Related Documents

JOURNAL ARTICLE

Soft-Error Reliability and Power Co-Optimization for GPGPUs Register File using Resistive Memory

Jingweijia TanZhi LiXin Fu

Journal:   Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015 Year: 2015 Pages: 369-374
JOURNAL ARTICLE

Soft-error reliability and power co-optimization for GPGPUS register file using resistive memory

Jingweijia TanZhi LiXin Fu

Journal:   Design, Automation, and Test in Europe Year: 2015 Pages: 369-374
JOURNAL ARTICLE

Emerging technology enabled energy-efficient GPGPUs register file

Chenhao XieJingweijia TanMingsong ChenYi YangLu PengXin Fu

Journal:   Microprocessors and Microsystems Year: 2017 Vol: 50 Pages: 175-188
© 2026 ScienceGate Book Chapters — All rights reserved.