Accelerating Low Bit-Width Deep Convolution Neural Network in MRAM

Zhezhi He; Shaahin Angizi; Deliang Fan

doi:10.1109/isvlsi.2018.00103

ScienceGate Book Chapters

JOURNAL ARTICLE

Accelerating Low Bit-Width Deep Convolution Neural Network in MRAM

Zhezhi He Shaahin Angizi Deliang Fan

Year: 2018 Vol: 2 Pages: 533-538

DOI: 10.1109/isvlsi.2018.00103

Get Full-Text PDF Get Analytical Report

Abstract

Deep Convolution Neural Network (CNN) has achieved outstanding performance in image recognition over large scale dataset. However, pursuit of higher inference accuracy leads to CNN architecture with deeper layers and denser connections, which inevitably makes its hardware implementation demand more and more memory and computational resources. It can be interpreted as `CNN power and memory wall'. Recent research efforts have significantly reduced both model size and computational complexity by using low bit-width weights, activations and gradients, while keeping reasonably good accuracy. In this work, we present different emerging nonvolatile Magnetic Random Access Memory (MRAM) designs that could be leveraged to implement `bit-wise in-memory convolution engine', which could simultaneously store network parameters and compute low bit-width convolution. Such new computing model leverages the `in-memory computing' concept to accelerate CNN inference and reduce convolution energy consumption due to intrinsic logic-in-memory design and reduction of data communication.

Keywords:

Computer science Convolutional neural network Convolution (computer science) Inference Magnetoresistive random-access memory Reduction (mathematics) Memory management Computer engineering Parallel computing Artificial neural network Computer hardware Algorithm Artificial intelligence Semiconductor memory Random access memory Mathematics

Metrics

Cited By

0.98

FWCI (Field Weighted Citation Impact)

Refs

0.78

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Memory and Neural Computing

Physical Sciences → Engineering → Electrical and Electronic Engineering

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Ferroelectric and Negative Capacitance Devices

Physical Sciences → Engineering → Electrical and Electronic Engineering

Accelerating Low Bit-Width Deep Convolution Neural Network in MRAM

Abstract

Metrics

Citation History

Topics

Related Documents

Bit-width Adaptive Accelerator Design for Convolution Neural Network

Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Network

Accelerating low bit-width convolutional neural networks with embedded FPGA

IMCE: Energy-efficient bit-wise in-memory convolution engine for deep neural network

Application of deep convolution neural network