BitBlade: Energy-Efficient Variable Bit-Precision Hardware Accelerator for Quantized Neural Networks

Sungju Ryu; Hyungjun Kim; Wooseok Yi; Eunhwan Kim; Yulhwa Kim; Taesu Kim; Jae‐Joon Kim

doi:10.1109/jssc.2022.3141050

ScienceGate Book Chapters

JOURNAL ARTICLE

BitBlade: Energy-Efficient Variable Bit-Precision Hardware Accelerator for Quantized Neural Networks

Sungju Ryu Hyungjun Kim Wooseok Yi Eunhwan Kim Yulhwa Kim Taesu Kim Jae‐Joon Kim

Year: 2022 Journal: IEEE Journal of Solid-State Circuits Vol: 57 (6)Pages: 1924-1935 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/jssc.2022.3141050

Get Full-Text PDF Get Analytical Report

Abstract

We introduce an area/energy-efficient precision-scalable neural network accelerator architecture. Previous precision-scalable hardware accelerators have limitations such as the under-utilization of multipliers for low bit-width operations and the large area overhead to support various bit precisions. To mitigate the problems, we first propose a bitwise summation, which reduces the area overhead for the bit-width scaling. In addition, we present a channel-wise aligning scheme (CAS) to efficiently fetch inputs and weights from on-chip SRAM buffers and a channel-first and pixel-last tiling (CFPL) scheme to maximize the utilization of multipliers on various kernel sizes. A test chip was implemented in 28-nm CMOS technology, and the experimental results show that the throughput and energy efficiency of our chip are up to 7.7 $\times $ and 1.64 $\times $ higher than those of the state-of-the-art designs, respectively. Moreover, additional 1.5–3.4 $\times $ throughput gains can be achieved using the CFPL method compared to the CAS.

Keywords:

Scalability Computer science Overhead (engineering) Computer hardware Chip Throughput Algorithm Energy (signal processing) Hardware acceleration Arithmetic Parallel computing Mathematics Field-programmable gate array Programming language

Metrics

Cited By

5.94

FWCI (Field Weighted Citation Impact)

Refs

0.96

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Ferroelectric and Negative Capacitance Devices

Physical Sciences → Engineering → Electrical and Electronic Engineering

Advanced Memory and Neural Computing

Physical Sciences → Engineering → Electrical and Electronic Engineering

BitBlade: Energy-Efficient Variable Bit-Precision Hardware Accelerator for Quantized Neural Networks

Abstract

Metrics

Citation History

Topics

Related Documents

UNPU: An Energy-Efficient Deep Neural Network Accelerator With Fully Variable Weight Bit Precision

Bit-Slicing FPGA Accelerator for Quantized Neural Networks

Quantized deep neural networks for energy efficient hardware-based inference

Hardware for Quantized Mixed-Precision Deep Neural Networks

PNeuro: A scalable energy-efficient programmable hardware accelerator for neural networks