Convolutional Neural Networks (CNNs) have demonstrated remarkable performance in artificial intelligence (AI) systems. However, CNNs often require tens or even hundreds of layers with millions of parameters to achieve state-of-the-art accuracy, which hinders deployment in resource-limited scenarios. Meanwhile, the parameters and feature-map data are usually sparse, which leads to useless computation as well as unbalanced workloads. To solve these problems, we propose a computation-efficient hardware architecture. To reduce computational redundancy, we filter out zero-valued weights and zero-valued feature maps. To reduce redundant memory consumption, we propose a memory-division and data-reuse mechanism. To resolve load imbalance, we implement a near-zero-cost scheduling-switching strategy. Experimental results show that our architecture saves, on average, 22.6% of memory access time and 60.5% of computing time compared with a state-of-the-art NN accelerator.
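The zero-filtering idea above can be illustrated in software. The sketch below is not the paper's hardware architecture: it is a minimal, assumed 1-D convolution in Python that skips a multiply-accumulate (MAC) whenever either operand is zero, which is the arithmetic saving that zero-skipping hardware exploits on sparse weights and activations.

```python
import numpy as np

def sparse_conv1d(inputs, weights):
    """Naive 1-D convolution (cross-correlation) that skips MACs with a
    zero-valued weight or activation. Illustrative only: real accelerators
    operate on compressed index streams, not Python loops."""
    out_len = len(inputs) - len(weights) + 1
    out = np.zeros(out_len)
    macs = 0  # multiply-accumulates actually performed
    for i in range(out_len):
        acc = 0.0
        for j, w in enumerate(weights):
            x = inputs[i + j]
            if w == 0.0 or x == 0.0:
                continue  # product would be zero: skip the useless MAC
            acc += w * x
            macs += 1
        out[i] = acc
    return out, macs

x = np.array([1.0, 0.0, 2.0, 0.0, 3.0])  # sparse feature map
w = np.array([1.0, 0.0, 1.0])            # sparse weights
out, macs = sparse_conv1d(x, w)
dense_macs = (len(x) - len(w) + 1) * len(w)
print(out, macs, dense_macs)  # only 4 of 9 dense MACs are performed
```

The result is identical to the dense convolution; only the MAC count changes, which is why this optimization saves time and energy without affecting accuracy.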
Yan-jie Gu, Jian Yu, Tieli Sun, Chen Pan, Zhenhao Feng, Liewei Xu, Chang Wu
Hao Xiao, Kaikai Zhao, Guangzhu Liu
Xueming Li, Hongming Huang, Taosheng Chen, Huaien Gao, Xianghong Hu, Xiaoming Xiong