A 8.93-TOPS/W LSTM Recurrent Neural Network Accelerator Featuring Hierarchical Coarse-Grain Sparsity With All Parameters Stored On-Chip

Deepak Kadetotad; Visar Berisha; Chaitali Chakrabarti; Jae-sun Seo

doi:10.1109/lssc.2019.2936761

JOURNAL ARTICLE

Year: 2019; Journal: IEEE Solid-State Circuits Letters; Vol: 2 (9); Pages: 119-122; Publisher: Institute of Electrical and Electronics Engineers

Abstract

Long short-term memory (LSTM) networks are widely used for speech applications but pose difficulties for efficient implementation on hardware due to large weight storage requirements. We present an energy-efficient LSTM recurrent neural network (RNN) accelerator, featuring an algorithm-hardware co-optimized memory compression technique called hierarchical coarse-grain sparsity (HCGS). Aided by HCGS-based block-wise recursive weight compression, we demonstrate LSTM networks with up to 16× fewer weights while achieving minimal accuracy loss. The prototype chip fabricated in 65-nm LP CMOS achieves 8.93/7.22 TOPS/W for 2-/3-layer LSTM RNNs trained with HCGS for TIMIT/TED-LIUM corpora.
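The abstract does not give the block sizes or the rule used to select blocks, so the following is only a minimal sketch of how a hierarchical, block-wise sparsity mask could reach a 16× weight reduction. It assumes two levels, each keeping one quarter of the blocks in every block row (4× per level, 16× combined). The function name `hcgs_mask`, the block sizes (64 and 16), the keep fractions, and the random selection are all hypothetical choices for illustration, not details taken from the paper.

```python
import random


def hcgs_mask(rows, cols, block_sizes, keep_fracs, rng):
    """Build a 0/1 mask with hierarchical block-wise sparsity (illustrative only).

    At each level, every block row keeps a fixed fraction of its blocks;
    each kept block is then subdivided by the next level. Assumes rows and
    cols are multiples of block_sizes[0], and each block size is a multiple
    of the next one.
    """
    mask = [[0] * cols for _ in range(rows)]
    block, frac = block_sizes[0], keep_fracs[0]
    n_block_cols = cols // block
    n_keep = max(1, round(n_block_cols * frac))
    for r0 in range(0, rows, block):
        # Randomly pick which blocks survive in this block row (assumption).
        for bc in rng.sample(range(n_block_cols), n_keep):
            c0 = bc * block
            if len(block_sizes) == 1:
                sub = [[1] * block for _ in range(block)]  # dense leaf block
            else:
                # Recurse: apply the finer level inside the kept block.
                sub = hcgs_mask(block, block, block_sizes[1:], keep_fracs[1:], rng)
            for i in range(block):
                mask[r0 + i][c0:c0 + block] = sub[i]
    return mask


rng = random.Random(0)
mask = hcgs_mask(256, 256, block_sizes=(64, 16), keep_fracs=(0.25, 0.25), rng=rng)
kept = sum(map(sum, mask))
print(256 * 256 / kept)  # compression factor: 16.0
```

Because whole blocks are kept or dropped, only the indices of the surviving blocks need to be stored alongside the weights, which is the usual motivation for coarse-grain over element-wise sparsity.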

Keywords: Computer science; Recurrent neural network; Block (permutation group theory); Chip; Artificial neural network; Computer hardware; TIMIT; TOPS; Energy (signal processing); Algorithm; Artificial intelligence; Materials science; Mathematics; Telecommunications

Metrics

FWCI (Field Weighted Citation Impact): 1.08

Citation Normalized Percentile: 0.83

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Neural Networks and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

An 8.93 TOPS/W LSTM Recurrent Neural Network Accelerator Featuring Hierarchical Coarse-Grain Sparsity for On-Device Speech Recognition

Compressing LSTM Networks with Hierarchical Coarse-Grain Sparsity

SPIKA: 200-TOPS/W RRAM-based Neural Network Accelerator Chip

A 62.45 TOPS/W Spike-Based Convolution Neural Network Accelerator with Spatiotemporal Parallel Data Flow and Sparsity Mechanism