Accelerating Low Bit-width Neural Networks at the Edge, PIM or FPGA: A Comparative Study

Nakul Kochar; Lucas Ekiert; Deniz Najafi; Deliang Fan; Shaahin Angizi

doi:10.1145/3583781.3590213

ScienceGate Book Chapters

JOURNAL ARTICLE

Accelerating Low Bit-width Neural Networks at the Edge, PIM or FPGA: A Comparative Study

Nakul Kochar Lucas Ekiert Deniz Najafi Deliang Fan Shaahin Angizi

Year: 2023 Pages: 625-630

DOI: 10.1145/3583781.3590213

Get Full-Text PDF Get Analytical Report

Abstract

Deep Neural Network (DNN) acceleration with digital Processing-in-Memory (PIM) platforms at the edge is an actively-explored domain with great potential to not only address memory-wall bottlenecks but to offer orders of performance improvement in comparison to the von-Neumann architecture. On the other side, FPGA-based edge computing has been followed as a potential solution to accelerate compute-intensive workloads. In this work, adopting low-bit-width neural networks, we perform a solid and comparative inference performance analysis of a recent processing-in-SRAM tape-out with a low-resource FPGA board and a high-performance GPU to provide a guideline for the research community. We explore and highlight the key architectural constraints of these edge candidates that impact their overall performance. Our experimental data demonstrate that the processing-in-SRAM can obtain up to ~160x speed-up and up to 228x higher efficiency (img/s/W) compared to the under-test FPGA on the CIFAR-10 dataset.

Keywords:

Computer science Field-programmable gate array Static random-access memory Edge device Edge computing Enhanced Data Rates for GSM Evolution Artificial neural network Computer architecture Key (lock) Embedded system Computer engineering Parallel computing Computer hardware Artificial intelligence Operating system Cloud computing

Metrics

Cited By

0.17

FWCI (Field Weighted Citation Impact)

Refs

0.42

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Memory and Neural Computing

Physical Sciences → Engineering → Electrical and Electronic Engineering

Ferroelectric and Negative Capacitance Devices

Physical Sciences → Engineering → Electrical and Electronic Engineering

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Accelerating Low Bit-width Neural Networks at the Edge, PIM or FPGA: A Comparative Study

Abstract

Metrics

Citation History

Topics

Related Documents

Accelerating low bit-width convolutional neural networks with embedded FPGA

Accelerating Deep Neural Networks Using FPGA

Accelerating Deep Neural Networks in FPGA SoC

Accelerating Low Bit-Width Deep Convolution Neural Network in MRAM

FastWave: Accelerating Autoregressive Convolutional Neural Networks on FPGA