JOURNAL ARTICLE

Efficient FPGA Implementations of Convolutional Neural Network

Abstract

Convolutional neural networks (CNNs) play a major role in image recognition. This paper proposes efficient field programmable gate array (FPGA) implementations of CNN, where the dual core implementation, parallelism in convolution layer, and partial reconfiguration based constant multiplication in convolution layer achieve less cycles, less delay, and less resources respectively. All the proposed designs are implemented on Zed Board (Zynq 7000 FPGA (XC7Z020CLG484-1)) with Xilinx Vivado. The synthesis results show that the proposed parallel convolution based single core implementation of CNN with Modified National Institute of Standards and Technology (MNIST) data set achieves 89% of reduction in delay for 10 tasks as compared with the conventional convolution based single core implementation with a co-processor.

Keywords:
Field-programmable gate array Computer science Convolutional neural network Implementation Computer architecture Artificial neural network Artificial intelligence Embedded system Programming language

Metrics

3
Cited By
0.77
FWCI (Field Weighted Citation Impact)
11
Refs
0.74
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Neural Networks and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.