JOURNAL ARTICLE

Design and Implementation of OpenCL-Based FPGA Accelerator for YOLOv2

Chenchen CuiFen GeZiyu LiXin YueZhou FangNing Wu

Year: 2021 Journal:   2021 IEEE 21st International Conference on Communication Technology (ICCT) Pages: 1004-1007

Abstract

Convolutional neural networks (CNNs) are widely used in practical scenarios such as license plate recognition, face recognition and radar image detection, where the main accelerators used are GPU platforms with high power output. However, FPGA is more suitable to be used in some application scenarios because of its own flexibility and a large number of computing resources, and it has lower energy consumption than GPU. In deploying large neural network models, pure hardware development requires a long time cycle. Our work chooses to use a High Level Synthesis tool based on OpenCL for development, which significantly improves the development efficiency and enables fast implementation of the model. The winograd algorithm is also used in the convolutional kernel module to accelerate the convolutional operation. The final verification is completed on the FPGA development board DE5a-Net, where we process a 544*544 format image in 149ms, as well as achieving a peak performance of 248.7 GOP/S.

Keywords:
Computer science Convolutional neural network Field-programmable gate array Kernel (algebra) Process (computing) Flexibility (engineering) Embedded system Computer architecture Artificial intelligence Operating system

Metrics

4
Cited By
0.25
FWCI (Field Weighted Citation Impact)
10
Refs
0.65
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
CCD and CMOS Imaging Sensors
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Parallel Computing and Optimization Techniques
Physical Sciences →  Computer Science →  Hardware and Architecture
© 2026 ScienceGate Book Chapters — All rights reserved.