Design and Implementation of OpenCL-Based FPGA Accelerator for YOLOv2

Chenchen Cui; Fen Ge; Ziyu Li; Xin Yue; Zhou Fang; Ning Wu

doi:10.1109/icct52962.2021.9657856

ScienceGate Book Chapters

JOURNAL ARTICLE

Design and Implementation of OpenCL-Based FPGA Accelerator for YOLOv2

Chenchen Cui Fen Ge Ziyu Li Xin Yue Zhou Fang Ning Wu

Year: 2021 Journal: 2021 IEEE 21st International Conference on Communication Technology (ICCT) Pages: 1004-1007

DOI: 10.1109/icct52962.2021.9657856

Get Full-Text PDF Get Analytical Report

Abstract

Convolutional neural networks (CNNs) are widely used in practical scenarios such as license plate recognition, face recognition and radar image detection, where the main accelerators used are GPU platforms with high power output. However, FPGA is more suitable to be used in some application scenarios because of its own flexibility and a large number of computing resources, and it has lower energy consumption than GPU. In deploying large neural network models, pure hardware development requires a long time cycle. Our work chooses to use a High Level Synthesis tool based on OpenCL for development, which significantly improves the development efficiency and enables fast implementation of the model. The winograd algorithm is also used in the convolutional kernel module to accelerate the convolutional operation. The final verification is completed on the FPGA development board DE5a-Net, where we process a 544*544 format image in 149ms, as well as achieving a peak performance of 248.7 GOP/S.

Keywords:

Computer science Convolutional neural network Field-programmable gate array Kernel (algebra) Process (computing) Flexibility (engineering) Embedded system Computer architecture Artificial intelligence Operating system

Metrics

Cited By

0.25

FWCI (Field Weighted Citation Impact)

Refs

0.65

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

CCD and CMOS Imaging Sensors

Physical Sciences → Engineering → Electrical and Electronic Engineering

Parallel Computing and Optimization Techniques

Physical Sciences → Computer Science → Hardware and Architecture

Design and Implementation of OpenCL-Based FPGA Accelerator for YOLOv2

Abstract

Metrics

Citation History

Topics

Related Documents

A Scalable OpenCL-Based FPGA Accelerator for YOLOv2

An OpenCL-Based FPGA Accelerator for Compressed YOLOv2

FPGA Accelerator Design Using OpenCL

Design and Implementation of a YOLOv2 Accelerator on a Zynq-7000 FPGA

Design of an FPGA-Based FDTD Accelerator Using OpenCL