JOURNAL ARTICLE

Apply Yolov4-Tiny on an FPGA-Based Accelerator of Convolutional Neural Network for Object Detection

Fengxi ZhangYuying LiZhihao Ye

Year: 2022 Journal:   Journal of Physics Conference Series Vol: 2303 (1)Pages: 012032-012032   Publisher: IOP Publishing

Abstract

Abstract With the continuous expansion of Neural Network technology in the artificial intelligence field, for example, image recognition and retrieval, object detection, pixel processing, automatic speech generation, etc., Convolutional Neural Networks (CNN) and Deep Learning of Neural Networks (DNN) have made apparent breakthroughs. To improve the inference speed of images, the combination of FPGA-based acceleration and multiple model quantization methods has become one of the most contemporary alternative methods. This paper designed an FPGA-Based acceleration scheme combining software and hardware and effectively applied it to the Yolov4-Tiny object detection model, realizing the accelerated detection inference process from the original 6-7mins to 383ms. First, it chose the static quantization method of fixed-point numbers, fixed the position of the decimal point, and then added Batch Norm between the convolutional layer and the activation function to form a connection structure. Second, it further improved inference speed on an FPGA with a version of ZYNQ-7020 by increasing the bandwidth cap and reducing bandwidth requirements, employing a massive pipeline design. Finally, in the test of the Coco dataset, the plan has completed a substantial acceleration of the average inference speed of the Yolov4-Tiny object detection model from 7.13mins/Picture to 498.89ms/Picture, which has a high application value in the field of object detection. It dramatically improves the inference speed as well as keeps the average accuracy above 0.95.

Keywords:
Computer science Convolutional neural network Field-programmable gate array Object detection Inference Artificial intelligence Quantization (signal processing) Artificial neural network Deep learning Speedup Hardware acceleration Activation function Computer engineering Computer vision Pattern recognition (psychology) Computer hardware Parallel computing

Metrics

7
Cited By
0.87
FWCI (Field Weighted Citation Impact)
21
Refs
0.70
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
CCD and CMOS Imaging Sensors
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Industrial Vision Systems and Defect Detection
Physical Sciences →  Engineering →  Industrial and Manufacturing Engineering

Related Documents

JOURNAL ARTICLE

Design of Yolov4-Tiny convolutional neural network hardware accelerator based on FPGA

Wan DuShang-Zhi ChenLei WangRifai Chai

Journal:   Journal of Physics Conference Series Year: 2024 Vol: 2849 (1)Pages: 012005-012005
JOURNAL ARTICLE

An FPGA-Based Reconfigurable Convolutional Neural Network Accelerator for Tiny YOLO-V3

Tsung‐Han TsaiNai-Chieh TungChun‐Yu Chen

Journal:   Circuits Systems and Signal Processing Year: 2025 Vol: 44 (5)Pages: 3388-3409
JOURNAL ARTICLE

Marine Object Detection using YOLOv4 Adapted Convolutional Neural Network

Muhammad Daniyal BaigHafiz Burhan Ul Haq

Journal:   Decision Making Advances Year: 2024 Vol: 2 (1)Pages: 83-91
JOURNAL ARTICLE

FPGA-based Accelerator for Convolutional Neural Network

YU Zijian,MA De,YAN Xiaolang,SHEN Juncheng

Journal:   DOAJ (DOAJ: Directory of Open Access Journals) Year: 2017
© 2026 ScienceGate Book Chapters — All rights reserved.