JOURNAL ARTICLE

Yolo-Based Lightweight Object Detection With Structure Simplification And Attention Enhancement

Abstract

In this paper we propose a lightweight object detector by optimizing the structure of YOLOv3. The optimization is conducted in two aspects: simplifying the structural components by lightweight substitutes and introducing the attention mechanism to increase the detection accuracy. For the simplification, we remodel the backbone based on MobileNet v2 and replace every 3×3 convolution in the detection neck and head by the fusion of a 3 × 3 depthwise separable convolution and a squeeze and excitation block (DSConv+SE); for the attention enhancement, we introduce the high-frequency wavelets of the original image to the input, a simplified non- local block to the simplified backbone and convolutional block attention modules to the simplified detection neck. In addition, local 3×3 convolution branches are introduced to the simplified backbone for enhanced learning capability. Experiments demonstrate that the proposed detector outperforms each compared state-of-the-art work in one or more aspects.

Keywords:
Convolution (computer science) Block (permutation group theory) Computer science Object detection Detector Artificial intelligence Object (grammar) Wavelet Separable space Algorithm Convolutional neural network Computer vision Pattern recognition (psychology) Mathematics Artificial neural network Telecommunications

Metrics

6
Cited By
1.09
FWCI (Field Weighted Citation Impact)
29
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.