JOURNAL ARTICLE

Two-Layer Attention Feature Pyramid Network for Small Object Detection

Sheng XiangJunhao MaShang Qun-liXianbao WangDefu Chen

Year: 2024 Journal:   Computer Modeling in Engineering & Sciences Vol: 141 (1)Pages: 713-731   Publisher: Tech Science Press

Abstract

Effective small object detection is crucial in various applications including urban intelligent transportation and pedestrian detection.However, small objects are difficult to detect accurately because they contain less information.Many current methods, particularly those based on Feature Pyramid Network (FPN), address this challenge by leveraging multi-scale feature fusion.However, existing FPN-based methods often suffer from inadequate feature fusion due to varying resolutions across different layers, leading to suboptimal small object detection.To address this problem, we propose the Two-layer Attention Feature Pyramid Network (TA-FPN), featuring two key modules: the Two-layer Attention Module (TAM) and the Small Object Detail Enhancement Module (SODEM).TAM uses the attention module to make the network more focused on the semantic information of the object and fuse it to the lower layer, so that each layer contains similar semantic information, to alleviate the problem of small object information being submerged due to semantic gaps between different layers.At the same time, SODEM is introduced to strengthen the local features of the object, suppress background noise, enhance the information details of the small object, and fuse the enhanced features to other feature layers to ensure that each layer is rich in small object information, to improve small object detection accuracy.Our extensive experiments on challenging datasets such as Microsoft Common Objects in Context (MS COCO) and Pattern Analysis Statistical Modelling and Computational Learning, Visual Object Classes (PASCAL VOC) demonstrate the validity of the proposed method.Experimental results show a significant improvement in small object detection accuracy compared to state-of-theart detectors.

Keywords:
Pyramid (geometry) Feature (linguistics) Layer (electronics) Object (grammar) Artificial intelligence Computer science Pattern recognition (psychology) Object detection Computer vision Materials science Mathematics Nanotechnology Geometry

Metrics

1
Cited By
0.53
FWCI (Field Weighted Citation Impact)
60
Refs
0.53
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Industrial Vision Systems and Defect Detection
Physical Sciences →  Engineering →  Industrial and Manufacturing Engineering
Infrared Target Detection Methodologies
Physical Sciences →  Engineering →  Aerospace Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.