JOURNAL ARTICLE

Multi-Scale Residual Aggregation Feature Pyramid Network for Object Detection

Hongyang WangTiejun Wang

Year: 2022 Journal:   Electronics Vol: 12 (1)Pages: 93-93   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

The effective use of multi-scale features remains an open problem for object detection tasks. Recently, proposed object detectors have usually used Feature Pyramid Networks (FPN) to fuse multi-scale features. Since Feature Pyramid Networks use a relatively simple feature map fusion approach, it can lead to the loss or misalignment of semantic information in the fusion process. Several works have demonstrated that using a bottom-up structure in a Feature Pyramid Network can shorten the information path between lower layers and the topmost feature, allowing an adequate exchange of semantic information from different layers. We further enhance the bottom-up path by proposing a multi-scale residual aggregation Feature Pyramid Network (MSRA-FPN), which uses a unidirectional cross-layer residual module to aggregate features from multiple layers bottom-up in a triangular structure to the topmost layer. In addition, we introduce a Residual Squeeze and Excitation Module to mitigate the aliasing effects that occur when features from different layers are aggregated. MSRA-FPN enhances the semantic information of the high-level feature maps, mitigates the information decay during feature fusion, and enhances the detection capability of the model for large objects. It is experimentally demonstrated that our proposed MSRA-FPN improves the performance of the three baseline models by 0.5–1.9% on the PASCAL VOC dataset and is also quite competitive with other state-of-the-art FPN methods. On the MS COCO dataset, our proposed method can also improve the performance of the baseline model by 0.8% and the baseline model’s performance for large object detection by 1.8%. To further validate the effectiveness of MSRA-FPN for large object detection, we constructed the Thangka Figure Dataset and conducted comparative experiments. It is experimentally demonstrated that our proposed method improves the performance of the baseline model by 2.9–4.7% on this dataset and can reach up to 71.2%.

Keywords:
Residual Computer science Pyramid (geometry) Feature (linguistics) Artificial intelligence Semantic feature Object detection Pascal (unit) Pattern recognition (psychology) Data mining Algorithm Mathematics

Metrics

10
Cited By
1.11
FWCI (Field Weighted Citation Impact)
38
Refs
0.76
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Multi-scale aggregation feature pyramid with cornerness for underwater object detection

Xinbin LiHaifeng YuHaiyang Chen

Journal:   The Visual Computer Year: 2023 Vol: 40 (2)Pages: 1299-1310
JOURNAL ARTICLE

Pyramid attention object detection network with multi-scale feature fusion

Xiu ChenYujie LiYoshihisa Nakatoh

Journal:   Computers & Electrical Engineering Year: 2022 Vol: 104 Pages: 108436-108436
JOURNAL ARTICLE

Multi‐scale object detection by bottom‐up feature pyramid network

Boya ZhaoZhao Bao-junLinbo TangChen Wu

Journal:   The Journal of Engineering Year: 2019 Vol: 2019 (21)Pages: 7480-7483
© 2026 ScienceGate Book Chapters — All rights reserved.