JOURNAL ARTICLE

Bidirectional Matrix Feature Pyramid Network for Object Detection

Abstract

Feature pyramids are widely used to improve scale invariance in object detection. Most methods simply map objects to feature maps with square receptive fields and rarely account for aspect ratio variation, which is also an important property of object instances. This leads to a poor match between rectangular objects and their assigned features with square receptive fields, which hinders accurate recognition and localization. In addition, information propagation among feature layers is sparse: each feature in the pyramid may mainly or only contain single-level information, which is not representative enough for the classification and localization sub-tasks. In this paper, a Bidirectional Matrix Feature Pyramid Network (BMFPN) is proposed to address these issues. It consists of three modules: a Diagonal Layer Generation Module (DLGM), a Top-down Module (TDM) and a Bottom-up Module (BUM). First, multi-level features extracted by the backbone are fed into the DLGM to produce the base features. These base features are then used to construct the final feature pyramid through the TDM and BUM in series. The receptive fields of the feature layers in BMFPN cover various scales and aspect ratios, so each object can be assigned to an appropriate and representative feature map with a relevant receptive field according to its scale and aspect ratio. Moreover, the TDM and BUM form a bidirectional, reticular information flow that fuses multi-level information in top-down and bottom-up manners, respectively. To evaluate the proposed architecture, an end-to-end anchor-free detector is designed and trained by integrating BMFPN into FCOS. The center-ness branch in FCOS is additionally modified into our Gaussian center-ness branch (GCB), which brings a further slight improvement. Without bells and whistles, our method gains +3.3%, +2.4% and +2.6% AP on the MS COCO dataset over baselines with ResNet-50, ResNet-101 and ResNeXt-101 backbones, respectively.
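
The idea of assigning an object to a feature layer by both scale and aspect ratio can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the assignment rule, so the function name `assign_matrix_layer`, the `base_size` of 64 pixels, the five levels per axis, and the log2 rounding rule are all assumptions made for illustration. The sketch treats the pyramid as a grid in which row i covers object heights near `base_size * 2**i` and column j covers widths near `base_size * 2**j`, so cells on the diagonal correspond to roughly square objects and off-diagonal cells to elongated ones.

```python
import math


def assign_matrix_layer(width, height, base_size=64.0, num_levels=5):
    """Pick the (row, col) cell of a hypothetical layer matrix for a box.

    Assumption (not from the paper): row i matches heights near
    base_size * 2**i and column j matches widths near base_size * 2**j.
    """

    def level(extent):
        # Nearest power-of-two level for this extent, clamped to the grid.
        idx = round(math.log2(max(extent, 1e-6) / base_size))
        return min(max(idx, 0), num_levels - 1)

    return level(height), level(width)


if __name__ == "__main__":
    # (width, height) pairs: a square box, a wide box, and a tall box.
    for w, h in [(64, 64), (256, 64), (64, 256)]:
        row, col = assign_matrix_layer(w, h)
        kind = "square" if row == col else "elongated"
        print(f"box {w}x{h} -> layer ({row}, {col}), {kind} receptive field")
```

Under these assumptions, a pyramid with only square receptive fields would be restricted to the diagonal cells, which is the mismatch for rectangular objects that the abstract describes.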

Keywords:
Pyramid (geometry); Feature (linguistics); Computer science; Artificial intelligence; Pattern recognition (psychology); Feature extraction; Object detection; Computer vision; Receptive field; Diagonal; Mathematics; Geometry

Metrics

Cited By: 9
FWCI (Field Weighted Citation Impact): 0.82
Refs: 53
Citation Normalized Percentile: 0.73

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
