JOURNAL ARTICLE

MCF3D: Multi-Stage Complementary Fusion for Multi-Sensor 3D Object Detection

Jiarong WangMing ZhuDeyao SunBo WangWen GaoWei Hua

Year: 2019 Journal:   IEEE Access Vol: 7 Pages: 90801-90814   Publisher: Institute of Electrical and Electronics Engineers

Abstract

We present MCF3D, a multi-stage complementary fusion three-dimensional (3D) object detection network for autonomous driving, robot navigation, and virtual reality. This is an end-to-end learnable architecture, which takes both LIDAR point clouds and RGB images as inputs and utilizes a 3D region proposal subnet and second stage detector(s) subnet to achieve high-precision oriented 3D bounding box prediction. To fully exploit the strength of multimodal information, we design a series of fine and targeted fusion methods based on the attention mechanism and prior knowledge, including “pre-fusion,” “anchor-fusion,” and “proposal-fusion.” Our proposed RGB-Intensity form encodes the reflection intensity onto the input image to strengthen the representational power. Our designed proposal-element attention module allows the network to be guided to focus more on efficient and critical information with negligible overheads. In addition, we propose a cascade-enhanced detector for small classes, which is more selective against close false positives. The experiments on the challenging KITTI benchmark show that our MCF3D method produces state-of-the-art results while running in near real-time with a low memory footprint.

Keywords:
Subnet Computer science Artificial intelligence Minimum bounding box Object detection Computer vision Detector Benchmark (surveying) Sensor fusion Image fusion Point cloud Memory footprint False positive paradox Real-time computing Pattern recognition (psychology) Image (mathematics)

Metrics

22
Cited By
1.50
FWCI (Field Weighted Citation Impact)
45
Refs
0.85
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.