JOURNAL ARTICLE

MMAF-Net: Multi-view multi-stage adaptive fusion for multi-sensor 3D object detection

Wensheng ZhangHongli ShiYunche ZhaoZhenan FengRuggiero Lovreglio

Year: 2023 Journal:   Expert Systems with Applications Vol: 242 Pages: 122716-122716   Publisher: Elsevier BV

Abstract

In this paper, we propose a 3D object detection method called MMAF-Net that is based on the multi-view and multi-stage adaptive fusion of RGB images and LiDAR point cloud data. This is an end-to-end architecture, which combines the characteristics of RGB images, the front view of point clouds based on reflection intensity, and the bird's eye view of point clouds. It also adopts a multi-stage fusion approach of "data-level fusion + feature-level fusion" to fully exploit the strength of multimodal information. Our proposed method addresses key challenges found in current 3D object detection methods for autonomous driving, including insufficient feature extraction from multimodal data, rudimentary fusion techniques, and sensitivity to distance and occlusion. To ensure the comprehensive integration of multimodal information, we present a series of targeted fusion methods. Firstly, we propose a novel input form that encodes dense point cloud reflectivity information into the image to enhance its representational power. Secondly, we design the Region Attention Adaptive Fusion module utilizing an attention mechanism to guide the network in adaptively adjusting the importance of different features. Finally, we extend the 2D DIOU (Distance Intersection over Union) loss function to 3D and develop a joint regression loss based on 3D_DIOU and SmoothL1 to optimize the similarity between detected and ground truth boxes. The experimental results on the KITTI dataset demonstrate that MMAF-Net effectively addresses the challenges posed by highly obscured or crowded scenes while maintaining real-time performance and improving the detection accuracy of smaller and more difficult objects that are occluded at far distances.

Keywords:
Computer science Point cloud Artificial intelligence Computer vision RGB color model Sensor fusion Key (lock) Feature (linguistics) Object detection Intersection (aeronautics) Pattern recognition (psychology)

Metrics

15
Cited By
2.73
FWCI (Field Weighted Citation Impact)
73
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Optical Sensing Technologies
Physical Sciences →  Physics and Astronomy →  Instrumentation

Related Documents

JOURNAL ARTICLE

3D Object Detection Based on Multi-view Adaptive Fusion

Yong ZhangHuan Wu

Journal:   2022 IEEE Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC) Year: 2022 Pages: 743-748
JOURNAL ARTICLE

FB-Net: Multi-sensor fusion for object detection using front view and bird’s-eye-view features

Bingli ZhangChengbiao ZhangYixin WangJunzhao JiangY ZhangXinyu WangGan ShenXiang Luo

Journal:   Proceedings of the Institution of Mechanical Engineers Part D Journal of Automobile Engineering Year: 2026
JOURNAL ARTICLE

Multi-View Clustering via Multi-Stage Fusion

Gan YuYunning YouJunjie HuangSen XiangChang TangWei HuShan An

Journal:   IEEE Transactions on Multimedia Year: 2025 Vol: 27 Pages: 4571-4583
© 2026 ScienceGate Book Chapters — All rights reserved.