JOURNAL ARTICLE

PPF-Det: Point-Pixel Fusion for Multi-Modal 3D Object Detection

Guotao Xie, Chen Zhi-yuan, Ming Gao, Manjiang Hu, Xiaohui Qin

Year: 2024   Journal: IEEE Transactions on Intelligent Transportation Systems   Vol: 25 (6)   Pages: 5598-5611   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Multi-modal fusion can take advantage of both LiDAR and camera data to boost the robustness and performance of 3D object detection. However, great challenges remain in comprehensively exploiting image information and in performing accurate interaction and fusion of diverse features. In this paper, we propose a novel multi-modal framework, namely Point-Pixel Fusion for Multi-Modal 3D Object Detection (PPF-Det). PPF-Det consists of three submodules that address the above problems: Multi Pixel Perception (MPP), Shared Combined Point Feature Encoder (SCPFE), and Point-Voxel-Wise Triple Attention Fusion (PVW-TAF). First, MPP makes full use of image semantic information to mitigate the resolution mismatch between point cloud and image. In addition, we propose SCPFE to preliminarily extract point cloud features and point-pixel features simultaneously, reducing the time spent on processing in 3D space. Lastly, we propose PVW-TAF, a fine alignment fusion strategy that generates multi-level voxel-fused features based on an attention mechanism. Extensive experiments on KITTI benchmarks, conducted on September 24, 2023, demonstrate that our method shows excellent performance.

Keywords:
Artificial intelligence, Computer vision, Point cloud, Computer science, Pixel, Robustness (evolution), Object detection, Encoder, Image fusion, Modal, Fusion, Feature (linguistics), Pattern recognition (psychology), Image (mathematics)

Metrics

Cited By: 24
FWCI (Field Weighted Citation Impact): 12.72
Refs: 74
Citation Normalized Percentile: 0.98

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
3D Surveying and Cultural Heritage
Physical Sciences →  Earth and Planetary Sciences →  Geology

Related Documents

JOURNAL ARTICLE

PPF-Net: Efficient Multimodal 3D Object Detection with Pillar-Point Fusion

Longji Zhang, Changyong Li

Journal: Electronics   Year: 2025   Vol: 14 (4)   Pages: 685-685

JOURNAL ARTICLE

Multi-modal Feature Fusion 3D Object Detection

Yiwen Jin, Rong Zhang, Yisu Hu, Hongliang Luo, Yongqiang Bai

Journal: Advances in Computer Signals and Systems   Year: 2023   Vol: 7 (8)