JOURNAL ARTICLE

Multi-modal Feature Fusion 3D Object Detection

Abstract

For the existing 3D small object detection is prone to false detection and missed detection and other deficiencies. A 3D object detection method based on multi-modal feature fusion is proposed. Firstly, a feature extraction module is designed. The input image data is down-sampled through the image feature extraction network, and the input point cloud data is sampled and grouped through the point cloud feature extraction network to obtain the feature information at different scales. Secondly, a multi-modal feature fusion module is constructed to realize the point correspondence between point cloud features and image features by projection operation, and then the image features and point cloud features are splicing and fused to generate the final point cloud features to compensate the deficiency of single modal feature information. The experimental results show that compared with the existing algorithms, the algorithm in this paper improves the average detection accuracy of small object by 2.03%.

Keywords:
Point cloud Feature (linguistics) Artificial intelligence Computer science Feature extraction Computer vision Pattern recognition (psychology) Object detection Modal Feature detection (computer vision) Image fusion Object (grammar) Point (geometry) Projection (relational algebra) Image (mathematics) Image processing Mathematics Algorithm

Metrics

8
Cited By
2.28
FWCI (Field Weighted Citation Impact)
9
Refs
0.87
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Industrial Vision Systems and Defect Detection
Physical Sciences →  Engineering →  Industrial and Manufacturing Engineering
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image and Object Detection Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.