JOURNAL ARTICLE

Multi-Scale Feature Fusion Point Cloud Object Detection Based on Original Point Cloud and Projection

Z. D. ZhangZhongjie ZhuYongqiang BaiYiwen JinMinyu Wang

Year: 2024 Journal:   Electronics Vol: 13 (11)Pages: 2213-2213   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Existing point cloud object detection algorithms struggle to effectively capture spatial features across different scales, often resulting in inadequate responses to changes in object size and limited feature extraction capabilities, thereby affecting detection accuracy. To solve this problem, we present a point cloud object detection method based on multi-scale feature fusion of the original point cloud and projection, which aims to improve the multi-scale performance and completeness of feature extraction in point cloud object detection. First, we designed a 3D feature extraction module based on the 3D Swin Transformer. This module pre-processes the point cloud using a 3D Patch Partition approach and employs a self-attention mechanism within a 3D sliding window, along with a downsampling strategy, to effectively extract features at different scales. At the same time, we convert the 3D point cloud to a 2D image using projection technology and extract 2D features using the Swin Transformer. A 2D/3D feature fusion module is then built to integrate 2D and 3D features at the channel level through point-by-point addition and vector concatenation to improve feature completeness. Finally, the integrated feature maps are fed into the detection head to facilitate efficient object detection. Experimental results show that our method has improved the average precision of vehicle detection by 1.01% on the KITTI dataset over three levels of difficulty compared to Voxel-RCNN. In addition, visualization analyses show that our proposed algorithm also exhibits superior performance in object detection.

Keywords:
Point cloud Cloud computing Computer science Projection (relational algebra) Feature (linguistics) Scale (ratio) Artificial intelligence Computer vision Point (geometry) Fusion Object (grammar) Image fusion Algorithm Image (mathematics) Mathematics Geography Geometry Cartography

Metrics

3
Cited By
1.16
FWCI (Field Weighted Citation Impact)
23
Refs
0.66
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Remote Sensing and LiDAR Applications
Physical Sciences →  Environmental Science →  Environmental Engineering
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
3D Surveying and Cultural Heritage
Physical Sciences →  Earth and Planetary Sciences →  Geology
© 2026 ScienceGate Book Chapters — All rights reserved.