JOURNAL ARTICLE

Point-Voxel Fusion for 3D Object Detection

Abstract

In 3D object detection, network prediction accuracy is greatly affected by point cloud's feature richness. However, the feature richness depends on fine-grained features extracted by the network. Currently some methods use voxel encoding approach continuously down-scaled by 3D convolution to improve the detection efficiency, but lose too many fine-grained features. Some methods directly inputting the original point cloud into the Multi-layer Perceptron (MLP) for feature extraction, which can retain more fine-grained features, but greatly reduce the detection efficiency. This work combines voxel features and point features to obtain a fused 3D map. We use an attention mechanism module that combines semantic features with spatial features to progress the former 3D feature map, which is used to constitute a richer 3D feature structure to reduce the loss of Z-axis features. Since the object geometry structure information is important for the detection task, we design a geometry-oriented auxiliary network that is jointly optimized by supervising two tasks in the training phase to guide the backbone network to understand the target structure features and discard them in the inference phase. The experiments show that our proposed detection method outperforms some previous methods in KITTI 3D/BEV detection.

Keywords:
Computer science Point cloud Artificial intelligence Feature (linguistics) Object detection Feature extraction Voxel Pattern recognition (psychology) Convolution (computer science) Inference Computer vision Artificial neural network

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
29
Refs
0.12
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
3D Shape Modeling and Analysis
Physical Sciences →  Engineering →  Computational Mechanics
3D Surveying and Cultural Heritage
Physical Sciences →  Earth and Planetary Sciences →  Geology

Related Documents

JOURNAL ARTICLE

HCPVF: Hierarchical Cascaded Point-Voxel Fusion for 3D Object Detection

Baojie FanKexin ZhangJiandong Tian

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2023 Vol: 34 (10)Pages: 8997-9009
BOOK-CHAPTER

Point-Voxel Fusion with Adaptive Sectorized Points Sampling for 3D Object Detection

Yihui LiuHe HongwenYingjuan Tang

Communications in computer and information science Year: 2026 Pages: 147-159
JOURNAL ARTICLE

Dense Voxel Fusion for 3D Object Detection

Anas MahmoudJordan S. K. HuSteven L. Waslander

Journal:   2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Year: 2023 Pages: 663-672
© 2026 ScienceGate Book Chapters — All rights reserved.