JOURNAL ARTICLE

Dense Voxel Fusion for 3D Object Detection

Anas MahmoudJordan S. K. HuSteven L. Waslander

Year: 2023 Journal:   2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Pages: 663-672

Abstract

Camera and LiDAR sensor modalities provide complementary appearance and geometric information useful for detecting 3D objects for autonomous vehicle applications. However, current end-to-end fusion methods are challenging to train and underperform state-of-the-art LiDAR-only detectors. Sequential fusion methods suffer from a limited number of pixel and point correspondences due to point cloud sparsity, or their performance is strictly capped by the detections of one of the modalities. Our proposed solution, Dense Voxel Fusion (DVF) is a sequential fusion method that generates multi-scale dense voxel feature representations, improving expressiveness in low point density regions. To enhance multi-modal learning, we train directly with projected ground truth 3D bounding box labels, avoiding noisy, detector-specific 2D predictions. Both DVF and the multi-modal training approach can be applied to any voxel-based LiDAR backbone. DVF ranks 3 rd among published fusion methods on KITTI's 3D car detection benchmark without introducing additional trainable parameters, nor requiring stereo images or dense depth labels. In addition, DVF significantly improves 3D vehicle detection performance of voxel-based methods on the Waymo Open Dataset.

Keywords:
Voxel Artificial intelligence Computer science Point cloud Computer vision Lidar Object detection Benchmark (surveying) Pixel Point (geometry) Feature (linguistics) Sensor fusion Detector Ground truth Pattern recognition (psychology) Image fusion Fusion Image (mathematics) Mathematics Remote sensing

Metrics

68
Cited By
5.15
FWCI (Field Weighted Citation Impact)
40
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Radar Voxel Fusion for 3D Object Detection

Felix NobisEhsan ShafieiPhillip KarleJohannes BetzMarkus Lienkamp

Journal:   Applied Sciences Year: 2021 Vol: 11 (12)Pages: 5598-5598
JOURNAL ARTICLE

Voxel Field Fusion for 3D Object Detection

Yanwei LiXiaojuan QiYukang ChenLiwei WangZeming LiJian SunJiaya Jia

Journal:   2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Year: 2022 Pages: 1110-1119
JOURNAL ARTICLE

Dense projection fusion for 3D object detection

Chen ZhaoBin‐Jie HuChengxi LuoGuohao ChenHaohui Zhu

Journal:   Scientific Reports Year: 2024 Vol: 14 (1)Pages: 23492-23492
© 2026 ScienceGate Book Chapters — All rights reserved.