Yingyan Li, Lue Fan, Yang Liu, Zehao Huang, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang
Currently prevalent multi-modal 3D detection methods rely on dense detectors that usually use dense Bird's-Eye-View (BEV) feature maps. However, the cost of such BEV feature maps is quadratic in the detection range, making them unscalable for long-range detection. Recently, LiDAR-only fully sparse architectures have been gaining attention for their high efficiency in long-range perception. In this paper, we study how to develop a multi-modal fully sparse detector. Specifically, our proposed detector integrates well-studied 2D instance segmentation into the camera branch, in parallel with the 3D instance segmentation part of the LiDAR-only baseline. The proposed instance-based fusion framework maintains full sparsity while overcoming the constraints associated with the LiDAR-only fully sparse detector. Our framework achieves state-of-the-art performance on the widely used nuScenes dataset, the Waymo Open Dataset, and the long-range Argoverse 2 dataset. Notably, under the long-range perception setting, the inference speed of our proposed method is 2.7× faster than that of other state-of-the-art multi-modal 3D detection methods.
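The quadratic-cost claim can be sketched with a quick calculation: a BEV feature map is a 2D grid covering the detection area, so its cell count grows with the square of the range. The grid resolution below (0.2 m/cell) is an illustrative assumption, not a value from the paper.

```python
# Sketch: BEV feature-map size grows quadratically with detection range.
# The 0.2 m cell size is an assumed, illustrative resolution.

def bev_cells(detection_range_m: float, cell_size_m: float = 0.2) -> int:
    """Number of cells in a square BEV grid covering [-r, r] x [-r, r]."""
    side = int(2 * detection_range_m / cell_size_m)
    return side * side

short_range = bev_cells(50)   # typical urban-driving range
long_range = bev_cells(200)   # long-range setting (Argoverse 2 scale)
print(short_range, long_range, long_range / short_range)
# 4x the range -> 16x the cells (and memory/compute for dense BEV heads),
# whereas a fully sparse detector's cost tracks the number of LiDAR points.
```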