Abstract

Point cloud computation has become an increasingly more important workload for autonomous driving and other applications. Unlike dense 2D computation, point cloud convolution has sparse and irregular computation patterns and thus requires dedicated inference system support with specialized high-performance kernels. While existing point cloud deep learning libraries have developed different dataflows for convolution on point clouds, they assume a single dataflow throughout the execution of the entire model. In this work, we systematically analyze and improve existing dataflows. Our resulting system, TorchSparse++, achieves 2.9×, 3.3×, 2.2× and 1.8× measured end-to-end speedup on an NVIDIA A100 GPU over the state-of-the-art MinkowskiEngine, SpConv 1.2, TorchSparse and SpConv v2 in inference respectively. Furthermore, TorchSparse++ is the only system to date that supports all necessary primitives for 3D segmentation, detection, and reconstruction workloads in autonomous driving. Code is publicly released at https://github.com/mit-han-lab/torchsparse.

Keywords:
Computer science Dataflow Point cloud Convolution (computer science) Computation Inference Cloud computing Speedup Workload Point (geometry) Code (set theory) Parallel computing State (computer science) Distributed computing Computer engineering Artificial intelligence Algorithm Operating system Programming language

Metrics

20
Cited By
6.72
FWCI (Field Weighted Citation Impact)
54
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

3D Shape Modeling and Analysis
Physical Sciences →  Engineering →  Computational Mechanics
Remote Sensing and LiDAR Applications
Physical Sciences →  Environmental Science →  Environmental Engineering
3D Surveying and Cultural Heritage
Physical Sciences →  Earth and Planetary Sciences →  Geology

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.