JOURNAL ARTICLE

DVST: Deformable Voxel Set Transformer for 3D Object Detection from Point Clouds

Yaqian NingJie CaoChun BaoQun Hao

Year: 2023 Journal:   Remote Sensing Vol: 15 (23)Pages: 5612-5612   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

The use of a transformer backbone in LiDAR point-cloud-based models for 3D object detection has recently gained significant interest. The larger receptive field of the transformer backbone improves its representation capability but also results in excessive attention being given to background regions. To solve this problem, we propose a novel approach called deformable voxel set attention, which we utilized to create a deformable voxel set transformer (DVST) backbone for 3D object detection from point clouds. The DVST aims to efficaciously integrate the flexible receptive field of the deformable mechanism and the powerful context modeling capability of the transformer. Specifically, we introduce the deformable mechanism into voxel-based set attention to selectively transfer candidate keys and values of foreground queries to important regions. An offset generation module was designed to learn the offsets of the foreground queries. Furthermore, a globally responsive convolutional feed-forward network with residual connection is presented to capture global feature interactions in hidden space. We verified the validity of the DVST on the KITTI and Waymo open datasets by constructing single-stage and two-stage models. The findings indicated that the DVST enhanced the average precision of the baseline model while preserving computational efficiency, achieving a performance comparable to state-of-the-art methods.

Keywords:
Computer science Point cloud Artificial intelligence Offset (computer science) Voxel Transformer Computer vision Residual Pattern recognition (psychology) Algorithm

Metrics

7
Cited By
1.27
FWCI (Field Weighted Citation Impact)
50
Refs
0.77
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
3D Surveying and Cultural Heritage
Physical Sciences →  Earth and Planetary Sciences →  Geology

Related Documents

JOURNAL ARTICLE

Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds

Chenhang HeRuihuang LiShuai LiLei Zhang

Journal:   2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Year: 2022 Pages: 8407-8417
JOURNAL ARTICLE

Voxel Graph Attention for 3-D Object Detection From Point Clouds

Bin LuSun Ok YangZhenyu Yang

Journal:   IEEE Transactions on Instrumentation and Measurement Year: 2023 Vol: 72 Pages: 1-12
JOURNAL ARTICLE

Voxel Transformer with Density-Aware Deformable Attention for 3D Object Detection

Taeho KimJoohee Kim

Journal:   Sensors Year: 2023 Vol: 23 (16)Pages: 7217-7217
JOURNAL ARTICLE

Point-voxel dual transformer for LiDAR 3D object detection

Jigang TongFanhang YangSen YangShengzhi Du

Journal:   Optoelectronics Letters Year: 2025 Vol: 21 (9)Pages: 547-554
© 2026 ScienceGate Book Chapters — All rights reserved.