Taohong ZhuJun ShenChali WangHuiyuan Xiong
3D object detection based on point clouds is crucial for the safety of autonomous vehicles. The 3D feature network significantly impacts the feature extraction of the object detectors. However, the 3D feature network often exhibits poor performance due to submanifold dilation. In this paper, we propose a drop sparse convolution network, built using drop sparse convolution (Drop Conv), to mitigate the effects of submanifold dilation. Drop Conv calculates the activity of each feature by encoding itself and removes inactive features to preserve feature connectivity while suppressing dilated features. The proposed module can be easily integrated into existing 3D backbone networks. By combining it with existing sparse convolutions, it fills the gap in 3D backbone networks where sparsity cannot be enhanced. Extensive experiments on KITTI and nuScenes have demonstrated that the proposed method can effectively enhance the performance of state-of-the-art 3D object detectors using 3D feature networks.
Hai WuChenglu WenShaoshuai ShiXin LiCheng Wang
Jing HuYu FanPing ZhangChang DuanChao ZhangSi Yu Chen
Huaijin LiuJi‐Xiang DuYong ZhangHongbo ZhangJiandian Zeng
Linye LiXiaodong YueZhikang XuShaorong Xie
Qing TianYiyao ZhangZheng Zhang