Pyramid Point Cloud Transformer for Large-Scale Place Recognition

Le Hui; Hang Yang; Mingmei Cheng; Jin Xie; Jian Yang

doi:10.1109/iccv48922.2021.00604

Year: 2021
Venue: 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
Pages: 6078-6087

Abstract

Recently, deep-learning-based point cloud descriptors have achieved impressive results in place recognition. Nonetheless, because point clouds are sparse, extracting discriminative local features and efficiently aggregating them into a global descriptor remains a challenging problem. In this paper, we propose a pyramid point cloud transformer network (PPT-Net) that learns discriminative global descriptors from point clouds for efficient retrieval. Specifically, we first develop a pyramid point transformer module that adaptively learns the spatial relationships among the different k-NN neighboring points of the point cloud, in which a grouped self-attention is proposed to extract discriminative local features. The grouped self-attention not only enhances long-term dependencies within the point cloud but also reduces the computational cost. To obtain discriminative global descriptors, we construct a pyramid VLAD module that aggregates the multi-scale feature maps of the point cloud into global descriptors. After applying VLAD pooling to the multi-scale feature maps, we apply a context gating mechanism to the resulting global descriptors to adaptively weight the multi-scale global context information into the final global descriptor. Experimental results on the Oxford dataset and three in-house datasets show that our method achieves state-of-the-art performance on point-cloud-based place recognition. Code is available at https://github.com/fpthink/PPT-Net.
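The pipeline described in the abstract (grouped self-attention, VLAD pooling per scale, context gating over the combined descriptors) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation; the official code is in the repository linked above. Everything here is an assumption made for illustration: the function names, the random stand-ins for learned weights, attention over all points rather than k-NN neighborhoods, a NetVLAD-style soft assignment, and fusing scales by concatenation before gating.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def grouped_self_attention(feats, num_groups, rng):
    """Split the C channels of feats (N, C) into groups, run scaled
    dot-product self-attention inside each group, and concatenate.
    Random matrices stand in for learned projections (assumption)."""
    n, c = feats.shape
    assert c % num_groups == 0, "channels must divide evenly into groups"
    d = c // num_groups
    outputs = []
    for g in range(num_groups):
        x = feats[:, g * d:(g + 1) * d]                      # (N, d)
        wq, wk, wv = (rng.standard_normal((d, d)) / np.sqrt(d) for _ in range(3))
        q, k, v = x @ wq, x @ wk, x @ wv
        attn = softmax(q @ k.T / np.sqrt(d), axis=-1)        # (N, N)
        outputs.append(attn @ v)                             # (N, d)
    return np.concatenate(outputs, axis=1)                   # (N, C)

def vlad_pool(feats, centers, assign_w):
    """NetVLAD-style pooling: soft-assign each point to K clusters and
    sum the weighted residuals to the cluster centers."""
    a = softmax(feats @ assign_w, axis=1)                    # (N, K)
    resid = feats[:, None, :] - centers[None, :, :]          # (N, K, C)
    v = (a[:, :, None] * resid).sum(axis=0)                  # (K, C)
    v = v / (np.linalg.norm(v, axis=1, keepdims=True) + 1e-12)  # per-cluster L2 norm
    v = v.ravel()
    return v / (np.linalg.norm(v) + 1e-12)                   # global L2 norm

def context_gate(desc, w, b):
    """Context gating: reweight each descriptor entry by a sigmoid gate."""
    gate = 1.0 / (1.0 + np.exp(-(desc @ w + b)))
    return desc * gate

rng = np.random.default_rng(0)
C, K = 16, 8
# Hypothetical multi-scale feature maps: 64 points and 32 points, C channels each.
scales = [rng.standard_normal((n, C)) for n in (64, 32)]

descs = []
for f in scales:
    f = grouped_self_attention(f, num_groups=4, rng=rng)
    centers = rng.standard_normal((K, C))
    assign_w = rng.standard_normal((C, K))
    descs.append(vlad_pool(f, centers, assign_w))

multi = np.concatenate(descs)                                # (2 * K * C,)
w = rng.standard_normal((multi.size, multi.size)) * 0.01
b = np.zeros(multi.size)
final = context_gate(multi, w, b)
print(final.shape)
```

With G groups, each projection in this sketch is a (C/G) x (C/G) matrix, so the three projections cost 3C²/G parameters in total instead of 3C² for a single ungrouped attention. That is one way channel grouping can lower cost; the actual savings reported by the authors depend on their architecture.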

Keywords:

Discriminative model; Point cloud; Computer science; Pyramid (geometry); Artificial intelligence; Pooling; Pattern recognition (psychology); Feature (linguistics); Mathematics

Metrics

Cited by: 135

FWCI (Field Weighted Citation Impact): 44.05

Citation Normalized Percentile: 1.00

Topics

Robotics and Sensor-Based Localization

Physical Sciences → Engineering → Aerospace Engineering

3D Shape Modeling and Analysis

Physical Sciences → Engineering → Computational Mechanics

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Efficient 3D Point Cloud Feature Learning for Large-Scale Place Recognition

HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud

TransLoc3D: point cloud based large-scale place recognition using adaptive receptive fields

SelFLoc: Selective feature fusion for large-scale point cloud-based place recognition

MinkUNeXt: Point cloud-based large-scale place recognition using 3D sparse convolutions