JOURNAL ARTICLE

GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training

Abstract

This paper tries to address a fundamental question in point cloud self-supervised learning: what is a good signal we should leverage to learn features from point clouds without annotations? To answer that, we introduce a point cloud representation learning framework, based on geometric feature reconstruction. In contrast to recent papers that directly adopt masked autoencoder (MAE) and only predict original coordinates or occupancy from masked point clouds, our method revisits differences between images and point clouds and identifies three self-supervised learning objectives peculiar to point clouds, namely centroid prediction, normal estimation, and curvature prediction. Combined, these three objectives yield an nontrivial self-supervised learning task and mutually facilitate models to better reason fine-grained geometry of point clouds. Our pipeline is conceptually simple and it consists of two major steps: first, it randomly masks out groups of points, followed by a Transformer-based point cloud encoder; second, a lightweight Transformer decoder predicts centroid, normal, and curvature for points in each voxel. We transfer the pre-trained Transformer encoder to a downstream peception model. On the nuScene Datset, our model achieves 3.38 mAP improvment for object detection, 2.1 mIoU gain for segmentation, and 1.7 AMOTA gain for multi-object tracking. We also conduct experiments on the Waymo Open Dataset and achieve significant performance improvements over baselines as well. 1 1 Our code is available at https://github.com/Tsinghua-MARS-Lab/GeoMAE.

Keywords:
Point cloud Computer science Artificial intelligence Centroid Segmentation Leverage (statistics) Encoder Feature learning Computer vision Pattern recognition (psychology)

Metrics

32
Cited By
16.80
FWCI (Field Weighted Citation Impact)
79
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

3D Surveying and Cultural Heritage
Physical Sciences →  Earth and Planetary Sciences →  Geology
3D Shape Modeling and Analysis
Physical Sciences →  Engineering →  Computational Mechanics
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Point-MPP: Point Cloud Self-Supervised Learning From Masked Position Prediction

Songlin FanWei GaoGe Li

Journal:   IEEE Transactions on Neural Networks and Learning Systems Year: 2024 Vol: 36 (7)Pages: 12964-12976
JOURNAL ARTICLE

Masked Feature Prediction for Self-Supervised Visual Pre-Training

Chen WeiHaoqi FanSaining XieChao-Yuan WuAlan YuilleChristoph Feichtenhofer

Journal:   2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Year: 2022 Pages: 14648-14658
JOURNAL ARTICLE

Mutual information-driven self-supervised point cloud pre-training

Weichen XuTianhao FuJian CaoXinyu ZhaoXinxin XuXixin CaoXing Zhang

Journal:   Knowledge-Based Systems Year: 2024 Vol: 307 Pages: 112741-112741
© 2026 ScienceGate Book Chapters — All rights reserved.