Point Mask Transformer for Outdoor Point Cloud Semantic Segmentation

Xiangqian Li; Xin Tan; Zhizhong Zhang; Yuan Xie; Lizhuang Ma

doi:10.26599/cvm.2025.9450388

JOURNAL ARTICLE

Year: 2025; Journal: Computational Visual Media; Vol: 11 (3); Pages: 497-511; Publisher: Springer Nature

Abstract

Current outdoor point-cloud segmentation methods typically formulate semantic segmentation as a per-point/voxel-classification task. Although this strategy is straightforward because it classifies each point directly, it ignores the overall relationship of the category. As an alternative paradigm, mask classification decouples category classification from region localization, allowing the model to better capture overall category relationships. In this paper, we propose a novel approach called the point mask transformer (PMFormer), which transforms the semantic segmentation of point clouds from per-point classification to mask classification using a transformer architecture. The proposed model comprises a 3D backbone, transformer decoder, and segmentation head that predicts a series of binary masks, each associated with a global class label. Furthermore, to accommodate the unique characteristics of large and sparse outdoor point-cloud scenes, we propose three enhancements for the integration of point-cloud data with the transformer: MaskMix, 3D position encoding, and attention weights. We evaluate our model using the SemanticKITTI and nuScenes datasets. Our experimental results show that the proposed method outperforms state-of-the-art semantic segmentation approaches.
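
To illustrate the mask-classification paradigm described in the abstract, here is a minimal NumPy sketch of one way that a set of binary masks, each paired with a class distribution, can be turned back into per-point labels. It follows the generic MaskFormer-style inference rule, which is an assumption: the abstract does not state how PMFormer fuses its mask and class outputs. The function name `masks_to_point_labels`, the tensor shapes, and the trailing "no object" class column are hypothetical and chosen only for this example.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def masks_to_point_labels(class_logits, mask_logits):
    """Combine per-query class scores and binary masks into per-point labels.

    class_logits: (Q, C + 1) array; the last column is an assumed
                  "no object" class that is dropped before fusion.
    mask_logits:  (Q, N) array; one binary-mask logit per query and point.
    Returns an (N,) array of predicted class indices in [0, C).
    """
    class_probs = softmax(class_logits, axis=-1)[:, :-1]   # (Q, C)
    mask_probs = 1.0 / (1.0 + np.exp(-mask_logits))        # (Q, N), sigmoid
    # Each point accumulates class evidence from every mask that covers it.
    point_scores = mask_probs.T @ class_probs               # (N, C)
    return point_scores.argmax(axis=-1)


# Toy example (made-up numbers): 2 queries, 2 classes, 4 points.
class_logits = np.array([[5.0, 0.0, 0.0],    # query 0 votes for class 0
                         [0.0, 5.0, 0.0]])   # query 1 votes for class 1
mask_logits = np.array([[4.0, 4.0, -4.0, -4.0],    # query 0 covers points 0-1
                        [-4.0, -4.0, 4.0, 4.0]])   # query 1 covers points 2-3
labels = masks_to_point_labels(class_logits, mask_logits)
print(labels)
```

In this sketch the class decision is made once per mask and the mask decides which points inherit it, which is the decoupling of category classification from region localization that the abstract refers to.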

Keywords:

Point cloud; Segmentation; Transformer; Computer science; Computer graphics (images); Point (geometry); Artificial intelligence; Computer vision; Engineering; Electrical engineering; Geometry; Mathematics

Metrics

FWCI (Field Weighted Citation Impact): 30.68

Citation Normalized Percentile: 0.99

Topics

3D Surveying and Cultural Heritage

Physical Sciences → Earth and Planetary Sciences → Geology

Remote Sensing and LiDAR Applications

Physical Sciences → Environmental Science → Environmental Engineering

Optical measurement and interference techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

MPT-Net: Mask Point Transformer Network for Large Scale Point Cloud Semantic Segmentation

Radial Transformer for Large-Scale Outdoor LiDAR Point Cloud Semantic Segmentation

pCTFusion: Point Convolution-Transformer Fusion with Semantic Aware Loss for Outdoor LiDAR Point Cloud Segmentation

Urban-scale point cloud semantic segmentation with transformer

PTFormer: Propagation Transformer for Point Cloud Semantic Segmentation