JOURNAL ARTICLE

MPT-Net: Mask Point Transformer Network for Large Scale Point Cloud Semantic Segmentation

Zhe Jun TangTat‐Jen Cham

Year: 2022 Journal:   2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Pages: 10611-10618

Abstract

Point cloud semantic segmentation is important for road scene perception, a task for driverless vehicles to achieve full fledged autonomy. In this work, we introduce Mask Point Transformer Network (MPT-Net), a novel architecture for point cloud segmentation that is simple to implement. MPT-Net consists of a local and global feature encoder and a transformer based decoder; a 3D Point-Voxel Convolution encoder backbone with voxel self attention to encode features and a Mask Point Transformer module to decode point features and segment the point cloud. Firstly, we introduce the novel MPT designed to specifically handle point cloud segmentation. MPT offers two benefits. It attends to every point in the point cloud using mask tokens to extract class specific features globally with cross attention, and provide inter-class feature information exchange using self attention on the learned mask tokens. Secondly, we design a backbone to use sparse point voxel convolutional blocks and a self attention block using transformers to learn local and global contextual features. We evaluate MPT-Net on large scale outdoor driving scene point cloud datasets, SemanticKITTI and nuScenes. Our experiments show that by replacing the standard segmentation head with MPT, MPT-Net achieves a state-of-the-art performance over our baseline approach by 3.8% in SemanticKITTI and is highly effective in detecting 'stuffs' in point cloud.

Keywords:
Point cloud Computer science Segmentation Artificial intelligence Encoder Transformer Computer vision Convolutional neural network Pattern recognition (psychology) Engineering

Metrics

4
Cited By
1.48
FWCI (Field Weighted Citation Impact)
34
Refs
0.79
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Remote Sensing and LiDAR Applications
Physical Sciences →  Environmental Science →  Environmental Engineering
3D Shape Modeling and Analysis
Physical Sciences →  Engineering →  Computational Mechanics
3D Surveying and Cultural Heritage
Physical Sciences →  Earth and Planetary Sciences →  Geology

Related Documents

JOURNAL ARTICLE

Point Mask Transformer for Outdoor Point Cloud Semantic Segmentation

Xiangqian LiXin TanZhizhong ZhangYuan XieLizhuang Ma

Journal:   Computational Visual Media Year: 2025 Vol: 11 (3)Pages: 497-511
JOURNAL ARTICLE

LEARD-Net: Semantic segmentation for large-scale point cloud scene

Ziyin ZengYongyang XuZhong XieWei TangJie WanWeichao Wu

Journal:   International Journal of Applied Earth Observation and Geoinformation Year: 2022 Vol: 112 Pages: 102953-102953
JOURNAL ARTICLE

Radial Transformer for Large-Scale Outdoor LiDAR Point Cloud Semantic Segmentation

Xiang HeXu LiPeizhou NiXu WangQimin XuXixiang Liu

Journal:   IEEE Transactions on Geoscience and Remote Sensing Year: 2024 Vol: 62 Pages: 1-12
© 2026 ScienceGate Book Chapters — All rights reserved.