Regional-to-Local Point-Voxel Transformer for Large-Scale Indoor 3D Point Cloud Semantic Segmentation

Shuai Li; Hongjun Li

doi:10.3390/rs15194832

ScienceGate Book Chapters

JOURNAL ARTICLE

Regional-to-Local Point-Voxel Transformer for Large-Scale Indoor 3D Point Cloud Semantic Segmentation

Shuai Li Hongjun Li

Year: 2023 Journal: Remote Sensing Vol: 15 (19)Pages: 4832-4832 Publisher: Multidisciplinary Digital Publishing Institute

DOI: 10.3390/rs15194832

Get Full-Text PDF Get Analytical Report

Abstract

Semantic segmentation of large-scale indoor 3D point cloud scenes is crucial for scene understanding but faces challenges in effectively modeling long-range dependencies and multi-scale features. In this paper, we present RegionPVT, a novel Regional-to-Local Point-Voxel Transformer that synergistically integrates voxel-based regional self-attention and window-based point-voxel self-attention for concurrent coarse-grained and fine-grained feature learning. The voxel-based regional branch focuses on capturing regional context and facilitating inter-window communication. The window-based point-voxel branch concentrates on local feature learning while integrating voxel-level information within each window. This unique design enables the model to jointly extract local details and regional structures efficiently and provides an effective and efficient solution for multi-scale feature fusion and a comprehensive understanding of 3D point clouds. Extensive experiments on S3DIS and ScanNet v2 datasets demonstrate that our RegionPVT achieves competitive or superior performance compared with state-of-the-art approaches, attaining mIoUs of 71.0% and 73.9% respectively, with significantly lower memory footprint.

Keywords:

Computer science Voxel Point cloud Segmentation Artificial intelligence Feature (linguistics) Scale (ratio) Computer vision Pattern recognition (psychology) Cartography Geography

Metrics

Cited By

2.69

FWCI (Field Weighted Citation Impact)

Refs

0.84

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

3D Shape Modeling and Analysis

Physical Sciences → Engineering → Computational Mechanics

Remote Sensing and LiDAR Applications

Physical Sciences → Environmental Science → Environmental Engineering

3D Surveying and Cultural Heritage

Physical Sciences → Earth and Planetary Sciences → Geology

Regional-to-Local Point-Voxel Transformer for Large-Scale Indoor 3D Point Cloud Semantic Segmentation

Abstract

Metrics

Citation History

Topics

Related Documents

MPT-Net: Mask Point Transformer Network for Large Scale Point Cloud Semantic Segmentation

3D Indoor Point Cloud Semantic Segmentation Using Image and Voxel

Urban-scale point cloud semantic segmentation with transformer

Retrieval-and-alignment based large-scale indoor point cloud semantic segmentation

Radial Transformer for Large-Scale Outdoor LiDAR Point Cloud Semantic Segmentation