Crossmodal Few-shot 3D Point Cloud Semantic Segmentation

Ziyu Zhao; Zhenyao Wu; Xinyi Wu; Canyu Zhang; Song Wang

doi:10.1145/3503161.3548251

JOURNAL ARTICLE

Year: 2022; Journal: Proceedings of the 30th ACM International Conference on Multimedia; Pages: 4760-4768

Abstract

Recently, few-shot 3D point cloud semantic segmentation methods have been introduced to mitigate the limitations of existing fully supervised approaches, i.e., heavy dependence on labeled 3D data and poor capacity to generalize to new categories. However, those few-shot learning methods still need one or a few labeled samples as support at test time. In practice, such labeling usually requires manual annotation of large numbers of points in 3D space, which can be very difficult and laborious. To address this problem, in this paper we introduce a novel crossmodal few-shot learning approach for 3D point cloud semantic segmentation. In this approach, the point cloud to be segmented is taken as the query, while one or a few labeled 2D RGB images are taken as support to guide the segmentation of the query. This way, we only need to annotate a few 2D support images for the categories of interest. Specifically, we first convert the 2D support images into 3D point cloud format based on both appearance and estimated depth information. We then introduce a co-embedding network that extracts features of the support and the query, both in 3D point cloud format, to bridge their domain gap. Finally, we compute the prototypes of the support and use the cosine similarity between the prototypes and the query features for the final segmentation. Experimental results on two widely used benchmarks show that, with one or a few labeled 2D images as support, our proposed method achieves competitive results against existing few-shot 3D point cloud semantic segmentation methods.
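The pipeline described in the abstract (lift the 2D support image to 3D, embed support and query, compare query features to support prototypes by cosine similarity) can be illustrated with a minimal sketch. This is not the authors' implementation. The pinhole camera model, the intrinsics fx, fy, cx, cy, the masked-average prototype, and all function names are assumptions made for illustration, and the co-embedding network is left out entirely: the feature vectors below are hand-written stand-ins for its output.

```python
import math

def backproject(depth, rgb, fx, fy, cx, cy):
    """Lift an H x W depth map and RGB image to (x, y, z, r, g, b) points.

    Assumes a pinhole camera with intrinsics fx, fy, cx, cy; the abstract
    does not say which camera model or depth estimator is used.
    """
    points = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0:  # skip pixels without a valid depth estimate
                continue
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z, *rgb[v][u]))
    return points

def masked_mean_prototype(features, mask):
    """Average the feature vectors of the support points selected by mask."""
    selected = [f for f, m in zip(features, mask) if m]
    if not selected:
        raise ValueError("mask selects no support points")
    dim = len(selected[0])
    return [sum(f[i] for f in selected) / len(selected) for i in range(dim)]

def cosine(a, b):
    """Cosine similarity between two vectors (0.0 if either has zero norm)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def segment_query(query_features, prototypes):
    """Give each query point the label of its most similar prototype."""
    return [
        max(prototypes, key=lambda name: cosine(q, prototypes[name]))
        for q in query_features
    ]

# Toy data: a 1 x 2 support image in which only the first pixel has depth.
support_points = backproject(
    depth=[[2.0, 0.0]],
    rgb=[[(255, 0, 0), (0, 255, 0)]],
    fx=1.0, fy=1.0, cx=0.0, cy=0.0,
)

# Hand-written stand-ins for the features a co-embedding network would give.
support_feats = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
prototypes = {
    "chair": masked_mean_prototype(support_feats, [1, 1, 0, 0]),
    "background": masked_mean_prototype(support_feats, [0, 0, 1, 1]),
}
labels = segment_query([[0.8, 0.2], [0.2, 0.7]], prototypes)
print(labels)
```

In this toy setup the first query point lands closer to the "chair" prototype and the second closer to "background". The class names and numbers are invented for the example and carry no experimental meaning.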

Keywords:

Point cloud; Computer science; Segmentation; Crossmodal; Artificial intelligence; Cosine similarity; Annotation; Embedding; Point (geometry); Domain (mathematical analysis); Similarity (geometry); Cloud computing; Information retrieval; Pattern recognition (psychology); Image (mathematics)

Metrics

FWCI (Field Weighted Citation Impact): 3.92

Citation Normalized Percentile: 0.96

Topics

3D Shape Modeling and Analysis

Physical Sciences → Engineering → Computational Mechanics

3D Surveying and Cultural Heritage

Physical Sciences → Earth and Planetary Sciences → Geology

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Crossmodal Few-shot 3D Point Cloud Semantic Segmentation via View Synthesis

Few-shot 3D Point Cloud Semantic Segmentation

Rethinking Few-shot 3D Point Cloud Semantic Segmentation

Dynamic routing towards few-shot point cloud semantic segmentation

Cross-Domain Few-Shot 3D Point Cloud Semantic Segmentation