Cross-Modal Contrastive Learning for Domain Adaptation in 3D Semantic Segmentation

Bowei Xing; Xianghua Ying; Ruibin Wang; Jinfa Yang; Taiyan Chen

doi:10.1609/aaai.v37i3.25400

Year: 2023; Journal: Proceedings of the AAAI Conference on Artificial Intelligence; Vol: 37 (3); Pages: 2974-2982; Publisher: Association for the Advancement of Artificial Intelligence

Abstract

Domain adaptation for 3D point clouds has attracted considerable interest because it can partly avoid the time-consuming labeling of 3D data. A recent work, xMUDA, applied multi-modal data to the domain adaptation task of 3D semantic segmentation by having the 2D and 3D modalities mimic each other's predictions, and it outperformed previous single-modality methods that use only point clouds. Building on xMUDA, we propose a novel cross-modal contrastive learning scheme to further improve adaptation. By imposing constraints derived from the correspondences between 2D pixel features and 3D point features, our method not only facilitates interaction between the two modalities but also strengthens feature representations in both the labeled source domain and the unlabeled target domain. Meanwhile, to make fuller use of 2D context information for domain adaptation through cross-modal learning, we introduce a neighborhood feature aggregation module to enhance pixel features. The module employs neighborhood attention to aggregate nearby pixels in the 2D image, which alleviates the mismatch between the two modalities that arises from projecting a relatively sparse point cloud onto dense image pixels. We evaluate our method on three unsupervised domain adaptation scenarios: country-to-country, day-to-night, and dataset-to-dataset. Experimental results show that our approach outperforms existing methods, demonstrating its effectiveness.
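The abstract does not give the exact form of the contrastive objective or of the aggregation module, so the following is only a minimal sketch of the two ideas under stated assumptions. It assumes a symmetric InfoNCE loss over matched pixel/point feature pairs and a plain dot-product attention over a square pixel window. The function names (cross_modal_info_nce, aggregate_neighborhood), the temperature, the window radius, and the toy data are all hypothetical and are not taken from the paper.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)


def cross_modal_info_nce(pixel_feats, point_feats, temperature=0.07):
    """Symmetric InfoNCE over N matched (pixel, point) pairs (an assumed loss form).

    Row i of pixel_feats and row i of point_feats are assumed to describe the
    same 3D point projected into the image; all other rows act as negatives.
    """
    p2d = l2_normalize(pixel_feats)
    p3d = l2_normalize(point_feats)
    logits = p2d @ p3d.T / temperature           # (N, N) similarity matrix

    def diag_cross_entropy(lg):
        lg = lg - lg.max(axis=1, keepdims=True)  # numerical stability
        log_prob = lg - np.log(np.exp(lg).sum(axis=1, keepdims=True))
        return -np.mean(np.diag(log_prob))

    # Average the 2D -> 3D and 3D -> 2D directions.
    return 0.5 * (diag_cross_entropy(logits) + diag_cross_entropy(logits.T))


def aggregate_neighborhood(feature_map, row, col, radius=1):
    """Attention-weighted average of the pixels around (row, col).

    The center pixel is the query; pixels inside the window are keys and
    values. Weights are a softmax over scaled dot products (an assumption).
    """
    h, w, c = feature_map.shape
    r0, r1 = max(row - radius, 0), min(row + radius + 1, h)
    c0, c1 = max(col - radius, 0), min(col + radius + 1, w)
    neighbors = feature_map[r0:r1, c0:c1].reshape(-1, c)  # (K, C)
    query = feature_map[row, col]                          # (C,)
    scores = neighbors @ query / np.sqrt(c)
    scores = scores - scores.max()
    weights = np.exp(scores) / np.exp(scores).sum()
    return weights @ neighbors


rng = np.random.default_rng(0)
image_feats = rng.normal(size=(8, 8, 16))        # toy H x W x C 2D feature map
pixel_coords = [(1, 2), (4, 4), (6, 3), (0, 7)]  # where 4 points project (made up)
pixel_feats = np.stack(
    [aggregate_neighborhood(image_feats, r, c) for r, c in pixel_coords]
)
point_feats = rng.normal(size=(4, 16))           # toy 3D point features

loss_random = cross_modal_info_nce(pixel_feats, point_feats)
loss_aligned = cross_modal_info_nce(pixel_feats, pixel_feats)
```

In this toy setup the loss is lower when the two feature sets agree (loss_aligned) than when they are unrelated (loss_random), which is the behavior a contrastive constraint on pixel-point correspondences is meant to encourage. Aggregating a window around each projected pixel, rather than reading a single pixel, is one way to let a sparse point draw on denser 2D context.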

Keywords:

Computer science; Point cloud; Segmentation; Artificial intelligence; Feature (linguistics); Pixel; Context (archaeology); Domain adaptation; Modal; Domain (mathematical analysis); Adaptation (eye); Pattern recognition (psychology); Process (computing); Modalities; Point (geometry); Computer vision; Machine learning; Mathematics; Geography

Metrics

FWCI (Field Weighted Citation Impact): 2.16

Citation Normalized Percentile: 0.87

Topics

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Technologies in Various Fields

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Cross-Modal Learning for Domain Adaptation in 3D Semantic Segmentation

Contrastive Learning-Based Domain Adaptation for Semantic Segmentation

Learning Cross-Modal Contrastive Features for Video Domain Adaptation

xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation

Domain Adaptation for Semantic Segmentation of Autonomous Driving with Contrastive Learning