JOURNAL ARTICLE

xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation

Abstract

Unsupervised Domain Adaptation (UDA) is crucial to tackle the lack of annotations in a new domain. There are many multi-modal datasets, but most UDA approaches are uni-modal. In this work, we explore how to learn from multi-modality and propose cross-modal UDA (xMUDA) where we assume the presence of 2D images and 3D point clouds for 3D semantic segmentation. This is challenging as the two input spaces are heterogeneous and can be impacted differently by domain shift. In xMUDA, modalities learn from each other through mutual mimicking, disentangled from the segmentation objective, to prevent the stronger modality from adopting false predictions from the weaker one. We evaluate on new UDA scenarios including day-to-night, country-to-country and dataset-to-dataset, leveraging recent autonomous driving datasets. xMUDA brings large improvements over uni-modal UDA on all tested scenarios, and is complementary to state-of-the-art UDA techniques. Code is available at https://github.com/valeoai/xmuda.
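For a concrete picture of the mutual-mimicking objective described above, below is a minimal PyTorch sketch, not the authors' implementation (the official code at the URL above differs in detail). It assumes each modality's network exposes a main segmentation head and an auxiliary mimicry head producing per-point class logits; all names and shapes here are hypothetical. The mimicry head of one modality is trained with a KL divergence toward the detached main prediction of the other, so the segmentation loss stays on the main heads and neither modality's main prediction is dragged toward the other.

import torch
import torch.nn.functional as F

def cross_modal_loss(mimic_logits, target_logits):
    # KL(target || mimic): the mimicry head (student) matches the other
    # modality's main prediction (teacher). detach() stops gradients from
    # flowing back into the modality being mimicked.
    log_q = F.log_softmax(mimic_logits, dim=1)
    p = F.softmax(target_logits.detach(), dim=1)
    return F.kl_div(log_q, p, reduction="none").sum(dim=1).mean()

# Hypothetical per-point logits, shape (num_points, num_classes).
N, C = 1000, 10
logits_2d_main  = torch.randn(N, C)                      # 2D main head
logits_2d_mimic = torch.randn(N, C, requires_grad=True)  # 2D mimicry head
logits_3d_main  = torch.randn(N, C)                      # 3D main head
logits_3d_mimic = torch.randn(N, C, requires_grad=True)  # 3D mimicry head

# Symmetric cross-modal term: 2D mimics 3D, and 3D mimics 2D.
loss_xm = cross_modal_loss(logits_2d_mimic, logits_3d_main) \
        + cross_modal_loss(logits_3d_mimic, logits_2d_main)
loss_xm.backward()

Because this term requires no labels, it can be applied on the unlabeled target domain as well as the labeled source domain, which is what provides the adaptation signal.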

Keywords:
Computer science, Semantic segmentation, Modality, Domain adaptation, Point cloud, Artificial intelligence, Machine learning

Metrics

Cited By: 213
FWCI (Field-Weighted Citation Impact): 21.00
References: 51
Citation Normalized Percentile: 0.99 (top 1%)

Topics

Domain Adaptation and Few-Shot Learning (Physical Sciences → Computer Science → Artificial Intelligence)
Multimodal Machine Learning Applications (Physical Sciences → Computer Science → Computer Vision and Pattern Recognition)
Advanced Neural Network Applications (Physical Sciences → Computer Science → Computer Vision and Pattern Recognition)