Unsupervised Domain Adaptation for Referring Semantic Segmentation

Haonan Shi; Wenwen Pan; Zhou Zhao; Mingmin Zhang; Fei Wu

doi:10.1145/3581783.3611879

JOURNAL ARTICLE

Year: 2023 Pages: 5807-5818

Abstract

In this paper, we study the task of referring semantic segmentation in a highly practical setting, in which labeled visual data with corresponding text descriptions are available in the source domain, but only unlabeled visual data (without text descriptions) are available in the target domain. This is a challenging task with three main difficulties: (1) how to obtain proper queries for the target domain; (2) how to adapt to shifts in the joint visual-text distribution; (3) how to maintain the original segmentation performance. We therefore propose a cycle-consistent vision-language matching network to narrow the domain gap and ease the difficulty of adaptation. Our model has significant practical applications, since it can generalize to new data sources without requiring corresponding text annotations. First, a pseudo-text selector is devised to handle the missing modality; it uses the pre-trained CLIP model to measure the gap between the query features of the source and the visual features of the target. Next, a cross-domain segmentation predictor is adopted, which encourages the joint representations to be domain-invariant and minimizes the discrepancy between the two domains. Then, we present a cycle-consistent query matcher that learns discriminative features by reconstructing visual features from masks. Instead of comparing text directly, we match the visual features to the pseudo queries. Extensive experiments show the effectiveness of our method.
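The pseudo-text selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that source query features and target visual features have already been embedded in a shared space (the abstract names a pre-trained CLIP model for this), and it assumes cosine similarity as the measure of the "gap", which the abstract does not specify. The names `cosine_similarity` and `select_pseudo_queries`, and the toy vectors, are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine similarity between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_pseudo_queries(
    target_visual_features: List[Vector],
    source_queries: List[str],
    source_query_features: List[Vector],
) -> List[Tuple[str, float]]:
    """For each target image feature, pick the closest source query.

    Assumption: "closest" means highest cosine similarity in a shared
    embedding space. Returns (query text, similarity) per target image.
    """
    if len(source_queries) != len(source_query_features):
        raise ValueError("each source query needs exactly one feature vector")
    if not source_queries:
        raise ValueError("at least one source query is required")

    selections = []
    for visual in target_visual_features:
        scores = [cosine_similarity(visual, q) for q in source_query_features]
        best = max(range(len(scores)), key=scores.__getitem__)
        selections.append((source_queries[best], scores[best]))
    return selections


# Toy stand-ins for embeddings (hypothetical values, not model output).
queries = ["the red car on the left", "a person riding a bike"]
query_features = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
target_features = [[0.9, 0.1, 0.0], [0.2, 0.8, 0.1]]

for text, score in select_pseudo_queries(target_features, queries, query_features):
    print(f"{text!r} (similarity {score:.3f})")
```

In this sketch each unlabeled target image borrows the source description whose embedding is nearest to its own, which is one simple way to stand in for the missing text modality; the selection rule used in the paper may differ.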

Keywords:

Computer science; Discriminative model; Artificial intelligence; Segmentation; Task (project management); Domain adaptation; Natural language processing; Matching (statistics); Pattern recognition (psychology); Domain (mathematical analysis); Classifier (UML)

Metrics

FWCI (Field Weighted Citation Impact): 0.73

Citation Normalized Percentile: 0.67

Topics

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Multichannel Semantic Segmentation with Unsupervised Domain Adaptation

Rethinking unsupervised domain adaptation for semantic segmentation

Domain Connection based Unsupervised Domain Adaptation for Semantic Segmentation

Towards Unsupervised Online Domain Adaptation for Semantic Segmentation

Depth Guidance Unsupervised Domain Adaptation for Semantic Segmentation