Emmanuel Asiedu Brempong, Simon Kornblith, Ting Chen, Niki Parmar, Matthias Minderer, Mohammad Norouzi
Semantic segmentation labels are expensive and time-consuming to acquire. To improve the label efficiency of semantic segmentation models, we revisit denoising autoencoders and study the use of a denoising objective for pretraining UNets. We pretrain a Transformer-based UNet as a denoising autoencoder, then fine-tune it on semantic segmentation using few labeled examples. Denoising pretraining outperforms training from random initialization, and even supervised ImageNet-21K pretraining of the encoder when the number of labeled images is small. A key advantage of denoising pretraining over supervised pretraining of the backbone is the ability to pretrain the decoder, which would otherwise be randomly initialized. We thus propose a novel Decoder Denoising Pretraining (DDeP) method, in which we initialize the encoder using supervised learning and pretrain only the decoder using the denoising objective. Despite its simplicity, DDeP achieves state-of-the-art results on label-efficient semantic segmentation, offering considerable gains on the Cityscapes, Pascal Context, and ADE20K datasets.
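The denoising objective described above can be sketched in a few lines: corrupt a clean image with Gaussian noise and train the network (here, a stand-in predictor) to recover the noise under a mean-squared-error loss. This is a minimal illustration, not the paper's exact formulation; the noise scale `sigma` and the zero-predicting placeholder model are assumptions for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

def noisy_input(x, sigma, rng):
    # Corrupt the clean image x with additive Gaussian noise of scale sigma.
    eps = rng.normal(0.0, 1.0, size=x.shape)
    return x + sigma * eps, eps

def denoising_loss(predicted_eps, eps):
    # Mean-squared error between the model's noise prediction and the true noise;
    # in DDeP-style pretraining this loss trains the decoder.
    return float(np.mean((predicted_eps - eps) ** 2))

# Toy example: a placeholder "decoder" that predicts zero noise everywhere.
x = rng.normal(size=(4, 4))            # stand-in for a clean image
x_tilde, eps = noisy_input(x, sigma=0.5, rng=rng)
loss = denoising_loss(np.zeros_like(eps), eps)
```

In the actual method, `predicted_eps` would come from the UNet decoder applied to `x_tilde`; after pretraining, the denoising head is replaced with a segmentation head for fine-tuning.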