Understanding Convolution for Semantic Segmentation

Panqu Wang; Pengfei Chen; Ye Yuan; Ding Liu; Zehua Huang; Xiaodi Hou; Garrison W. Cottrell

doi:10.1109/wacv.2018.00163

ScienceGate Book Chapters

JOURNAL ARTICLE

Understanding Convolution for Semantic Segmentation

Panqu Wang Pengfei Chen Ye Yuan Ding Liu Zehua Huang Xiaodi Hou Garrison W. Cottrell

Year: 2018 Pages: 1451-1460

DOI: 10.1109/wacv.2018.00163

Get Full-Text PDF Get Analytical Report

Abstract

Recent advances in deep learning, especially deep convolutional neural networks (CNNs), have led to significant improvement over previous semantic segmentation systems. Here we show how to improve pixel-wise semantic segmentation by manipulating convolution-related operations that are of both theoretical and practical value. First, we design dense upsampling convolution (DUC) to generate pixel-level prediction, which is able to capture and decode more detailed information that is generally missing in bilinear upsampling. Second, we propose a hybrid dilated convolution (HDC) framework in the encoding phase. This framework 1) effectively enlarges the receptive fields (RF) of the network to aggregate global information; 2) alleviates what we call the "gridding issue"caused by the standard dilated convolution operation. We evaluate our approaches thoroughly on the Cityscapes dataset, and achieve a state-of-art result of 80.1% mIOU in the test set at the time of submission. We also have achieved state-of-theart overall on the KITTI road estimation benchmark and the PASCAL VOC2012 segmentation task. Our source code can be found at https://github.com/TuSimple/TuSimple-DUC.

Keywords:

Computer science Upsampling Segmentation Artificial intelligence Pascal (unit) Convolution (computer science) Convolutional neural network Deep learning Pixel Benchmark (surveying) Convolutional code Pattern recognition (psychology) Artificial neural network Algorithm Image (mathematics) Decoding methods

Metrics

1923

Cited By

127.92

FWCI (Field Weighted Citation Impact)

Refs

1.00

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Understanding Convolution for Semantic Segmentation

Abstract

Metrics

Citation History

Topics

Related Documents

Dense Convolution for Semantic Segmentation

Multi - Direction Convolution for Semantic Segmentation

Rethinking 1D convolution for lightweight semantic segmentation

Frequency-Adaptive Dilated Convolution for Semantic Segmentation

Multi-Resolution Convolution for 3D Semantic Segmentation