DUAL PYRAMIDS ENCODER-DECODER NETWORK FOR SEMANTIC SEGMENTATION IN GROUND AND AERIAL VIEW IMAGES

Shengli Jiang; Guihong Li; Wei Yao; Zhenjie Hong; Tae‐Yong Kuc

doi:10.5194/isprs-archives-xliii-b2-2020-605-2020

ScienceGate Book Chapters

JOURNAL ARTICLE

DUAL PYRAMIDS ENCODER-DECODER NETWORK FOR SEMANTIC SEGMENTATION IN GROUND AND AERIAL VIEW IMAGES

Shengli Jiang Guihong Li Wei Yao Zhenjie Hong Tae‐Yong Kuc

Year: 2020 Journal: The international archives of the photogrammetry, remote sensing and spatial information sciences/International archives of the photogrammetry, remote sensing and spatial information sciences Vol: XLIII-B2-2020 Pages: 605-610 Publisher: Copernicus Publications

DOI: 10.5194/isprs-archives-xliii-b2-2020-605-2020

Get Full-Text PDF Get Analytical Report

Abstract

Abstract. Semantic segmentation is a fundamental research task in computer vision, which intends to assign a certain category to every pixel. Currently, most existing methods only utilize the deepest feature map for decoding, while high-level features get inevitably lost during the procedure of down-sampling. In the decoder section, transposed convolution or bilinear interpolation was widely used to restore the size of the encoded feature map; however, few optimizations are applied during up-sampling process which is detrimental to the performance for grouping and classification. In this work, we proposed a dual pyramids encoder-decoder deep neural network (DPEDNet) to tackle the above issues. The first pyramid integrated and encoded multi-resolution features through sequentially stacked merging, and the second pyramid decoded the features through dense atrous convolution with chained up-sampling. Without post-processing and multi-scale testing, the proposed network has achieved state-of-the-art performances on two challenging benchmark image datasets for both ground and aerial view scenes.

Keywords:

Computer science Artificial intelligence Pyramid (geometry) Bilinear interpolation Encoder Feature (linguistics) Segmentation Benchmark (surveying) Computer vision Convolution (computer science) Interpolation (computer graphics) Pattern recognition (psychology) Decoding methods Sampling (signal processing) Decimation Artificial neural network Image (mathematics) Algorithm Filter (signal processing) Mathematics Geography

Metrics

Cited By

0.10

FWCI (Field Weighted Citation Impact)

Refs

0.39

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Enhancement Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

DUAL PYRAMIDS ENCODER-DECODER NETWORK FOR SEMANTIC SEGMENTATION IN GROUND AND AERIAL VIEW IMAGES

Abstract

Metrics

Citation History

Topics

Related Documents

Semantic segmentation of agricultural aerial images using encoder-decoder models

Semantic segmentation of agricultural aerial images using encoder-decoder models

Dual encoder–decoder-based deep polyp segmentation network for colonoscopy images

Multispectral Semantic Land Cover Segmentation From Aerial Imagery With Deep Encoder–Decoder Network

Image Semantic Segmentation Based on Encoder-Decoder Network