Rethinking BiSeNet For Real-time Semantic Segmentation

Mingyuan Fan; Shenqi Lai; Junshi Huang; Xiaoming Wei; Zhenhua Chai; Junfeng Luo; Xiaolin Wei

doi:10.1109/cvpr46437.2021.00959

ScienceGate Book Chapters

JOURNAL ARTICLE

Rethinking BiSeNet For Real-time Semantic Segmentation

Mingyuan Fan Shenqi Lai Junshi Huang Xiaoming Wei Zhenhua Chai Junfeng Luo Xiaolin Wei

Year: 2021 Pages: 9711-9720

DOI: 10.1109/cvpr46437.2021.00959

Get Full-Text PDF Get Analytical Report

Abstract

BiSeNet [28], [27] has been proved to be a popular two-stream network for real-time segmentation. However, its principle of adding an extra path to encode spatial information is time-consuming, and the backbones borrowed from pretrained tasks, e.g., image classification, may be inefficient for image segmentation due to the deficiency of task-specific design. To handle these problems, we propose a novel and efficient structure named Short-Term Dense Concatenate network (STDC network) by removing structure redundancy. Specifically, we gradually reduce the dimension of feature maps and use the aggregation of them for image representation, which forms the basic module of STDC network. In the decoder, we propose a Detail Aggregation module by integrating the learning of spatial information into low-level layers in single-stream manner. Finally, the low-level features and deep features are fused to predict the final segmentation results. Extensive experiments on Cityscapes and CamVid dataset demonstrate the effectiveness of our method by achieving promising trade-off between segmentation accuracy and inference speed. On Cityscapes, we achieve 71.9% mIoU on the test set with a speed of 250.4 FPS on NVIDIA GTX 1080Ti, which is 45.2% faster than the latest methods, and achieve 76.8% mIoU with 97.0 FPS while inferring on higher resolution images. Code is available at https://github.com/MichaelFan01/STDC-Seg.

Keywords:

Computer science Segmentation Artificial intelligence Inference Redundancy (engineering) ENCODE Feature (linguistics) Image segmentation Representation (politics) Code (set theory) Pattern recognition (psychology) Set (abstract data type) Encoding (memory) Computer vision

Metrics

681

Cited By

51.52

FWCI (Field Weighted Citation Impact)

Refs

1.00

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Rethinking BiSeNet For Real-time Semantic Segmentation

Abstract

Metrics

Citation History

Topics

Related Documents

BiSeNet: Bilateral Segmentation Network for Real-Time Semantic Segmentation

Real-time semantic segmentation based on improved BiSeNet

Far-Sighted BiSeNet V2 for Real-time Semantic Segmentation

Augmented-Training-Aware Bisenet for Real-Time Semantic Segmentation

Faster BiSeNet: A Faster Bilateral Segmentation Network for Real-time Semantic Segmentation