Liang Liao, Liang Wan, Mingsheng Liu, Shusheng Li
In application scenarios that require semantic segmentation, such as autonomous driving, real-time performance is often a higher priority than peak segmentation accuracy. To achieve a good trade-off between speed and accuracy, the two-branch architecture has been proposed in recent years. It processes spatial information and semantic information separately, which allows the model to be composed of two lightweight networks. However, fusing features at two different scales has become a performance bottleneck for many current two-branch models. In this work, we design a new fusion mechanism for the two-branch architecture that is guided by attention computation. Specifically, our proposed Dual-Guided Attention (DGA) module replaces some multi-scale transformations with attention: a few attention layers of near-linear complexity achieve performance comparable to the commonly used multi-layer fusion. To make this module effective, we build one of the two branches of our networks with Residual U-blocks (RSU), which yield better multi-scale features. Extensive experiments on the Cityscapes and CamVid datasets show the effectiveness of our method. On Cityscapes, the light version of our network, without pretrained weights, achieves 71.1% mIoU at 163 FPS on a single Nvidia RTX 3070 with full-resolution images (1024×2048 px), and the large version achieves 77.9% mIoU at 43 FPS, which still meets the real-time criterion. Our code and module are open-sourced at https://github.com/LikeLidoA/Mymodule.
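The attention-guided fusion of two branches at different scales can be sketched as a single cross-attention step, where high-resolution spatial tokens query low-resolution semantic tokens. This is only an illustrative NumPy sketch under assumed shapes and names (`dual_guided_fusion`, `wq`, `wk`, `wv` are hypothetical), not the authors' actual DGA implementation:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def dual_guided_fusion(spatial, semantic, wq, wk, wv):
    # spatial:  (Ns, C) high-resolution tokens from the spatial branch
    # semantic: (Nm, C) low-resolution tokens from the semantic branch
    q = spatial @ wq                 # queries from the spatial branch
    k = semantic @ wk                # keys from the semantic branch
    v = semantic @ wv                # values from the semantic branch
    attn = softmax(q @ k.T / np.sqrt(k.shape[1]))  # (Ns, Nm) attention map
    return spatial + attn @ v        # residual fusion back into spatial features

rng = np.random.default_rng(0)
C = 8
spatial = rng.standard_normal((64, C))   # e.g. an 8x8 spatial feature grid
semantic = rng.standard_normal((16, C))  # e.g. a 4x4 semantic feature grid
wq, wk, wv = (rng.standard_normal((C, C)) for _ in range(3))
fused = dual_guided_fusion(spatial, semantic, wq, wk, wv)
print(fused.shape)  # -> (64, 8)
```

Because the semantic grid is kept small, the attention cost is O(Ns x Nm), which grows near-linearly in the number of high-resolution tokens, in the spirit of the near-linear complexity claimed for the DGA layers.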