Multiple Resolutions Detail Enhancement Network for Real-Time Image Semantic Segmentation

Jing Gu; Xinkai Sun; Jie Feng; Shuyuan Yang; Fang Liu; Licheng Jiao

doi:10.1109/tai.2024.3355354

JOURNAL ARTICLE

Year: 2024
Journal: IEEE Transactions on Artificial Intelligence
Vol: 5 (7)
Pages: 3393-3407
Publisher: Institute of Electrical and Electronics Engineers

Abstract

Real-time image semantic segmentation, a basis of scene understanding, is attracting growing research attention and has been applied in many fields that require fast interaction and response, such as autonomous driving and robot control. Because low-level spatial information is lost as network layers deepen, we propose a multiple resolutions detail enhancement network (MRDENet) that extracts and exploits accurate low-level detail information from the original image at different resolutions. MRDENet consists of three lightweight branch sub-networks, with dense oblique connections between adjacent branches that preserve the effective features of the previous branch at different levels. Furthermore, a new multi-level information aggregation module fuses the low-level detail features and the high-level semantic features of the different branches using group convolution and channel shuffle at low computational cost, so that MRDENet achieves a favorable trade-off between segmentation precision and inference speed. Experimental results show that MRDENet achieves 73.1% mIoU at 93 FPS on the Cityscapes dataset and 68.5% mIoU at 112 FPS on the CamVid dataset, which indicates that its performance is competitive with state-of-the-art methods.
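The abstract names group convolution and channel shuffle as the low-cost operations behind the aggregation module but does not describe the module's layout. As a rough illustration of those two generic operations only (not MRDENet's actual module), the sketch below counts the weights of a grouped convolution and applies a ShuffleNet-style channel shuffle to a list of channel labels. The function names `conv_weight_count` and `channel_shuffle`, and the channel sizes used, are hypothetical and chosen for illustration.

```python
def conv_weight_count(c_in, c_out, kernel, groups=1):
    """Weights in a 2D convolution (bias ignored), assuming a square kernel.

    With `groups` > 1 each output channel only sees c_in / groups inputs,
    so the weight count shrinks by a factor of `groups`.
    """
    if c_in % groups or c_out % groups:
        raise ValueError("channel counts must be divisible by groups")
    return (c_in // groups) * c_out * kernel * kernel


def channel_shuffle(channels, groups):
    """Interleave channels across groups (illustrative, on a plain list).

    Equivalent to viewing the list as (groups, per_group), transposing,
    and flattening, so the next grouped layer mixes information that
    came from different groups.
    """
    n = len(channels)
    if n % groups:
        raise ValueError("number of channels must be divisible by groups")
    per_group = n // groups
    return [channels[g * per_group + i]
            for i in range(per_group)
            for g in range(groups)]


# Hypothetical sizes: a 3x3 convolution with 64 input and 64 output channels.
dense = conv_weight_count(64, 64, 3)               # standard convolution
grouped = conv_weight_count(64, 64, 3, groups=4)   # 4 groups, 4x fewer weights

# Six channels in two groups: [0, 1, 2] and [3, 4, 5].
shuffled = channel_shuffle(list(range(6)), groups=2)
```

Grouping alone keeps each group's channels isolated from the others; the shuffle is what lets a following grouped layer combine features across groups without the cost of a dense convolution.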

Keywords:

Computer science; Segmentation; Convolution (computer science); Fuse (electrical); Artificial intelligence; Inference; Computation; Image (mathematics); Computer vision; Semantics (computer science); Channel (broadcasting); Robot; Image segmentation; Oblique case; Pattern recognition (psychology); Algorithm; Engineering

Metrics

FWCI (Field Weighted Citation Impact): 4.24

Citation Normalized Percentile: 0.89

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Medical Image Segmentation Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Bilateral Detail Enhancement Network for Real-Time Semantic Segmentation

Semantics recalibration and detail enhancement network for real‐time semantic segmentation

Detail Guided Multilateral Segmentation Network for Real-Time Semantic Segmentation

BMSeNet: Multiscale Context Pyramid Pooling and Spatial Detail Enhancement Network for Real-Time Semantic Segmentation

DESENet: a bilateral network with detail-enhanced semantic encoder for real-time semantic segmentation