Abstract

In this paper, we address the scene segmentation task by capturing rich contextual dependencies based on the self-attention mechanism. Unlike previous works that capture contexts by multi-scale features fusion, we propose a Dual Attention Networks (DANet) to adaptively integrate local features with their global dependencies. Specifically, we append two types of attention modules on top of traditional dilated FCN, which model the semantic interdependencies in spatial and channel dimensions respectively. The position attention module selectively aggregates the features at each position by a weighted sum of the features at all positions. Similar features would be related to each other regardless of their distances. Meanwhile, the channel attention module selectively emphasizes interdependent channel maps by integrating associated features among all channel maps. We sum the outputs of the two attention modules to further improve feature representation which contributes to more precise segmentation results. We achieve new state-of-the-art segmentation performance on three challenging scene segmentation datasets, i.e., Cityscapes, PASCAL Context and COCO Stuff dataset. In particular, a Mean IoU score of 81.5% on Cityscapes test set is achieved without using coarse data.

Keywords:
Computer science Pascal (unit) Segmentation Artificial intelligence Attention network Context (archaeology) Channel (broadcasting) Dual (grammatical number) Feature (linguistics) Pattern recognition (psychology) Fusion mechanism Representation (politics) Fusion

Metrics

6566
Cited By
349.52
FWCI (Field Weighted Citation Impact)
46
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Scene Segmentation With Dual Relation-Aware Attention Network

Jun FuJing LiuJie JiangYong LiYongjun BaoHanqing Lu

Journal:   IEEE Transactions on Neural Networks and Learning Systems Year: 2020 Vol: 32 (6)Pages: 2547-2560
JOURNAL ARTICLE

MEDANet: More Efficient Dual Attention Network for Scene Segmentation

Pan OuyangXiaoguo YaoZhijian Huang

Journal:   Journal of Circuits Systems and Computers Year: 2024 Vol: 34 (01)
JOURNAL ARTICLE

Dual-Attention Network for Few-Shot Segmentation

Zhikui ChenHan WangSuhua ZhangFangming Zhong

Journal:   ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Year: 2022 Pages: 2210-2214
© 2026 ScienceGate Book Chapters — All rights reserved.