JOURNAL ARTICLE

CACFNet: Cross-Modal Attention Cascaded Fusion Network for RGB-T Urban Scene Parsing

Wujie ZhouShaohua DongMeixin FangLu Yu

Year: 2023 Journal:   IEEE Transactions on Intelligent Vehicles Vol: 9 (1)Pages: 1919-1929   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Color–thermal (RGB-T) urban scene parsing has recently attracted widespread interest. However, most existing approaches to RGB-T urban scene parsing do not deeply explore the information complementarity between RGB-T features. In this study, we propose a cross-modal attention-cascaded fusion network (CACFNet) that fully exploits cross-modality. In our design, a cross-modal attention fusion module mines complementary information from two modalities. Subsequently, a cascaded fusion module decodes the multi-level features in an up-bottom manner. Noting that each pixel is labeled with the category of the region to which it belongs, we present a region-based module that explores the relationship between pixel and region. Moreover, in contrast to previous methods that employ only the cross-entropy loss to penalize pixel-wise predictions, we propose an additional loss to learn pixel–pixel relationships. Extensive experiments on two datasets demonstrate that the proposed CACFNet achieves state-of-the-art performance in RGB-T urban scene parsing.

Keywords:
Parsing Modal Computer science Artificial intelligence RGB color model Fusion Computer vision Linguistics Materials science

Metrics

41
Cited By
6.72
FWCI (Field Weighted Citation Impact)
59
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Remote Sensing and LiDAR Applications
Physical Sciences →  Environmental Science →  Environmental Engineering
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Automated Road and Building Extraction
Physical Sciences →  Engineering →  Ocean Engineering

Related Documents

JOURNAL ARTICLE

Cross-Modal Attention Guided Enhanced Fusion Network for RGB-T Tracking

Jun LiuWei KeShuai WangDa YangSizhe Wang

Journal:   IEEE Signal Processing Letters Year: 2025 Vol: 33 Pages: 276-280
JOURNAL ARTICLE

EGFNet: Edge-Aware Guidance Fusion Network for RGB–Thermal Urban Scene Parsing

Shaohua DongWujie ZhouCaie XuWeiqing Yan

Journal:   IEEE Transactions on Intelligent Transportation Systems Year: 2023 Vol: 25 (1)Pages: 657-669
JOURNAL ARTICLE

Cross-modal attention fusion network for RGB-D semantic segmentation

Qiankun ZhaoYingcai WanJiqian XuLijin Fang

Journal:   Neurocomputing Year: 2023 Vol: 548 Pages: 126389-126389
© 2026 ScienceGate Book Chapters — All rights reserved.