JOURNAL ARTICLE

AMFFNet: Adaptive Multi-Scale Feature Fusion Network for Urban Image Semantic Segmentation

Shuting HuangHaiyan Huang

Year: 2025 Journal:   Electronics Vol: 14 (12)Pages: 2344-2344   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Urban image semantic segmentation faces challenges including the coexistence of multi-scale objects, blurred semantic relationships between complex structures, and dynamic occlusion interference. Existing methods often struggle to balance global contextual understanding of large scenes and fine-grained details of small objects due to insufficient granularity in multi-scale feature extraction and rigid fusion strategies. To address these issues, this paper proposes an Adaptive Multi-scale Feature Fusion Network (AMFFNet). The network primarily consists of four modules: a Multi-scale Feature Extraction Module (MFEM), an Adaptive Fusion Module (AFM), an Efficient Channel Attention (ECA) module, and an auxiliary supervision head. Firstly, the MFEM utilizes multiple depthwise strip convolutions to capture features at various scales, effectively leveraging contextual information. Then, the AFM employs a dynamic weight assignment strategy to harmonize multi-level features, enhancing the network’s ability to model complex urban scene structures. Additionally, the ECA attention mechanism introduces cross-channel interactions and nonlinear transformations to mitigate the issue of small-object segmentation omissions. Finally, the auxiliary supervision head enables shallow features to directly affect the final segmentation results. Experimental evaluations on the CamVid and Cityscapes datasets demonstrate that the proposed network achieves superior mean Intersection over Union (mIoU) scores of 77.8% and 81.9%, respectively, outperforming existing methods. The results confirm that AMFFNet has a stronger ability to understand complex urban scenes.

Keywords:
Feature (linguistics) Computer science Artificial intelligence Scale (ratio) Segmentation Pattern recognition (psychology) Image (mathematics) Fusion Computer vision Image fusion Image segmentation Geography Cartography

Metrics

2
Cited By
6.13
FWCI (Field Weighted Citation Impact)
34
Refs
0.90
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Automated Road and Building Extraction
Physical Sciences →  Engineering →  Ocean Engineering
Remote Sensing and LiDAR Applications
Physical Sciences →  Environmental Science →  Environmental Engineering
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.