JOURNAL ARTICLE

A hybrid attention multi-scale fusion network for real-time semantic segmentation

Baijun YeRenzheng XueQiuwen Wu

Year: 2025 Journal:   Scientific Reports Vol: 15 (1)Pages: 872-872   Publisher: Nature Portfolio

Abstract

Abstract In semantic segmentation research, spatial information and receptive fields are essential. However, currently, most algorithms focus on acquiring semantic information and lose a significant amount of spatial information, leading to a significant decrease in accuracy despite improving real-time inference speed. This paper proposes a new method to address this issue. Specifically, we have designed a new module (HFRM) that combines channel attention and spatial attention to retrieve the spatial information lost during downsampling and enhance object classification accuracy. Regarding fusing spatial and semantic information, we have designed a new module (HFFM) to merge features of two different levels more effectively and capture a larger receptive field through an attention mechanism. Additionally, edge detection methods have been incorporated to enhance the extraction of boundary information. Experimental results demonstrate that for an input size of 512 × 1024, our proposed method achieves 73.6% mIoU at 176 frames per second (FPS) on the Cityscapes dataset and 70.0% mIoU at 146 FPS on Camvid. Compared to existing networks, our Model achieves faster inference speed while maintaining accuracy, enhancing its practicality.

Keywords:
Computer science Inference Segmentation Artificial intelligence Upsampling Merge (version control) Spatial analysis Pattern recognition (psychology) Focus (optics) Data mining Leverage (statistics) Computer vision Information retrieval Image (mathematics)

Metrics

7
Cited By
33.41
FWCI (Field Weighted Citation Impact)
40
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.