JOURNAL ARTICLE

Multi‐scale feature aggregation network for single‐image dehazing

D. Zhenbing ZhaoBo MoZhu Xiang

Year: 2024 Journal:   IET Image Processing Vol: 18 (11)Pages: 2943-2961   Publisher: Institution of Engineering and Technology

Abstract

Abstract Transformer possesses a broader perceptual scope, while the Convolutional Neural Network (CNN) excels at capturing local information. In this paper, the authors propose the Multi‐Sclale Feature Aggregation Network (MSFA‐Net) for single‐image dehazing which is fused with the advantages of Transformer and CNN. Our MSFA‐Net is based on the encoder–decoder structure, and there are four main innovations. Firstly, the authors make some improvements to the original Swin Transformer to make it more effective for dehazing tasks, and the authors name it Spatial Information Aggregation Transformer (SIAT). The authors place the SIAT in both encoder and decoder of MSFA‐Net for feature extraction. The authors propose an upsampling module called Efficient Spatial Resolution Recovery (ESRR) which is placed in the decoder part. Compared to commonly used transposed convolutions, the authors’ ESRR module has fewer computational cost. Considering that the haze distribution is always uneven and the information from each channel is different, the authors introduce the Dynamic Multi‐Attention (DMA) module to provide pixel‐wise weights and channel‐wise weights for input features. The authors place the DMA module between the encoder and decoder parts. As the network depth increases, the spatial structural information from the high‐resolution layer tends to degrade. To deal with the problem, the authors propose the Multi‐Scale Feature Fusion (MSFF) module to recover missing spatial structural information. The authors place the MSFF module in both the encoder and decoder parts. Extensive experimental results show that the authors’ proposed dehazing network achieves state‐of‐the‐art dehazing performance with relatively low computational cost.

Keywords:
Computer science Encoder Transformer Upsampling Artificial intelligence Convolutional neural network Feature (linguistics) Computer vision Pattern recognition (psychology) Image (mathematics) Voltage

Metrics

1
Cited By
0.53
FWCI (Field Weighted Citation Impact)
74
Refs
0.52
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Image Enhancement Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Fire Detection and Safety Systems
Physical Sciences →  Engineering →  Safety, Risk, Reliability and Quality

Related Documents

JOURNAL ARTICLE

Multi-stream feature aggregation network with multi-scale supervision for single image dehazing

Junjiang WuHaibo TaoKai XiaoJun ChuLu Leng

Journal:   Engineering Applications of Artificial Intelligence Year: 2024 Vol: 139 Pages: 109486-109486
JOURNAL ARTICLE

Local multi-scale feature aggregation network for real-time image dehazing

Yong LiuXiaorong Hou

Journal:   Pattern Recognition Year: 2023 Vol: 141 Pages: 109599-109599
JOURNAL ARTICLE

Attention-adaptive multi-scale feature aggregation dehazing network

Zhuo SuRuizhi LiuYuxin FengFan Zhou

Journal:   Journal of Visual Communication and Image Representation Year: 2022 Vol: 90 Pages: 103706-103706
© 2026 ScienceGate Book Chapters — All rights reserved.