JOURNAL ARTICLE

Multi-Scale Dilated Convolution Transformer for Single Image Deraining

Abstract

Recently, Transformer-based methods have achieved significant improvements over convolutional neural networks (CNNs) in single image deraining, due to the powerful ability of modeling non-local information. In fact, rich local-global information representations are equally important for better satisfying rain removal. In this paper, we propose an effective image deraining method by integrating a CNN model into the Transformer backbone to accelerate network convergence, called Multi-scale Dilated-convolution Transformer (MDT), which fully leverages the learning capabilities of Transformers on non-local features, seamlessly integrating local detail extraction and global structural representation. The fundamental building unit of our framework is the Multi-scale Dilated-convolution Transformer Block (MDTB) with different dilation rates, which consists of the Dilconv Self-Attention (DSA) and the Dilconv Feed-Forward Network (DFN). Specifically, the former processes the contextual information via dilated convolutions and enables the model to emphasize spatially-varying rain distribution features, while the latter integrates the dual-branch information to facilitate the local feature learning for better feature aggregation. Extensive evaluations demonstrate that our model reaches superior performance, significantly improving the image deraining quality.

Keywords:
Computer science Transformer Dilation (metric space) Convolutional neural network Artificial intelligence Feature learning Feature extraction Pattern recognition (psychology) Convolution (computer science) Artificial neural network Computer vision Mathematics Voltage Engineering

Metrics

3
Cited By
0.55
FWCI (Field Weighted Citation Impact)
39
Refs
0.63
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Image Enhancement Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image and Signal Denoising Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.