JOURNAL ARTICLE

Memorizing Swin-Transformer Denoising Network for Diffusion Model

Jindou Chen, Yiqing Shen

Year: 2024   Journal: Electronics   Vol: 13 (20)   Pages: 4050-4050   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Diffusion models have garnered significant attention in the field of image generation. However, existing denoising architectures such as U-Net are limited in capturing global context, while Vision Transformers (ViTs) may struggle with local receptive fields. To address these challenges, we propose a novel Swin-Transformer-based denoising network architecture that leverages the strengths of both U-Net and ViT. Moreover, our approach integrates a k-Nearest Neighbor (kNN) based memorizing attention module into the Swin-Transformer, enabling it to harness crucial contextual information from feature maps and enhance its representational capacity. Finally, we introduce a hierarchical time stream embedding scheme that optimizes the incorporation of temporal cues during the denoising process. This scheme surpasses basic approaches such as simple addition or concatenation of fixed time embeddings, facilitating a more effective fusion of temporal information. Extensive experiments conducted on four benchmark datasets demonstrate the superior performance of our proposed model compared to U-Net and ViT as denoising networks. Our model outperforms baselines on the CRC-VAL-HE-7K and CelebA datasets, achieving improved FID scores of 14.39 and 4.96, respectively, and even surpasses DiT and UViT under our experimental setting. The Memorizing Swin-Transformer architecture, coupled with the hierarchical time stream embedding, sets a new state of the art in denoising diffusion models for image generation.
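
The abstract does not spell out how the kNN-based memorizing attention module works, so the following is only a minimal sketch of the general idea of kNN-retrieval attention, not the authors' implementation. It assumes a single query vector, a small in-memory list of stored key/value pairs, and scaled dot-product similarity; the function name `knn_memory_attention` and all of its parameters are hypothetical.

```python
import math


def knn_memory_attention(query, mem_keys, mem_values, k=2):
    """Attend from one query vector to its k most similar memory entries.

    Illustrative only: names, shapes, and the similarity measure are
    assumptions, not details taken from the paper.
    """
    if not mem_keys or len(mem_keys) != len(mem_values):
        raise ValueError("memory keys and values must be non-empty and aligned")
    k = min(k, len(mem_keys))
    scale = 1.0 / math.sqrt(len(query))

    # Scaled dot-product similarity between the query and every stored key.
    scores = [scale * sum(q * x for q, x in zip(query, key)) for key in mem_keys]

    # Keep only the indices of the k highest-scoring keys (the kNN step).
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]

    # Numerically stable softmax over the retrieved scores only.
    peak = max(scores[i] for i in top)
    weights = [math.exp(scores[i] - peak) for i in top]
    total = sum(weights)
    weights = [w / total for w in weights]

    # Weighted sum of the retrieved values.
    dim = len(mem_values[0])
    out = [sum(w * mem_values[i][d] for w, i in zip(weights, top)) for d in range(dim)]
    return out, top, weights


# Toy memory of three key/value pairs and one query.
keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
values = [[10.0, 0.0], [0.0, 10.0], [5.0, 5.0]]
out, top, weights = knn_memory_attention([1.0, 0.0], keys, values, k=2)
```

Restricting the softmax to the k retrieved entries keeps the cost of the weighting step independent of the memory size; a practical version would replace the exhaustive score computation with a batched or approximate nearest-neighbor search.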

Keywords:
Transformer; Computer science; Diffusion; Noise reduction; Electronic engineering; Artificial intelligence; Electrical engineering; Engineering; Physics; Voltage

Metrics

Cited By: 2
FWCI (Field Weighted Citation Impact): 1.06
Refs: 28
Citation Normalized Percentile: 0.69

Topics

Image and Signal Denoising Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Generative Adversarial Networks and Image Synthesis
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Neural Networks and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Swin Transformer for Seismic Denoising

Fang Li, Hailong Liu, 伟 王, Jianwei Ma

Journal: IEEE Geoscience and Remote Sensing Letters   Year: 2024   Vol: 21   Pages: 1-5

JOURNAL ARTICLE

Implicit Multi-Scale Swin Transformer Network for Image Denoising

Qi Zhang, Yuwei Ding, Weiqi Zhang, Yian Zhu, Bob Zhang, Jerry Chun‐Wei Lin

Journal: IEEE Transactions on Consumer Electronics   Year: 2025   Vol: 71 (2)   Pages: 5584-5594

JOURNAL ARTICLE

SUNet: Swin Transformer UNet for Image Denoising

Chi-Mao Fan, Tsung-Jung Liu, Kuan-Hsien Liu

Journal: 2022 IEEE International Symposium on Circuits and Systems (ISCAS)   Year: 2022   Pages: 2333-2337