JOURNAL ARTICLE

Learning Degradation-Robust Spatiotemporal Frequency-Transformer for Video Super-Resolution

Zhongwei Qiu, Huan Yang, Jianlong Fu, Daochang Liu, Chang Xu, Dongmei Fu

Year: 2023   Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence   Vol: 45 (12)   Pages: 14888-14904   Publisher: IEEE Computer Society

Abstract

Video Super-Resolution (VSR) aims to restore high-resolution (HR) videos from low-resolution (LR) videos. Existing VSR techniques usually recover HR frames by extracting pertinent textures from nearby frames under known degradation processes. Despite significant progress, it remains a major challenge to extract and transmit high-quality textures from heavily degraded low-quality sequences affected by blur, additive noise, and compression artifacts. This work proposes a novel degradation-robust Frequency-Transformer (FTVSR++) for handling low-quality videos, which carries out self-attention in a combined space-time-frequency domain. First, video frames are split into patches and each patch is transformed into spectral maps in which each channel represents a frequency band. This permits fine-grained self-attention on each frequency band, so that real visual texture can be distinguished from artifacts. Second, a novel dual frequency attention (DFA) mechanism is proposed to capture global and local frequency relations, which can handle the varied and complicated degradation processes found in real-world scenarios. Third, we explore different self-attention schemes for video processing in the frequency domain and find that a "divided attention" scheme, which conducts joint space-frequency attention before applying temporal-frequency attention, leads to the best video enhancement quality. Extensive experiments on three widely used VSR datasets show that FTVSR++ outperforms state-of-the-art methods on different low-quality videos with clear visual margins.
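
The first step described in the abstract (splitting frames into patches and turning each patch into spectral maps with one channel per frequency band) can be illustrated with a minimal sketch. The abstract does not name the spectral transform, so the 2D DCT used here is an assumption, as are the 8x8 patch size and the helper names `dct_matrix`, `frame_to_spectral_maps`, and `spectral_maps_to_frame`. This is not the authors' implementation.

```python
import numpy as np


def dct_matrix(n: int) -> np.ndarray:
    """Orthonormal DCT-II basis as an (n, n) matrix; row k is frequency k."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    m = np.cos(np.pi * (2 * i + 1) * k / (2 * n)) * np.sqrt(2.0 / n)
    m[0, :] = np.sqrt(1.0 / n)
    return m


def frame_to_spectral_maps(frame: np.ndarray, patch: int = 8) -> np.ndarray:
    """Map an (H, W) frame to (patch*patch, H//patch, W//patch) spectral maps.

    Assumption: a per-patch 2D DCT stands in for the unspecified transform.
    Channel c holds the coefficient of frequency band c for every patch.
    """
    h, w = frame.shape
    if h % patch or w % patch:
        raise ValueError("frame size must be a multiple of the patch size")
    d = dct_matrix(patch)
    # (H, W) -> (H/p, W/p, p, p): one p x p block per patch position.
    blocks = frame.reshape(h // patch, patch, w // patch, patch)
    blocks = blocks.transpose(0, 2, 1, 3)
    # 2D DCT of each block: D @ block @ D.T
    coeffs = np.einsum("ui,abij,vj->abuv", d, blocks, d)
    # Flatten the (u, v) frequency indices into a channel axis.
    maps = coeffs.reshape(h // patch, w // patch, patch * patch)
    return maps.transpose(2, 0, 1)


def spectral_maps_to_frame(maps: np.ndarray, patch: int = 8) -> np.ndarray:
    """Inverse of frame_to_spectral_maps (inverse DCT, then re-tile patches)."""
    _, gh, gw = maps.shape
    d = dct_matrix(patch)
    coeffs = maps.transpose(1, 2, 0).reshape(gh, gw, patch, patch)
    # Inverse of an orthonormal DCT: D.T @ coeffs @ D
    blocks = np.einsum("ui,abuv,vj->abij", d, coeffs, d)
    return blocks.transpose(0, 2, 1, 3).reshape(gh * patch, gw * patch)
```

In this layout, attention applied along the channel axis compares frequency bands, while attention applied over the two remaining axes compares patch positions. Because the transform is orthonormal, `spectral_maps_to_frame` recovers the input frame up to floating-point error.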

Keywords:
Computer science Artificial intelligence Frequency domain Computer vision Time–frequency analysis Frame rate Feature extraction Pattern recognition (psychology)

Metrics

Cited By: 9
FWCI (Field Weighted Citation Impact): 1.64
Refs: 70
Citation Normalized Percentile: 0.81

Topics

Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image and Signal Denoising Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Learning Trajectory-Aware Transformer for Video Super-Resolution

Chengxu Liu, Huan Yang, Jianlong Fu, Xueming Qian

Journal: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)   Year: 2022   Pages: 5677-5686

JOURNAL ARTICLE

Robust super resolution of compressed video

Xiaohong Zhang, Min Tang, Ruofeng Tong

Journal: The Visual Computer   Year: 2012   Vol: 28 (12)   Pages: 1167-1180