JOURNAL ARTICLE

Learning Trajectory-Aware Transformer for Video Super-Resolution

Chengxu LiuHuan YangJianlong FuXueming Qian

Year: 2022 Journal:   2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Pages: 5677-5686

Abstract

Video super-resolution (VSR) aims to restore a sequence of high-resolution (HR) frames from their low-resolution (LR) counterparts. Although some progress has been made, there are grand challenges to effectively utilize temporal dependency in entire video sequences. Existing approaches usually align and aggregate video frames from limited adjacent frames (e.g., 5 or 7 frames), which prevents these approaches from satisfactory results. In this paper, we take one step further to enable effective spatio-temporal learning in videos. We propose a novel Trajectory-aware Transformer for Video Super-Resolution (TTVSR). In particular, we formulate video frames into several pre-aligned trajectories which consist of continuous visual tokens. For a query token, self-attention is only learned on relevant visual tokens along spatio-temporal trajectories. Compared with vanilla vision Transformers, such a design significantly reduces the computational cost and enables Transformers to model long-range features. We further propose a cross-scale feature tokenization module to over-come scale-changing problems that often occur in long-range videos. Experimental results demonstrate the superiority of the proposed TTVSR over state-of-the-art models, by extensive quantitative and qualitative evaluations in four widely-used video super-resolution benchmarks. Both code and pre-trained models can be downloaded at https://github.com/researchmm/TTVSR.

Keywords:
Computer science Artificial intelligence Transformer Security token Computer vision Trajectory Frame rate

Metrics

107
Cited By
7.25
FWCI (Field Weighted Citation Impact)
57
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image and Signal Denoising Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation

Chengxu LiuHuan YangJianlong FuXueming Qian

Journal:   IEEE Transactions on Image Processing Year: 2023 Vol: 32 Pages: 4728-4741
JOURNAL ARTICLE

Learning Degradation-Robust Spatiotemporal Frequency-Transformer for Video Super-Resolution

Zhongwei QiuHuan YangJianlong FuDaochang LiuChang XuDongmei Fu

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2023 Vol: 45 (12)Pages: 14888-14904
BOOK-CHAPTER

Self-guided Transformer for Video Super-Resolution

Tong XueQianrui WangXinyi HuangDengshi Li

Lecture notes in computer science Year: 2023 Pages: 186-198
© 2026 ScienceGate Book Chapters — All rights reserved.