Zhicheng Geng, Luming Liang, Tianyu Ding, Ilya Zharkov
Space-time video super-resolution (STVSR) is the task of interpolating videos with both Low Frame Rate (LFR) and Low Resolution (LR) to produce High-Frame-Rate (HFR) and High-Resolution (HR) counterparts. Existing methods based on Convolutional Neural Networks (CNNs) succeed in achieving visually satisfying results but suffer from slow inference speed due to their heavy architectures. We propose to resolve this issue with a spatial-temporal transformer that naturally incorporates the spatial and temporal super-resolution modules into a single model. Unlike CNN-based methods, we do not explicitly use separate building blocks for temporal interpolation and spatial super-resolution; instead, we use only a single end-to-end transformer architecture. Specifically, a reusable dictionary is built by encoders from the input LFR and LR frames, which is then utilized in the decoder to synthesize the HFR and HR frames. Compared with the state-of-the-art TMNet [54], our network is 60% smaller (4.5M vs. 12.3M parameters) and 80% faster (26.2 fps vs. 14.3 fps on 720×576 frames) without sacrificing much performance. The source code is available at https://github.com/llmpass/RSTT.
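The key efficiency claim is that the encoder builds a dictionary once from the input frames, and the decoder reuses it for every output time step. A minimal sketch of this idea, using plain cross-attention with numpy (all names, dimensions, and weights below are illustrative, not from the paper's implementation):

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

rng = np.random.default_rng(0)
d = 16                                  # feature dimension (illustrative)
frames = rng.standard_normal((4, d))    # 4 input LFR/LR frames as flat features

# Encoder side: project the input frames into a key/value "dictionary"
# exactly once; random projections stand in for learned weights.
W_k, W_v = rng.standard_normal((d, d)), rng.standard_normal((d, d))
keys, values = frames @ W_k, frames @ W_v   # built once, reused below

def decode(query_t):
    """One cross-attention step synthesizing an output frame for time query_t,
    reading from the shared dictionary instead of re-encoding the inputs."""
    attn = softmax(query_t @ keys.T / np.sqrt(d))
    return attn @ values

# Decoder side: synthesize several HFR/HR frames from the SAME dictionary.
queries = rng.standard_normal((7, d))   # 7 output time queries
out = np.stack([decode(q) for q in queries])
print(out.shape)  # (7, 16)
```

The point of the sketch is the cost structure: the encoding work is amortized across all interpolated frames, so adding more output time steps only adds cheap decoder queries.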