Implicit neural representation (INR) has recently shown a promising ability to represent images at arbitrary resolutions. In this paper, we present the Local Implicit Transformer (LIT), which integrates an attention mechanism and a frequency encoding technique into a local implicit image function. We design a cross-scale local attention block to effectively aggregate local features, and a local frequency encoding block that combines positional encoding with Fourier-domain information for constructing high-resolution images. To further improve representational power, we propose the Cascaded LIT (CLIT), which exploits multi-scale features, along with a cumulative training strategy that gradually increases the upsampling scales during training. We conduct extensive experiments to validate the effectiveness of these components and analyze various training strategies. The qualitative and quantitative results demonstrate that LIT and CLIT achieve favorable results and outperform prior works in arbitrary-scale super-resolution tasks.
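To make the two core ingredients concrete, below is a minimal sketch, not the authors' implementation: a Fourier-style encoding of the relative coordinate between a high-resolution query pixel and its nearest low-resolution feature position (one plausible reading of the local frequency encoding block), plus a hypothetical linear schedule for the cumulative training strategy. The class name, `num_freqs`, `max_scale`, and `warmup_epochs` are all illustrative assumptions, not names from the paper.

```python
import torch
import torch.nn as nn


class LocalFrequencyEncoding(nn.Module):
    """Hypothetical sketch: encode a relative (dy, dx) coordinate with
    sin/cos at log-spaced frequencies, giving the decoder Fourier-domain
    position information alongside the aggregated local features."""

    def __init__(self, num_freqs: int = 8):
        super().__init__()
        # Fixed log-spaced frequencies; the paper may learn these instead.
        self.register_buffer("freqs", 2.0 ** torch.arange(num_freqs))

    def forward(self, rel_coord: torch.Tensor) -> torch.Tensor:
        # rel_coord: (..., 2) relative offsets, assumed normalized to [-1, 1].
        proj = rel_coord.unsqueeze(-1) * self.freqs          # (..., 2, F)
        enc = torch.cat([torch.sin(torch.pi * proj),
                         torch.cos(torch.pi * proj)], dim=-1)  # (..., 2, 2F)
        return enc.flatten(-2)                               # (..., 4F)


def cumulative_scale_schedule(epoch: int, max_scale: float = 4.0,
                              warmup_epochs: int = 100) -> float:
    """Hypothetical cumulative training schedule: the maximum sampled
    upsampling scale grows linearly from 1x to max_scale, so early
    epochs see small scales and later epochs see the full range."""
    return 1.0 + (max_scale - 1.0) * min(epoch / warmup_epochs, 1.0)


# Usage: encode a batch of relative coordinates for HR query pixels.
enc = LocalFrequencyEncoding(num_freqs=8)
rel = torch.rand(4, 64, 64, 2) * 2 - 1      # (B, H, W, 2) in [-1, 1]
print(enc(rel).shape)                       # torch.Size([4, 64, 64, 32])
print(cumulative_scale_schedule(epoch=50))  # 2.5
```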