JOURNAL ARTICLE

InViT: GAN Inversion-Based Vision Transformer for Blind Image Inpainting

Yongqiang DuHaoran LiuShengjie HeSongnan Chen

Year: 2024 Journal:   IEEE Access Vol: 12 Pages: 129956-129965   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Blind image inpainting, the task of detecting corrupted regions with diverse patterns within an image and then generating plausible content for the corrupted regions, remains a both challenging and practical problem in computer vision. In this paper, we propose a novel model InViT for blind image inpainting, which leverages a combination of a pre-trained Generative Adversarial Network (GAN) and a learnable Vision Transformer (ViT). The proposed InViT mainly consists of two phases, the mask prediction phase and the image inpainting phase. Benefiting from the learned latent feature space from the full training data through GAN inversion, a pre-trained StyleGAN is able to provide reliable cues of corrupted regions for mask prediction. By further incorporating the predicted mask into the image inpainting phase, we design a vision Transformer with the mask-aware self-attention mechanism to capture long-range dependencies between pixels during content reconstruction. Besides, we propose a Prompt-augment Contextual Aggregation module to strengthen the reasonableness of generated content for the corrupted regions. Extensive experiments on several benchmark datasets for blind image inpainting demonstrate that our InViT model achieves state-of-the-art performance compared to existing methods in terms of both quantitative metrics and qualitative visual quality.

Keywords:
Computer vision Artificial intelligence Inpainting Computer science Inversion (geology) Transformer Pattern recognition (psychology) Image (mathematics) Geology Electrical engineering Engineering Voltage

Metrics

4
Cited By
2.12
FWCI (Field Weighted Citation Impact)
69
Refs
0.80
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Image and Signal Denoising Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Generative Adversarial Networks and Image Synthesis
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Vision Transformer-Based Image Inpainting Method

Rui Lin

Year: 2023 Pages: 366-372
JOURNAL ARTICLE

Multiscale Window Based Vision Transformer for Image Inpainting

Seong-Joo KimJaeyoung ChoiJae-Young Choi

Journal:   The Journal of Next-generation Convergence Technology Association Year: 2023 Vol: 7 (12)Pages: 2050-2057
JOURNAL ARTICLE

Wavelet frame based blind image inpainting

Bin DongHui JiJia LiZuowei ShenYuhong Xu

Journal:   Applied and Computational Harmonic Analysis Year: 2011 Vol: 32 (2)Pages: 268-279
© 2026 ScienceGate Book Chapters — All rights reserved.