Abstract

Efficiently learning generative image models across diverse domains requires transferring knowledge from an image synthesis model trained on a large dataset. We present a recipe for learning vision transformers by generative knowledge transfer. We base our framework on generative vision transformers, which represent an image as a sequence of visual tokens modeled by autoregressive or non-autoregressive transformers. To adapt to a new domain, we employ prompt tuning, which prepends learnable tokens, called prompts, to the image token sequence, and we introduce a new prompt design for our task. We evaluate on a variety of visual domains with varying amounts of training images, and show that knowledge transfer is effective and yields significantly better image generation quality. Code: https://github.com/google-research/generative_transfer
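The prepending step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy dimensions, and the random initialization are all hypothetical, and the sketch assumes the standard prompt-tuning setup in which the pretrained transformer stays frozen and only the prompts are trained.

```python
import random


def make_prompts(num_prompts, dim, seed=0):
    """Create prompt embeddings (hypothetical small Gaussian init)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 0.02) for _ in range(dim)] for _ in range(num_prompts)]


def prepend_prompts(prompts, token_embeddings):
    """Return a new sequence with the prompts placed before the image tokens."""
    if prompts and token_embeddings and len(prompts[0]) != len(token_embeddings[0]):
        raise ValueError("prompt and token embedding dimensions must match")
    return [list(p) for p in prompts] + [list(t) for t in token_embeddings]


# Toy example: 4 image-token embeddings and 2 prompts, embedding dimension 8.
dim = 8
image_tokens = [[float(i)] * dim for i in range(4)]
prompts = make_prompts(num_prompts=2, dim=dim)
sequence = prepend_prompts(prompts, image_tokens)

# Assumption: only the prompt entries are updated during adaptation,
# so the number of trainable values is num_prompts * dim.
trainable_params = len(prompts) * dim
```

In this sketch the transformer would consume `sequence` in place of the bare image tokens, so adapting to a new domain changes only the small set of prompt values.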

Keywords:
Generative grammar, Computer science, Transformer, Transfer of learning, Autoregressive model, Artificial intelligence, Security token, Generative model, Machine learning, Mathematics, Engineering

Metrics

Cited By: 64
FWCI (Field Weighted Citation Impact): 16.35
Refs: 130
Citation Normalized Percentile: 0.99

Topics

Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Generative Adversarial Networks and Image Synthesis
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
