JOURNAL ARTICLE

Class-Balanced Text to Image Synthesis With Attentive Generative Adversarial Network

Min Wang, Congyan Lang, Liqian Liang, Gengyu Lyu, Songhe Feng, Tao Wang

Year: 2021   Journal: IEEE Multimedia   Vol: 28 (3)   Pages: 21-31   Publisher: IEEE Computer Society

Abstract

Although the text-to-image synthesis task has shown significant progress, generating high-quality images remains a challenge. In this article, we first propose an attention-driven, cycle-refinement generative adversarial network, AGAN-v1, to bridge the domain gap between visual contents and semantic concepts by constructing spatial configurations of objects. The generation of image contours is the core component, in which an attention mechanism is developed to refine local details of images by focusing on the objects that complement one subregion. Second, an advanced class-balanced generative adversarial network, AGAN-v2, is proposed to address the problem of long-tailed data distribution. Importantly, it is the first method to solve this problem in the text-to-image synthesis task. AGAN-v2 introduces a reweighting scheme that uses the effective number of samples for each class to rebalance the generative loss. Extensive quantitative and qualitative experiments on the CUB and MS-COCO datasets demonstrate that the proposed AGAN-v2 significantly outperforms state-of-the-art methods.
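
The reweighting idea mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the formula, so this assumes the commonly used definition of the effective number of samples, E_n = (1 - beta^n) / (1 - beta), with class weights proportional to 1 / E_n. The function names, the value of beta, the normalization of the weights, and the example class counts are all illustrative assumptions, not details taken from the article.

```python
from typing import List, Sequence


def effective_number(n: int, beta: float) -> float:
    """Effective number of samples, E_n = (1 - beta**n) / (1 - beta).

    Assumed definition; the abstract does not spell it out.
    """
    if not 0.0 <= beta < 1.0:
        raise ValueError("beta must lie in [0, 1)")
    if n <= 0:
        raise ValueError("n must be a positive sample count")
    return (1.0 - beta ** n) / (1.0 - beta)


def class_balanced_weights(counts: Sequence[int], beta: float = 0.999) -> List[float]:
    """Per-class weights proportional to 1 / E_n.

    The weights are rescaled to sum to the number of classes
    (an assumed normalization, chosen so a balanced dataset gets weight 1).
    """
    inverse = [1.0 / effective_number(n, beta) for n in counts]
    scale = len(counts) / sum(inverse)
    return [w * scale for w in inverse]


def reweighted_loss(
    per_sample_losses: Sequence[float],
    labels: Sequence[int],
    weights: Sequence[float],
) -> float:
    """Mean of per-sample losses, each scaled by the weight of its class."""
    weighted = [weights[y] * loss for loss, y in zip(per_sample_losses, labels)]
    return sum(weighted) / len(weighted)


# Hypothetical long-tailed class counts: one head class, two tail classes.
counts = [1000, 100, 10]
weights = class_balanced_weights(counts)
```

With beta = 0 every class has an effective number of 1, so the weights are uniform. As beta approaches 1 the weights approach inverse class frequency, so rarer classes contribute more to the loss.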

Keywords:
Computer science; Generative grammar; Image (mathematics); Adversarial system; Complement (music); Class (philosophy); Task (project management); Artificial intelligence; Domain (mathematical analysis); Image synthesis; Component (thermodynamics); Pattern recognition (psychology); Theoretical computer science; Mathematics

Metrics

Cited By: 3
FWCI (Field Weighted Citation Impact): 0.20
Refs: 27
Citation Normalized Percentile: 0.45

Topics

Generative Adversarial Networks and Image Synthesis
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition