JOURNAL ARTICLE

Semantic-Aware Data Augmentation for Text-to-Image Synthesis

Zhaorui Tan, Xi Yang, Kaizhu Huang

Year: 2024 Journal: Proceedings of the AAAI Conference on Artificial Intelligence Vol: 38 (6) Pages: 5098-5107 Publisher: Association for the Advancement of Artificial Intelligence

Abstract

Data augmentation has recently been leveraged as an effective regularizer in various vision-language deep neural networks. However, in text-to-image synthesis (T2Isyn), current augmentation practice still suffers from semantic mismatch between augmented paired data. Even worse, semantic collapse may occur when generated images are less semantically constrained. In this paper, we develop a novel Semantic-aware Data Augmentation (SADA) framework dedicated to T2Isyn. In particular, we propose to augment texts in the semantic space via an Implicit Textual Semantic Preserving Augmentation, in conjunction with a specifically designed Image Semantic Regularization Loss as Generated Image Semantic Conservation, to cope with semantic mismatch and collapse. As one major contribution, we theoretically show that Implicit Textual Semantic Preserving Augmentation can certify better text-image consistency, while the Image Semantic Regularization Loss, by regularizing the semantics of generated images, avoids semantic collapse and enhances image quality. Extensive experiments validate that SADA enhances text-image consistency and significantly improves image quality in T2Isyn models across various backbones. Notably, incorporating SADA during the tuning of Stable Diffusion models also yields performance improvements.

Keywords:
Computer science; Image (mathematics); Image synthesis; Natural language processing; Information retrieval; Artificial intelligence

Metrics

Cited By: 6
FWCI (Field Weighted Citation Impact): 4.10
Refs: 0
Citation Normalized Percentile: 0.88

Topics

Computer Graphics and Visualization Techniques
Physical Sciences →  Computer Science →  Computer Graphics and Computer-Aided Design
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Semantic-aware Mapping for Text-to-Image Synthesis

Khushboo Patel

Journal: Journal of Information Systems Engineering & Management Year: 2025 Vol: 10 (2) Pages: 746-754
JOURNAL ARTICLE

SAST: Semantic-Aware stylized Text-to-Image generation

Xinyue Sun, Jing Guo, Yongzhen Ke, Shuai Yang, Kai Wang, Y. J. Wu

Journal: Journal of Visual Communication and Image Representation Year: 2025 Vol: 115 Pages: 104685-104685