KnHiGAN: Knowledge-enhanced Hierarchical Generative Adversarial Network for Fine-grained Text-to-Image Synthesis

Ning Ge; Yonghua Zhu; Xiaoyu Xiong; Binghui Zheng; Jieyu Huang

doi:10.1109/iscid52796.2021.00088

ScienceGate Book Chapters

JOURNAL ARTICLE

KnHiGAN: Knowledge-enhanced Hierarchical Generative Adversarial Network for Fine-grained Text-to-Image Synthesis

Ning Ge Yonghua Zhu Xiaoyu Xiong Binghui Zheng Jieyu Huang

Year: 2021 Pages: 357-360

DOI: 10.1109/iscid52796.2021.00088

Get Full-Text PDF Get Analytical Report

Abstract

To generate fine-grained images with greater authenticity, in this paper, we propose a Knowledge-enhanced Hierarchical Generative Adversarial Network (KnHiGAN) for text-to-image synthesis. KnHiGAN sets up a Knowledge Enhancement Module to expand conditions for the limited text descriptions by combining with the knowledge graph, as a result, it can provide richer fine-grained details to the generative network. Moreover, a Hierarchical Generative Adversarial Network is designed to generate the foreground and background separately, and the two are integrated together to composite the final result. Experiments on CUB-200 and Oxford-102 datasets show that our KnHiGAN can not only generate the fine-grained images which are more like those that exist in the real world, but also can maintain a high degree of consistency with the original text input.

Keywords:

Adversarial system Generative grammar Computer science Generative adversarial network Consistency (knowledge bases) Image synthesis Image (mathematics) Artificial intelligence Graph Knowledge graph Theoretical computer science

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.18

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Generative Adversarial Networks and Image Synthesis

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Computer Graphics and Visualization Techniques

Physical Sciences → Computer Science → Computer Graphics and Computer-Aided Design

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

KnHiGAN: Knowledge-enhanced Hierarchical Generative Adversarial Network for Fine-grained Text-to-Image Synthesis

Abstract

Metrics

Citation History

Topics

Related Documents

Fine-grained image inpainting with scale-enhanced generative adversarial network

Knowledge-Driven Generative Adversarial Network for Text-to-Image Synthesis

Fine-Grained Semantic Image Synthesis with Object-Attention Generative Adversarial Network

Semantically Consistent Hierarchical Text to Fashion Image Synthesis with an Enhanced-Attentional Generative Adversarial Network

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks