Dual conditional GAN based on external attention for semantic image synthesis

Gang Liu; Qijun Zhou; Xiaoxiao Xie; Qingchen Yu

doi:10.1080/09540091.2023.2259120

JOURNAL ARTICLE

Year: 2023 Journal: Connection Science Vol: 35 (1) Publisher: Taylor & Francis

Abstract

Although existing semantic image synthesis methods based on generative adversarial networks (GANs) have achieved great success, the quality of the generated images is still not satisfactory. There are two main reasons. First, the information in the semantic layout is sparse. Second, a single constraint cannot effectively control the positional relationship between objects in the generated image. To address these problems, we propose a dual-conditional GAN based on external attention for semantic image synthesis (DCSIS). In DCSIS, the adaptive normalization method uses the one-hot encoded semantic layout to generate the first latent space, and the external attention uses the RGB encoded semantic layout to generate the second latent space. The two latent spaces control the shape of objects and the positional relationship between objects in the generated image. Graph attention (GAT) is added to the generator to strengthen the relationship between different categories in the generated image. A graph convolutional segmentation network (GSeg) is designed to learn information for each category. Experiments on several challenging datasets demonstrate the advantages of our method over existing approaches in both visual quality and representative evaluation criteria.
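
The abstract refers to two encodings of the same semantic layout: a one-hot encoding and an RGB encoding. The sketch below only illustrates what those two representations look like for a toy label map. The label map, the palette colours, and the function names `one_hot_layout` and `rgb_layout` are assumptions made for illustration; they are not taken from the paper, and the sketch does not reproduce DCSIS's normalization or attention modules.

```python
# Hypothetical toy label map: each entry is a class id (0, 1 or 2).
labels = [
    [0, 0, 1],
    [2, 1, 1],
]

# Hypothetical palette mapping class id -> RGB colour (not from the paper).
PALETTE = {0: (0, 0, 0), 1: (128, 64, 128), 2: (70, 130, 180)}


def one_hot_layout(labels, num_classes):
    """Return num_classes binary planes, one per class, each the size of labels."""
    return [
        [[1 if cls == k else 0 for cls in row] for row in labels]
        for k in range(num_classes)
    ]


def rgb_layout(labels, palette):
    """Return a same-sized image where each class id is replaced by its colour."""
    return [[palette[cls] for cls in row] for row in labels]


one_hot = one_hot_layout(labels, num_classes=3)
rgb = rgb_layout(labels, PALETTE)
```

In the one-hot form every pixel activates exactly one class plane, while the RGB form packs the same labels into three colour channels. According to the abstract, DCSIS feeds the first form to the adaptive normalization and the second to the external attention.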

Keywords:

Computer science; Artificial intelligence; Normalization (sociology); Generator (circuit theory); Graph; Generative adversarial network; Image (mathematics); Semantics (computer science); Pattern recognition (psychology); Theoretical computer science

Metrics

FWCI (Field Weighted Citation Impact): 1.09

Citation Normalized Percentile: 0.74

Topics

Generative Adversarial Networks and Image Synthesis

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Vision and Imaging

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Dual Attention GANs for Semantic Image Synthesis

Attention-based dual context aggregation for image semantic segmentation

Dual-attention-transformer-based semantic reranking for large-scale image localization

Modeling visual and word-conditional semantic attention for image captioning

Attention Dual Adversarial Remote Sensing Image Semantic Segmentation