JOURNAL ARTICLE

Dual conditional GAN based on external attention for semantic image synthesis

Abstract

Although the existing semantic image synthesis methods based on generative adversarial networks (GANs) have achieved great success, the quality of the generated images still cannot achieve satisfactory results. This is mainly caused by two reasons. One reason is that the information in the semantic layout is sparse. Another reason is that a single constraint cannot effectively control the position relationship between objects in the generated image. To address the above problems, we propose a dual-conditional GAN with based on an external attention for semantic image synthesis (DCSIS). In DCSIS, the adaptive normalization method uses the one-hot encoded semantic layout to generate the first latent space and the external attention uses the RGB encoded semantic layout to generate the second latent space. Two latent spaces control the shape of objects and the positional relationship between objects in the generated image. The graph attention (GAT) is added to the generator to strengthen the relationship between different categories in the generated image. A graph convolutional segmentation network (GSeg) is designed to learn information for each category. Experiments on several challenging datasets demonstrate the advantages of our method over existing approaches, regarding both visual quality and the representative evaluating criteria.

Keywords:
Computer science Artificial intelligence Normalization (sociology) Generator (circuit theory) Graph Generative adversarial network Image (mathematics) Semantics (computer science) Pattern recognition (psychology) Theoretical computer science

Metrics

6
Cited By
1.09
FWCI (Field Weighted Citation Impact)
31
Refs
0.74
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Generative Adversarial Networks and Image Synthesis
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Attention-based dual context aggregation for image semantic segmentation

Dexin ZhaoZhiyang QiRuixue YangZhaohui Wang

Journal:   Multimedia Tools and Applications Year: 2021 Vol: 80 (18)Pages: 28201-28216
JOURNAL ARTICLE

Dual-attention-transformer-based semantic reranking for large-scale image localization

Yilin XiaoSiliang DuXu ChenMingzhong LiuMingwei Sun

Journal:   Applied Intelligence Year: 2024 Vol: 54 (9-10)Pages: 6946-6958
JOURNAL ARTICLE

Modeling visual and word-conditional semantic attention for image captioning

Chunlei WuYiwei WeiXiaoliang ChuFei SuLeiquan Wang

Journal:   Signal Processing Image Communication Year: 2018 Vol: 67 Pages: 100-107
© 2026 ScienceGate Book Chapters — All rights reserved.