LayoutDiffusion: Controllable Diffusion Model for Layout-to-Image Generation

Guangcong Zheng; Xianpan Zhou; Xuewei Li; Zhongang Qi; Ying Shan; Xi Li

doi:10.1109/cvpr52729.2023.02154

ScienceGate Book Chapters

JOURNAL ARTICLE

LayoutDiffusion: Controllable Diffusion Model for Layout-to-Image Generation

Guangcong Zheng Xianpan Zhou Xuewei Li Zhongang Qi Ying Shan Xi Li

Year: 2023 Pages: 22490-22499

DOI: 10.1109/cvpr52729.2023.02154

Get Full-Text PDF Get Analytical Report

Abstract

Recently, diffusion models have achieved great success in image synthesis. However, when it comes to the layout-to-image generation where an image often has a complex scene of multiple objects, how to make strong control over both the global layout map and each detailed object remains a challenging task. In this paper, we propose a diffusion model named LayoutDiffusion that can obtain higher generation quality and greater controllability than the previous works. To overcome the difficult multimodal fusion of image and layout, we propose to construct a structural image patch with region information and transform the patched image into a special layout to fuse with the normal layout in a unified form. Moreover, Layout Fusion Module (LFM) and Object-aware Cross Attention (OaCA) are proposed to model the relationship among multiple objects and designed to be object-aware and position-sensitive, allowing for precisely controlling the spatial related information. Extensive experiments show that our LayoutDiffusion out-performs the previous SOTA methods on FID, CAS by relatively 46.35%,26.70% on COCO-stuff and 44.29%,41.82% on VG. Code is available at https://github.com/ZGCTroy/LayoutDiffusion.

Keywords:

Computer science Image (mathematics) Image fusion Construct (python library) Computer vision Fuse (electrical) Controllability Artificial intelligence Object (grammar) Code (set theory) Mathematics Engineering Set (abstract data type)

Metrics

115

Cited By

20.93

FWCI (Field Weighted Citation Impact)

Refs

0.99

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Generative Adversarial Networks and Image Synthesis

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

LayoutDiffusion: Controllable Diffusion Model for Layout-to-Image Generation

Abstract

Metrics

Citation History

Topics

Related Documents

HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation

LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models

LayoutDM: Discrete Diffusion Model for Controllable Layout Generation

STAY Diffusion: Styled Layout Diffusion Model for Diverse Layout-to-Image Generation

Relation-Aware Diffusion Model for Controllable Poster Layout Generation