MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models

Donghao Zhou; Jiancheng Huang; Jinbin Bai; Jiaze Wang; Hao Chen; Guangyong Chen; Xiaowei Hu; Pheng‐Ann Heng

doi:10.24963/ijcai.2024/1136

ScienceGate Book Chapters

JOURNAL ARTICLE

MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models

Donghao Zhou Jiancheng Huang Jinbin Bai Jiaze Wang Hao Chen Guangyong Chen Xiaowei Hu Pheng‐Ann Heng

Year: 2024 Pages: 10225-10233

DOI: 10.24963/ijcai.2024/1136

Get Full-Text PDF Get Analytical Report

Abstract

Text-to-image diffusion models can generate high-quality images but lack fine-grained control of visual concepts, limiting their creativity. Thus, we introduce component-controllable personalization, a new task that enables users to customize and reconfigure individual components within concepts. This task faces two challenges: semantic pollution, where undesired elements disrupt the target concept, and semantic imbalance, which causes disproportionate learning of the target concept and component. To address these, we design MagicTailor, a framework that uses Dynamic Masked Degradation to adaptively perturb unwanted visual semantics and Dual-Stream Balancing for more balanced learning of desired visual semantics. The experimental results show that MagicTailor achieves superior performance in this task and enables more personalized and creative image generation.

Keywords:

Metrics

Cited By

2.12

FWCI (Field Weighted Citation Impact)

Refs

0.82

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Image Retrieval and Classification Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Analysis and Summarization

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Multimedia Communication and Technology

Social Sciences → Social Sciences → Sociology and Political Science

MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models

Abstract

Metrics

Citation History

Topics

Related Documents

MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models

FineStyle: Fine-grained Controllable Style Personalization for Text-to-image Models

DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models

Controllable Generation with Text-to-Image Diffusion Models: a Survey

Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models