DISSERTATION

Improving controllable text-to-video diffusion models

Zou, Jesse

Year: 2023
University: Texas Digital Library (University of Texas)
Publisher: The University of Texas at Austin

Abstract

In this work, we explore different interpretations of ControlNet and how the conditional control it provides to image diffusion models can be extended to video diffusion models. We find that Control-a-Video extends ControlNet using strategies that diverge from ControlNet's training procedure. We explore whether restructuring the training procedure to be more analogous to ControlNet allows for a higher degree of controllability, and we introduce a way to train the model while maintaining the fast convergence found in Control-a-Video. We propose the following interpretations that are more analogous to ControlNet: (1) separating the video diffusion model training from the Video ControlNet training in Control-a-Video; (2) using a frozen image diffusion model as the foundation for a Video ControlNet called VideoNet; and (3) training the entire VideoNet instead of just the temporal layers. We find that separating the training process produces higher-quality generation, that pairing an image diffusion model with a VideoNet speeds up training at the cost of sample quality, and that training all spatio-temporal layers in a Video ControlNet causes the samples to degenerate.
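For readers unfamiliar with the ControlNet setup the abstract refers to, the following is a minimal toy sketch of its core idea: a pretrained backbone is kept frozen, and a trainable control branch is attached through a zero-initialized connection, so that at the start of training the combined model reproduces the backbone's output exactly. The scalar "models" and every name here (`FrozenBase`, `ControlBranch`, `controlled_forward`, `sgd_step`) are hypothetical stand-ins chosen for illustration; this is not the dissertation's VideoNet code and not the Control-a-Video implementation.

```python
from dataclasses import dataclass


@dataclass
class FrozenBase:
    """Toy stand-in for a pretrained diffusion backbone. Never updated."""
    weight: float = 2.0

    def __call__(self, x):
        return [self.weight * v for v in x]


@dataclass
class ControlBranch:
    """Toy stand-in for the trainable control branch."""
    weight: float = 0.5     # trainable branch weight
    zero_gate: float = 0.0  # zero-initialized output connection

    def __call__(self, x, cond):
        # The branch sees the input and the control signal; its output is
        # scaled by the gate, which is zero before any training.
        return [self.zero_gate * self.weight * (v + c) for v, c in zip(x, cond)]


def controlled_forward(base, branch, x, cond):
    """Frozen backbone output plus the gated residual from the control branch."""
    return [b + r for b, r in zip(base(x), branch(x, cond))]


def sgd_step(base, branch, x, cond, target, lr=0.1):
    """One gradient step on a squared-error loss; only the branch is updated."""
    out = controlled_forward(base, branch, x, cond)
    errs = [o - t for o, t in zip(out, target)]
    feats = [v + c for v, c in zip(x, cond)]
    # Analytic gradients of sum((out - target)^2) w.r.t. the branch parameters.
    grad_gate = sum(2 * e * branch.weight * f for e, f in zip(errs, feats))
    grad_weight = sum(2 * e * branch.zero_gate * f for e, f in zip(errs, feats))
    branch.zero_gate -= lr * grad_gate
    branch.weight -= lr * grad_weight
    # base.weight is deliberately left untouched (frozen backbone).


if __name__ == "__main__":
    base, branch = FrozenBase(), ControlBranch()
    x, cond, target = [1.0, 2.0], [0.5, -0.5], [3.0, 5.0]
    print(controlled_forward(base, branch, x, cond) == base(x))  # identical at init
    sgd_step(base, branch, x, cond, target)
    print(base.weight, branch.zero_gate)  # backbone unchanged, gate has moved
```

In this toy setting, interpretations (2) and (3) from the abstract would differ only in which parameters `sgd_step` is allowed to update; the real models replace the scalars with spatial and temporal network layers.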

Keywords:
Diffusion; Training (meteorology); Diffusion process; Convergence (economics); Image (mathematics); Process (computing); Sample (material)


Topics

Generative Adversarial Networks and Image Synthesis
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Model Reduction and Neural Networks
Physical Sciences →  Physics and Astronomy →  Statistical and Nonlinear Physics
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence