Text to Video using GANs and Diffusion Models

Nikita Singhal; Praval Singh; Nikhil Singh; Mahipal Singh; Harsimrat Singh

doi:10.5455/jjcit.71-1708490995

ScienceGate Book Chapters

JOURNAL ARTICLE

Text to Video using GANs and Diffusion Models

Nikita Singhal Praval Singh Nikhil Singh Mahipal Singh Harsimrat Singh

Year: 2024 Journal: Jordanian Journal of Computers and Information Technology Pages: 1-1

DOI: 10.5455/jjcit.71-1708490995

Get Full-Text PDF Get Analytical Report

Abstract

The challenging endeavour of text-to-video creation requires transforming text descriptions into realistic and cohesive videos. This field of study has made substantial progress in recent years, with the development of diffusion models and generative adversarial networks (GANs). This study examines the most modern text-to-video generation models, as well as the various steps involved in text-to-video generation,including temporal coherence, video generation, and text encoding. We additionally emphasise the challenges involved with text-to-video generation, as well as recent advances to overcome these issues. The most frequently used datasets and metrics in this field are also analysed and reviewed

Keywords:

Diffusion Computer science Physics Thermodynamics

Metrics

Cited By

3.76

FWCI (Field Weighted Citation Impact)

Refs

0.87

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Computational and Text Analysis Methods

Social Sciences → Social Sciences → General Social Sciences

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Generative Adversarial Networks and Image Synthesis

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Text to Video using GANs and Diffusion Models

Abstract

Metrics

Citation History

Topics

Related Documents

From Gans to Diffusion Models: Text-To-Image Generation

Image Synthesis Using GANs and Diffusion Models

Text To Video: Enhancing Video Generation Using Diffusion Models And Reconstruction Network

Text-to-Image Generation Using Stack Generative Adversarial Networks (GANs) and Stable Diffusion Models

Restoring Historical Paintings Using Diffusion Models and GANs