Image generation is an intriguing research topic, and text-conditioned image generation is a specific problem within it. Text-controlled image generation requires understanding the linguistic semantics of the text and accurately mapping them to visual semantics, which is a challenging task. This work aims to generate and manipulate human face images from text descriptions using StyleGAN. The proposed architecture comprises two main pipelines, one for text-based image generation and another for text-based image manipulation, each consisting of a sequence of models. For text-based image generation, a Text Encoder and a Latent Code Decoder map the text to the latent space of a pre-trained StyleGAN. For text-based image manipulation, a GAN inversion technique maps the real-world image to the latent space of the pre-trained StyleGAN to obtain its latent vector. Latent directions are learned in the disentangled latent space of the pre-trained StyleGAN and are used for image manipulation. The target attribute is identified by applying a Latent Direction Classifier to the text input, and the corresponding latent direction is used to modify the latent code of the original image. The final manipulated image is produced by feeding the modified latent code to the StyleGAN generator.
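The manipulation step described above, shifting an inverted latent code along a learned attribute direction before decoding it with the generator, can be sketched as follows. This is a minimal illustrative sketch, not the authors' implementation: the function name, the 512-dimensional W-space assumption, and the random stand-ins for the inverted code and the learned direction are all hypothetical; in the full pipeline the edited code would be passed to the pre-trained StyleGAN generator.

```python
import numpy as np

LATENT_DIM = 512  # assumed dimensionality of StyleGAN's W space


def manipulate_latent(w, direction, strength):
    """Shift a latent code along a learned attribute direction.

    w:         (LATENT_DIM,) latent code of the inverted image
    direction: (LATENT_DIM,) learned direction for one attribute (e.g. "smile")
    strength:  scalar controlling how strongly the attribute is applied
    """
    direction = direction / np.linalg.norm(direction)  # take a unit-length step
    return w + strength * direction


# Toy demonstration with random vectors standing in for a real
# inverted latent code and a learned attribute direction.
rng = np.random.default_rng(0)
w = rng.standard_normal(LATENT_DIM)          # stand-in for GAN-inverted code
smile_dir = rng.standard_normal(LATENT_DIM)  # stand-in for a learned direction

w_edit = manipulate_latent(w, smile_dir, strength=3.0)
# In the full pipeline, w_edit is fed to the pre-trained StyleGAN
# generator to render the manipulated face image.
```

The edit is a single vector addition in latent space; because the pre-trained StyleGAN's latent space is disentangled, moving along one direction changes primarily the targeted attribute while leaving the rest of the face intact.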