Discovering Density-Preserving Latent Space Walks in GANs for Semantic Image Transformations

Guanyue Li; Yi Liu; Xiwen Wei; Yang Zhang; Si Wu; Yong Xu; Hau-San Wong

doi:10.1145/3474085.3475293


JOURNAL ARTICLE


Year: 2021 Pages: 1562-1570


Abstract

Generative adversarial network (GAN)-based models are capable of high-fidelity image synthesis. The latent space of a well-trained GAN contains a wide range of semantically meaningful directions, and walking along these directions provides semantic control over the synthesized images. To explore the underlying organization of a latent space, we propose an unsupervised Density-Preserving Latent Semantics Exploration model (DP-LaSE). The important latent directions are determined by maximizing the variations in intermediate features while minimizing the correlation between the directions. Because latent codes are sampled from a prior distribution, we adopt a density-preserving regularization approach that keeps latent space walks within iso-density regions, since moving to a higher- or lower-density region tends to cause unexpected transformations. To further refine semantics-specific transformations, we perform subspace learning over intermediate feature channels, so that the transformations are limited to the most relevant subspaces. Extensive experiments on a variety of benchmark datasets demonstrate that DP-LaSE discovers interpretable latent space walks, allowing specific properties of synthesized images to be precisely controlled.
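The idea of keeping a latent walk inside an iso-density region can be illustrated with a small sketch. This is not the DP-LaSE regularizer, whose details the abstract does not give. It assumes a standard normal prior N(0, I), under which the density depends only on the norm of the latent code, so an iso-density region is a sphere around the origin. The direction used here is random and purely hypothetical, and the helper names (`naive_step`, `iso_density_step`) are invented for illustration.

```python
import math
import random


def norm(v):
    """Euclidean norm of a vector given as a list of floats."""
    return math.sqrt(sum(x * x for x in v))


def log_density(z):
    """Log-density of z under an assumed standard normal prior N(0, I)."""
    d = len(z)
    return -0.5 * sum(x * x for x in z) - 0.5 * d * math.log(2.0 * math.pi)


def naive_step(z, direction, alpha):
    """Plain linear walk: z + alpha * direction (may change the density)."""
    return [zi + alpha * di for zi, di in zip(z, direction)]


def iso_density_step(z, direction, alpha):
    """Linear walk followed by rescaling back to the original norm.

    Under N(0, I) equal norm means equal density, so the result stays
    on the same iso-density sphere as z.
    """
    moved = naive_step(z, direction, alpha)
    scale = norm(z) / norm(moved)
    return [scale * m for m in moved]


rng = random.Random(0)
dim = 512  # assumed latent dimensionality, chosen only for the example

z = [rng.gauss(0.0, 1.0) for _ in range(dim)]
raw = [rng.gauss(0.0, 1.0) for _ in range(dim)]
direction = [r / norm(raw) for r in raw]  # hypothetical unit-length direction

z_naive = naive_step(z, direction, alpha=6.0)
z_iso = iso_density_step(z, direction, alpha=6.0)

# How far each walk drifts from the starting log-density.
density_drift_naive = abs(log_density(z_naive) - log_density(z))
density_drift_iso = abs(log_density(z_iso) - log_density(z))
```

Comparing `density_drift_naive` with `density_drift_iso` shows the difference between an unconstrained walk and one projected back onto the starting density level. A learned regularizer, as described in the abstract, would presumably encourage this property during direction discovery rather than enforce it by projection afterwards.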

Keywords:

Computer science; Linear subspace; Semantics (computer science); Representation (politics); Subspace topology; Pattern recognition (psychology); Artificial intelligence; Generative model; Probabilistic latent semantic analysis; Theoretical computer science; Algorithm; Mathematics; Generative grammar; Geometry

Metrics

FWCI (Field Weighted Citation Impact): 0.61

Citation Normalized Percentile: 0.69

Topics

Generative Adversarial Networks and Image Synthesis

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Cell Image Analysis Techniques

Life Sciences → Biochemistry, Genetics and Molecular Biology → Biophysics

Digital Media Forensic Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition


Related Documents

Locality-Preserving Directions for Interpreting the Latent Space of Satellite Image GANs

Intuitively interpreting GANs latent space using semantic distribution

Discovering Interpretable Latent Space Directions of GANs Beyond Binary Attributes

Disentangling the latent space of GANs for semantic face editing

Interpreting the Latent Space of GANs for Semantic Face Editing