In recent years, deep learning has had a substantial impact on the computer vision community. Modern deep models can recognize thousands of image categories, with architectures that have grown increasingly varied and deep. In complex scenes, deep neural models can localize objects, detect a large number of object categories, and subsequently perform instance segmentation. Most recently, a number of scene graph generation and visual relationship detection methods have been developed for high-level image understanding, in order to extract more fine-grained and structural representations from images. As a dual problem of visual understanding, visual generation has also attracted much attention in recent years in light of deep learning techniques. Deep generative models can generate realistic images at high resolution and quality, and can further be applied to translate images across different domains and environments. The world around us is highly structured, and so are the images that depict it: an image may contain multiple foreground object categories as well as varied backgrounds, whether in natural scenes or artificial scenarios. In this thesis, we mainly leverage structural information for visual generation and understanding in the following tasks: 1) leveraging semantic structure to generate realistic images