Unsupervised Learning from Videos for Object Discovery in Single Images

Dong Zhao; Baoqing Ding; Yulin Wu; Lei Chen; Hongchao Zhou

doi:10.3390/sym13010038

ScienceGate Book Chapters

JOURNAL ARTICLE

Unsupervised Learning from Videos for Object Discovery in Single Images

Dong Zhao Baoqing Ding Yulin Wu Lei Chen Hongchao Zhou

Year: 2020 Journal: Symmetry Vol: 13 (1)Pages: 38-38 Publisher: Multidisciplinary Digital Publishing Institute

DOI: 10.3390/sym13010038

Get Full-Text PDF Get Analytical Report

Abstract

This paper proposes a method for discovering the primary objects in single images by learning from videos in a purely unsupervised manner—the learning process is based on videos, but the generated network is able to discover objects from a single input image. The rough idea is that an image typically consists of multiple object instances (like the foreground and background) that have spatial transformations across video frames and they can be sparsely represented. By exploring the sparsity representation of a video with a neural network, one may learn the features of each object instance without any labels, which can be used to discover, recognize, or distinguish object instances from a single image. In this paper, we consider a relatively simple scenario, where each image roughly consists of a foreground and a background. Our proposed method is based on encoder-decoder structures to sparsely represent the foreground, background, and segmentation mask, which further reconstruct the original images. We apply the feed-forward network trained from videos for object discovery in single images, which is different from the previous co-segmentation methods that require videos or collections of images as the input for inference. The experimental results on various object segmentation benchmarks demonstrate that the proposed method extracts primary objects accurately and robustly, which suggests that unsupervised image learning tasks can benefit from the sparsity of images and the inter-frame structure of videos.

Keywords:

Computer science Artificial intelligence Segmentation Object (grammar) Inference Pattern recognition (psychology) Computer vision Image (mathematics) Representation (politics) Unsupervised learning Encoder Autoencoder Image segmentation Frame (networking) Feature learning Deep learning Object detection

Metrics

Cited By

0.52

FWCI (Field Weighted Citation Impact)

Refs

0.67

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Visual Attention and Saliency Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Unsupervised Learning from Videos for Object Discovery in Single Images

Abstract

Metrics

Citation History

Topics

Related Documents

Unsupervised object discovery and localization in images and videos

Unsupervised Object-aware Learning from Videos

Unsupervised Object Discovery and Segmentation in Videos

Unsupervised Learning of Characteristic Object Parts from Videos

Toward Realistic Single-View 3D Object Reconstruction with Unsupervised Learning from Multiple Images