JOURNAL ARTICLE

Unsupervised Learning from Videos for Object Discovery in Single Images

Dong ZhaoBaoqing DingYulin WuLei ChenHongchao Zhou

Year: 2020 Journal:   Symmetry Vol: 13 (1)Pages: 38-38   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

This paper proposes a method for discovering the primary objects in single images by learning from videos in a purely unsupervised manner—the learning process is based on videos, but the generated network is able to discover objects from a single input image. The rough idea is that an image typically consists of multiple object instances (like the foreground and background) that have spatial transformations across video frames and they can be sparsely represented. By exploring the sparsity representation of a video with a neural network, one may learn the features of each object instance without any labels, which can be used to discover, recognize, or distinguish object instances from a single image. In this paper, we consider a relatively simple scenario, where each image roughly consists of a foreground and a background. Our proposed method is based on encoder-decoder structures to sparsely represent the foreground, background, and segmentation mask, which further reconstruct the original images. We apply the feed-forward network trained from videos for object discovery in single images, which is different from the previous co-segmentation methods that require videos or collections of images as the input for inference. The experimental results on various object segmentation benchmarks demonstrate that the proposed method extracts primary objects accurately and robustly, which suggests that unsupervised image learning tasks can benefit from the sparsity of images and the inter-frame structure of videos.

Keywords:
Computer science Artificial intelligence Segmentation Object (grammar) Inference Pattern recognition (psychology) Computer vision Image (mathematics) Representation (politics) Unsupervised learning Encoder Autoencoder Image segmentation Frame (networking) Feature learning Deep learning Object detection

Metrics

5
Cited By
0.52
FWCI (Field Weighted Citation Impact)
79
Refs
0.67
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Unsupervised Object-aware Learning from Videos

Jan Hůla

Year: 2020 Vol: abs 1805 2641 Pages: 237-242
JOURNAL ARTICLE

Unsupervised Learning of Characteristic Object Parts from Videos

Henrik SkibbeAlexandra TeynorHans Burkhardt

Journal:   Machine Vision and Applications Year: 2009 Vol: 10 (11)Pages: 199-202
JOURNAL ARTICLE

Toward Realistic Single-View 3D Object Reconstruction with Unsupervised Learning from Multiple Images

Long-Nhat HoAnh TranQuynh PhungMinh Hoai

Journal:   2021 IEEE/CVF International Conference on Computer Vision (ICCV) Year: 2021 Pages: 12580-12590
© 2026 ScienceGate Book Chapters — All rights reserved.