Hyodong Lee, Joonseok Lee, Joe Yue-Hei Ng, Paul Natsev
Representation learning is widely applied to various tasks on multimedia data, e.g., retrieval and search. One approach to learning useful representations is to exploit the relationships or similarities between examples. In this work, we explore two promising scalable representation learning approaches in the video domain. Using hierarchical graph clusters built upon video-to-video similarities, we propose: 1) a smart negative sampling strategy that significantly boosts training efficiency with the triplet loss, and 2) a pseudo-classification approach that uses the clusters as pseudo-labels. The embeddings trained with the proposed methods are competitive on multiple video understanding tasks, including related video retrieval and video annotation. Both proposed methods are highly scalable, as verified by experiments on large-scale datasets.
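The cluster-based negative sampling described above can be illustrated with a minimal sketch. This is a hypothetical toy example, not the paper's implementation: it assumes two levels of hierarchical clusters (a fine level grouping very similar videos and a coarse level grouping broader topics) and draws a hard negative from the anchor's coarse cluster but a different fine cluster, so the negative is semantically close yet not a true positive.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy setup: 6 videos with 4-d embeddings. "fine" clusters
# group near-duplicate videos; "coarse" clusters group broader topics.
emb = {v: rng.normal(size=4) for v in range(6)}
fine = {0: 0, 1: 0, 2: 1, 3: 1, 4: 2, 5: 2}
coarse = {0: 0, 1: 0, 2: 0, 3: 0, 4: 1, 5: 1}  # videos 0-3 share a topic

def triplet_loss(a, p, n, margin=0.2):
    """Hinge triplet loss on squared Euclidean distances:
    pull anchor toward positive, push it away from negative."""
    return max(0.0, np.sum((a - p) ** 2) - np.sum((a - n) ** 2) + margin)

def sample_hard_negative(anchor, rng):
    """Smart negative: same coarse cluster (semantically close, hence
    a hard negative) but a different fine cluster (not a positive)."""
    cands = [v for v in emb
             if coarse[v] == coarse[anchor] and fine[v] != fine[anchor]]
    return int(rng.choice(cands))

anchor, positive = 0, 1                 # same fine cluster -> positive pair
negative = sample_hard_negative(anchor, rng)
loss = triplet_loss(emb[anchor], emb[positive], emb[negative])
```

Restricting candidates to the anchor's coarse cluster is what makes sampling efficient at scale: only a small neighborhood of each video needs to be considered, rather than the full corpus.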