JOURNAL ARTICLE

Self-Supervised Video Representation Learning with Meta-Contrastive Network

Yuanze LinXun GuoYan Lu

Year: 2021 Journal:   2021 IEEE/CVF International Conference on Computer Vision (ICCV) Pages: 8219-8229

Abstract

Self-supervised learning has been successfully applied to pre-train video representations, which aims at efficient adaptation from pre-training domain to downstream tasks. Existing approaches merely leverage contrastive loss to learn instance-level discrimination. However, lack of category information will lead to hard-positive problem that constrains the generalization ability of this kind of methods. We find that the multi-task process of meta learning can provide a solution to this problem. In this paper, we propose a Meta-Contrastive Network (MCN), which combines the contrastive learning and meta learning, to enhance the learning ability of existing self-supervised approaches. Our method contains two training stages based on model-agnostic meta learning (MAML), each of which consists of a contrastive branch and a meta branch. Extensive evaluations demonstrate the effectiveness of our method. For two downstream tasks, i.e., video action recognition and video retrieval, MCN outperforms state-of-the-art approaches on UCF101 and HMDB51 datasets. To be more specific, with R(2+1)D backbone, MCN achieves Top-1 accuracies of 84.8% and 54.5% for video action recognition, as well as 52.5% and 23.7% for video retrieval.

Keywords:
Computer science Leverage (statistics) Artificial intelligence Generalization Machine learning Feature learning Task (project management) Domain adaptation Representation (politics) Natural language processing Pattern recognition (psychology) Classifier (UML)

Metrics

39
Cited By
2.28
FWCI (Field Weighted Citation Impact)
86
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Self-Supervised Video Representation Learning with Motion-Contrastive Perception

Jinyu LiuYing ChengYuejie ZhangRui-Wei ZhaoRui Feng

Journal:   2022 IEEE International Conference on Multimedia and Expo (ICME) Year: 2022 Pages: 1-6
JOURNAL ARTICLE

Multitask Contrastive Learning for Self-Supervised Video Representation

东风 单

Journal:   Computer Science and Application Year: 2023 Vol: 13 (03)Pages: 433-443
JOURNAL ARTICLE

Cut-in maneuver detection with self-supervised contrastive video representation learning

Yağız NalçakanYalın Baştanlar

Journal:   Signal Image and Video Processing Year: 2023 Vol: 17 (6)Pages: 2915-2923
© 2026 ScienceGate Book Chapters — All rights reserved.