JOURNAL ARTICLE

3D-CSL: Self-Supervised 3D Context Similarity Learning for Near-Duplicate Video Retrieval

Abstract

In this paper, we introduce 3D-CSL, a compact pipeline for Near-Duplicate Video Retrieval (NDVR), and explore a novel self-supervised learning strategy for video similarity learning. Most previous NDVR methods depend heavily on pairwise labeled data, so they are limited by the scale of available datasets and cannot optimize complex but efficient backbones, e.g., 3D transformers. To break this limitation, we explore self-supervised similarity learning for the NDVR task and propose FCS loss, a novel triplet loss, and ShotMix, a novel video-specific augmentation, which together significantly enhance self-supervised video similarity learning. With this premise, the compact 3D pipeline we propose shows a great advantage in extracting global spatiotemporal dependencies in videos and achieves the best balance between efficiency and effectiveness. Furthermore, we propose PredMAE, which pretrains the 3D transformer with video prediction as a pretext task, to boost the downstream NDVR task without any human labels. Experiments on FIVR-200K and CC_WEB_VIDEO demonstrate the superiority and reliability of our method, which achieves state-of-the-art performance on clip-level NDVR. Code is released at https://github.com/dun-research/3D-CSL
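
The abstract describes FCS loss only as a novel triplet loss and does not give its formulation. As background, a conventional margin-based triplet loss over cosine similarity could be sketched as below. This is a generic illustration, not the FCS loss and not code from the released repository; the function names, the use of cosine similarity, and the margin value are all assumptions made for the example.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two non-zero embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def triplet_loss(anchor, positive, negative, margin=0.5):
    """Generic similarity-based triplet loss (illustrative, not FCS loss).

    The loss is zero once the anchor is more similar to the positive
    than to the negative by at least `margin` (a hypothetical value).
    """
    sim_pos = cosine_similarity(anchor, positive)
    sim_neg = cosine_similarity(anchor, negative)
    return max(0.0, sim_neg - sim_pos + margin)
```

In a self-supervised setting such as the one the abstract describes, the positive would typically be an augmented view of the anchor clip and the negative a clip from a different video, so no human-labeled pairs are needed to form triplets.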

Keywords:
Computer science; Artificial intelligence; Transformer; Pipeline (software); Similarity (geometry); Task (project management); Machine learning; Pattern recognition (psychology); Image (mathematics)

Metrics

Cited By: 6
FWCI (Field Weighted Citation Impact): 1.09
Refs: 39
Citation Normalized Percentile: 0.74

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Self-similarity-based partial near-duplicate video retrieval and alignment

Zhipeng Wu, Kiyoharu Aizawa

Journal: International Journal of Multimedia Information Retrieval, Year: 2013, Vol: 3 (1), Pages: 1-14

BOOK-CHAPTER

Near-Duplicate Video Retrieval

Encyclopedia of Database Systems, Year: 2009, Pages: 1885-1885

JOURNAL ARTICLE

Attention-based deep supervised hashing for near duplicate video retrieval

Naifei Shi, Chong Fu, Ming Tie, Wenchao Zhang, Xingwei Wang, Chiu‐Wing Sham

Journal: Neural Computing and Applications, Year: 2023, Vol: 36 (10), Pages: 5217-5230