JOURNAL ARTICLE

Actionness Inconsistency-Guided Contrastive Learning for Weakly-Supervised Temporal Action Localization

Zhilin LiZilei WangQinying Liu

Year: 2023 Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Vol: 37 (2)Pages: 1513-1521   Publisher: Association for the Advancement of Artificial Intelligence

Abstract

Weakly-supervised temporal action localization (WTAL) aims to detect action instances given only video-level labels. To address the challenge, recent methods commonly employ a two-branch framework, consisting of a class-aware branch and a class-agnostic branch. In principle, the two branches are supposed to produce the same actionness activation. However, we observe that there are actually many inconsistent activation regions. These inconsistent regions usually contain some challenging segments whose semantic information (action or background) is ambiguous. In this work, we propose a novel Actionness Inconsistency-guided Contrastive Learning (AICL) method which utilizes the consistent segments to boost the representation learning of the inconsistent segments. Specifically, we first define the consistent and inconsistent segments by comparing the predictions of two branches and then construct positive and negative pairs between consistent segments and inconsistent segments for contrastive learning. In addition, to avoid the trivial case where there is no consistent sample, we introduce an action consistency constraint to control the difference between the two branches. We conduct extensive experiments on THUMOS14, ActivityNet v1.2, and ActivityNet v1.3 datasets, and the results show the effectiveness of AICL with state-of-the-art performance. Our code is available at https://github.com/lizhilin-ustc/AAAI2023-AICL.

Keywords:
Consistency (knowledge bases) Constraint (computer-aided design) Computer science Construct (python library) Representation (politics) Action (physics) Artificial intelligence Code (set theory) Class (philosophy) Machine learning Natural language processing Pattern recognition (psychology) Mathematics Programming language

Metrics

13
Cited By
1.05
FWCI (Field Weighted Citation Impact)
68
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Weakly supervised temporal action localization with actionness-guided false positive suppression

Zhilin LiZilei WangQinying Liu

Journal:   Neural Networks Year: 2024 Vol: 175 Pages: 106307-106307
BOOK-CHAPTER

Weakly Supervised Temporal Action Localization Through Segment Contrastive Learning

Zihao JiangYidong Li

Communications in computer and information science Year: 2023 Pages: 228-243
JOURNAL ARTICLE

Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization

Junyu GaoMengyuan ChenChangsheng Xu

Journal:   2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Year: 2022 Pages: 19967-19977
© 2026 ScienceGate Book Chapters — All rights reserved.