Temporal Dropout for Weakly Supervised Action Localization

Chi Xie; Zikun Zhuang; Shengjie Zhao; Shuang Liang

doi:10.1145/3567827

ScienceGate Book Chapters

JOURNAL ARTICLE

Temporal Dropout for Weakly Supervised Action Localization

Chi Xie Zikun Zhuang Shengjie Zhao Shuang Liang

Year: 2022 Journal: ACM Transactions on Multimedia Computing Communications and Applications Vol: 19 (3)Pages: 1-24 Publisher: Association for Computing Machinery

DOI: 10.1145/3567827

Get Full-Text PDF Get Analytical Report

Abstract

Weakly supervised action localization is a challenging problem in video understanding and action recognition. Existing models usually formulate the training process as direct classification using video-level supervision. They tend to only locate the most discriminative parts of action instances and produce temporally incomplete detection results. A natural solution for this problem, the adversarial erasing strategy, is to remove such parts from training so that models can attend to complementary parts. Previous works do it in an offline and heuristic way. They adopt a multi-stage pipeline, where discriminative regions are determined and erased under the guidance of detection results from last stage. Such a pipeline can be both ineffective and inefficient, possibly hindering the overall performance. On the contrary, we combine adversarial erasing with dropout mechanism and propose a Temporal Dropout Module that learns where to remove in a data-driven and online manner. This plug-and-play module is trained without iterative stages, which not only simplifies the pipeline but also makes the regularization during training easier and more adaptive. Experiments show that the proposed method outperforms previous erasing-based methods by a large margin. More importantly, it achieves universal improvement when plugged into various direct classification methods and obtains state-of-the-art performance.

Keywords:

Computer science Discriminative model Artificial intelligence Regularization (linguistics) Pipeline (software) Machine learning Dropout (neural networks) Margin (machine learning) Heuristic Process (computing) Pattern recognition (psychology)

Metrics

Cited By

1.24

FWCI (Field Weighted Citation Impact)

Refs

0.77

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Anomaly Detection Techniques and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Adversarial Robustness in Machine Learning

Physical Sciences → Computer Science → Artificial Intelligence

Temporal Dropout for Weakly Supervised Action Localization

Abstract

Metrics

Citation History

Topics

Related Documents

Weakly-supervised temporal action localization: a survey

Action Shuffling for Weakly Supervised Temporal Localization

Weakly supervised temporal action localization: a survey

Self-supervised temporal adaptive learning for weakly-supervised temporal action localization

Action Relational Graph for Weakly-Supervised Temporal Action Localization