JOURNAL ARTICLE

Temporal Dropout for Weakly Supervised Action Localization

Chi XieZikun ZhuangShengjie ZhaoShuang Liang

Year: 2022 Journal:   ACM Transactions on Multimedia Computing Communications and Applications Vol: 19 (3)Pages: 1-24   Publisher: Association for Computing Machinery

Abstract

Weakly supervised action localization is a challenging problem in video understanding and action recognition. Existing models usually formulate the training process as direct classification using video-level supervision. They tend to only locate the most discriminative parts of action instances and produce temporally incomplete detection results. A natural solution for this problem, the adversarial erasing strategy, is to remove such parts from training so that models can attend to complementary parts. Previous works do it in an offline and heuristic way. They adopt a multi-stage pipeline, where discriminative regions are determined and erased under the guidance of detection results from last stage. Such a pipeline can be both ineffective and inefficient, possibly hindering the overall performance. On the contrary, we combine adversarial erasing with dropout mechanism and propose a Temporal Dropout Module that learns where to remove in a data-driven and online manner. This plug-and-play module is trained without iterative stages, which not only simplifies the pipeline but also makes the regularization during training easier and more adaptive. Experiments show that the proposed method outperforms previous erasing-based methods by a large margin. More importantly, it achieves universal improvement when plugged into various direct classification methods and obtains state-of-the-art performance.

Keywords:
Computer science Discriminative model Artificial intelligence Regularization (linguistics) Pipeline (software) Machine learning Dropout (neural networks) Margin (machine learning) Heuristic Process (computing) Pattern recognition (psychology)

Metrics

10
Cited By
1.24
FWCI (Field Weighted Citation Impact)
30
Refs
0.77
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Adversarial Robustness in Machine Learning
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Weakly-supervised temporal action localization: a survey

AbdulRahman M. A. BarakaMohd Halim Mohd Noor

Journal:   Neural Computing and Applications Year: 2022 Vol: 34 (11)Pages: 8479-8499
JOURNAL ARTICLE

Action Shuffling for Weakly Supervised Temporal Localization

Xiaoyu ZhangHaichao ShiChangsheng LiXinchu Shi

Journal:   IEEE Transactions on Image Processing Year: 2022 Vol: 31 Pages: 4447-4457
JOURNAL ARTICLE

Weakly supervised temporal action localization: a survey

Ronglu LiTianyi ZhangRubo Zhang

Journal:   Multimedia Tools and Applications Year: 2024 Vol: 83 (32)Pages: 78361-78386
© 2026 ScienceGate Book Chapters — All rights reserved.