JOURNAL ARTICLE

Multiple Temporal Pooling Mechanisms for Weakly Supervised Temporal Action Localization

Peng DouYing ZengZhuoqun WangHaifeng Hu

Year: 2022 Journal:   ACM Transactions on Multimedia Computing Communications and Applications Vol: 19 (3)Pages: 1-19   Publisher: Association for Computing Machinery

Abstract

Recent action localization works learn in a weakly supervised manner to avoid the expensive cost of human labeling. Those works are mostly based on the Multiple Instance Learning framework, where temporal pooling is an indispensable part that usually relies on the guidance of snippet-level Class Activation Sequences (CAS) . However, we observe that previous works only leverage a simple convolutional neural network for the generation of CAS, which ignores the weak discriminative foreground action segments and the background ones, and meanwhile, the relationship between different actions has not been considered. To solve this problem, we propose multiple temporal pooling mechanisms (MTP) for a more sufficient information utilization. Specifically, with the design of the Foreground Variance Branch, Dual Foreground Attention Branch and Hybrid Attention Fine-tuning Branch, MTP can leverage more effective information from different aspects and generate different CASs to guide the learning of temporal pooling. Moreover, different loss functions are designed for a better optimization of individual branches, aiming to effectively distinguish the action from the background. Our method shows excellent results on the THUMOS14 and ActivityNet1.2 datasets.

Keywords:
Leverage (statistics) Pooling Computer science Artificial intelligence Discriminative model Machine learning Convolutional neural network Pattern recognition (psychology)

Metrics

6
Cited By
0.74
FWCI (Field Weighted Citation Impact)
40
Refs
0.68
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Complementary adversarial mechanisms for weakly-supervised temporal action localization

Chuanxu WangJing WangPeng Liu

Journal:   Pattern Recognition Year: 2023 Vol: 139 Pages: 109426-109426
JOURNAL ARTICLE

Temporal Dropout for Weakly Supervised Action Localization

Chi XieZikun ZhuangShengjie ZhaoShuang Liang

Journal:   ACM Transactions on Multimedia Computing Communications and Applications Year: 2022 Vol: 19 (3)Pages: 1-24
© 2026 ScienceGate Book Chapters — All rights reserved.