JOURNAL ARTICLE

Temporal Structure Mining for Weakly Supervised Action Detection

Abstract

Different from the fully-supervised action detection problem that is dependent on expensive frame-level annotations, weakly supervised action detection (WSAD) only needs video-level annotations, making it more practical for real-world applications. Existing WSAD methods detect action instances by scoring each video segment (a stack of frames) individually. Most of them fail to model the temporal relations among video segments and cannot effectively characterize action instances possessing latent temporal structure. To alleviate this problem in WSAD, we propose the temporal structure mining (TSM) approach. In TSM, each action instance is modeled as a multi-phase process and phase evolving within an action instance, \emph{i.e.}, the temporal structure, is exploited. Meanwhile, the video background is modeled by a background phase, which separates different action instances in an untrimmed video. In this framework, phase filters are used to calculate the confidence scores of the presence of an action's phases in each segment. Since in the WSAD task, frame-level annotations are not available and thus phase filters cannot be trained directly. To tackle the challenge, we treat each segment's phase as a hidden variable. We use segments' confidence scores from each phase filter to construct a table and determine hidden variables, i.e., phases of segments, by a maximal circulant path discovery along the table. Experiments conducted on three benchmark datasets demonstrate the state-of-the-art performance of the proposed TSM.

Keywords:
Computer science Benchmark (surveying) Artificial intelligence Frame (networking) Action (physics) Filter (signal processing) Pattern recognition (psychology) Construct (python library) Task (project management) Machine learning Path (computing) Data mining Computer vision

Metrics

102
Cited By
8.34
FWCI (Field Weighted Citation Impact)
63
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Weakly Supervised Temporal Action Detection With Temporal Dependency Learning

Bairong LiRuixin LiuTianquan ChenYuesheng Zhu

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2021 Vol: 32 (7)Pages: 4473-4485
JOURNAL ARTICLE

Multi-Scale Structure-Aware Network for Weakly Supervised Temporal Action Detection

Wenfei YangTianzhu ZhangZhendong MaoYongdong ZhangQi TianFeng Wu

Journal:   IEEE Transactions on Image Processing Year: 2021 Vol: 30 Pages: 5848-5861
JOURNAL ARTICLE

Superframe-Based Temporal Proposals for Weakly Supervised Temporal Action Detection

Bairong LiBiao GuoYuesheng ZhuJianfeng YinXiangli Ji

Journal:   IEEE Transactions on Multimedia Year: 2022 Vol: 25 Pages: 3628-3641
BOOK-CHAPTER

Cascaded Pyramid Mining Network for Weakly Supervised Temporal Action Localization

Haisheng SuXu ZhaoTianwei Lin

Lecture notes in computer science Year: 2019 Pages: 558-574
JOURNAL ARTICLE

Weakly Supervised Action Detection

Parthipan SivaTao Xiang

Year: 2011 Pages: 65.1-65.0
© 2026 ScienceGate Book Chapters — All rights reserved.