JOURNAL ARTICLE

Multi-Scale Structure-Aware Network for Weakly Supervised Temporal Action Detection

Wenfei YangTianzhu ZhangZhendong MaoYongdong ZhangQi TianFeng Wu

Year: 2021 Journal:   IEEE Transactions on Image Processing Vol: 30 Pages: 5848-5861   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Weakly supervised temporal action detection has better scalability and practicability than fully supervised action detection in reality deployment. However, it is difficult to learn a robust model without temporal action boundary annotations. In this paper, we propose an en-to-end Multi-Scale Structure-Aware Network (MSA-Net) for weakly supervised temporal action detection by exploring both the global structure information of a video and the local structure information of actions. The proposed SA-Net enjoys several merits. First, to localize actions with different durations, each video is encoded into feature representations with different temporal scales. Second, based on the multi-scale feature representation, the proposed model has designed two effective structure modeling mechanisms including global structure modeling and local structure modeling, which can effectively learn discriminative structure aware representations for robust and complete action detection. To the best of our knowledge, this is the first work to fully explore the global and local structure information in a unified deep model for weakly supervised action detection. And extensive experimental results on two benchmark datasets demonstrate that the proposed MSA-Net performs favorably against state-of-the-art methods.

Keywords:
Computer science Discriminative model Artificial intelligence Benchmark (surveying) Feature (linguistics) Scalability Pattern recognition (psychology) Machine learning Representation (politics) Feature learning

Metrics

29
Cited By
2.56
FWCI (Field Weighted Citation Impact)
83
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.