Learning Proposal-Aware Re-Ranking for Weakly-Supervised Temporal Action Localization

Yufan Hu; Jie Fu; Mengyuan Chen; Junyu Gao; Jianfeng Dong; Bin Fan; Hongmin Liu

doi:10.1109/tcsvt.2023.3283430

ScienceGate Book Chapters

JOURNAL ARTICLE

Learning Proposal-Aware Re-Ranking for Weakly-Supervised Temporal Action Localization

Yufan Hu Jie Fu Mengyuan Chen Junyu Gao Jianfeng Dong Bin Fan Hongmin Liu

Year: 2023 Journal: IEEE Transactions on Circuits and Systems for Video Technology Vol: 34 (1)Pages: 207-220 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/tcsvt.2023.3283430

Get Full-Text PDF Get Analytical Report

Abstract

Weakly-supervised temporal action localization (WTAL) aims to localize and classify action instances in untrimmed videos with only video-level labels available. Despite the remarkable success of existing methods, whose generated proposals are commonly far more than the ground-truth action instances, it still makes sense to improve the ranking accuracy of the generated proposals since users in real-world scenarios usually prioritize the action proposals with the highest confidence scores. The inaccuracy of the proposal ranking mainly comes from two aspects: For one thing, the traditional proposal generation manner entirely relies on snippet-level perception, resulting in a significant yet unnoticed gap with the target of proposal-level localization. For another, existing methods commonly employ a hand-crafted proposal generation manner, a post-process that does not participate in model optimization. To address the above issues, we propose an end-to-end trained two-stage method, termed as Learning Proposal-aware Re-ranking (LPR) for WTAL. In the first stage, we design a proposal-aware feature learning module to inject the proposal-aware contextual information into each snippet, and then the enhanced features are utilized for predicting initial proposals. Furthermore, to perform effective and efficient proposal re-ranking, in the second stage, we contrast the proposals attached with high confidence scores with our constructed multi-scale foreground/background prototypes for further optimization. Evaluated by both the vanilla and Top- $k$ mAP metrics, results of extensive experiments on two popular benchmarks demonstrate the effectiveness of our proposed method.

Keywords:

Computer science Artificial intelligence Ranking (information retrieval) Action (physics) Pattern recognition (psychology) Machine learning

Metrics

Cited By

4.19

FWCI (Field Weighted Citation Impact)

Refs

0.93

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Anomaly Detection Techniques and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Gait Recognition and Analysis

Physical Sciences → Engineering → Biomedical Engineering

Learning Proposal-Aware Re-Ranking for Weakly-Supervised Temporal Action Localization

Abstract

Metrics

Citation History

Topics

Related Documents

Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization

Background-Aware Robust Context Learning for Weakly-Supervised Temporal Action Localization

Uncertainty-Aware Dual-Evidential Learning for Weakly-Supervised Temporal Action Localization

Self-supervised temporal adaptive learning for weakly-supervised temporal action localization

Weakly Supervised Temporal Action Localization Through Contrastive Learning