JOURNAL ARTICLE

Coarse-to-Fine Loss Based On Viterbi Algorithm for Weakly Supervised Action Segmentation

Abstract

Weakly supervised action segmentation has been extensively studied to get the category and start time of actions that occur in videos, but it remains an unsolved issue because of lacking great annotation data in video analysis. To handle this issue, weakly supervised action segmentation only uses the action annotation on the whole sequence in a long video instead of specific labeling of each frame, which greatly reduces the difficulty of obtaining video datasets. However, the task remains challenging for the complex temporal length partition of actions in the videos. In this paper, we make use of the Viterbi algorithm to generate an initial action segmentation as the baseline and then design a new coarse-to-fine loss function to refine the length partition and learn the scores of valid and invalid segmentation routes respectively. The new coarse-to-fine loss is learned in the pipeline to reduce the weight of invalid segmentation routes and obtain the best video segmentation. Comparing with the state-of-the-art (SOTA) methods, the experiments on the breakfast and 50 salads datasets show that our fine partition model and coarse-to-fine loss function can be used to obtain higher frame accuracy and significantly reduce the time spent for action segmentation.

Keywords:
Segmentation Computer science Viterbi algorithm Artificial intelligence Scale-space segmentation Pattern recognition (psychology) Pipeline (software) Partition (number theory) Frame (networking) Image segmentation Segmentation-based object categorization Machine learning Hidden Markov model Mathematics

Metrics

1
Cited By
0.10
FWCI (Field Weighted Citation Impact)
52
Refs
0.44
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.