JOURNAL ARTICLE

Integrate the Temporal Scheme for Unsupervised Video Summarization via Attention Mechanism

Vo Quoc BangVo Hoai Viet

Year: 2025 Journal:   IEEE Access Vol: 13 Pages: 38147-38162   Publisher: Institute of Electrical and Electronics Engineers

Abstract

In this work, we present a novel unsupervised scheme named SegSum, designed for video summarization through the creation of video skims. Most contemporary methods involve training a summarizer to assign importance scores to individual video frames, which are then aggregated to calculate scores for video segments produced by methods like Kernel Temporal Segmentation(KTS). Nonetheless, this methodology restricts the summarizer’s access to vital information essential for generating the summary—specifically, spatial-temporal relationships in video segments. Our proposed method incorporates the segment information obtained from KTS into the learning process of the summarizer based on concentrated attention architecture in deep learning models. In our experiment, we extensively evaluated our method across several datasets and many architectural frameworks for unsupervised video summarization. By incorporating a concentrated attention module, we managed to secure top F1-scores on established benchmarks, recording 54% on the SumMe dataset and 62% on the TVSum dataset. Furthermore, even with a straightforward Regressor network, SegSum demonstrates competitive performance, producing summaries that closely align with human annotations.

Keywords:
Automatic summarization Computer science Mechanism (biology) Scheme (mathematics) Artificial intelligence

Metrics

2
Cited By
9.55
FWCI (Field Weighted Citation Impact)
55
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Multimedia Communication and Technology
Social Sciences →  Social Sciences →  Sociology and Political Science

Related Documents

JOURNAL ARTICLE

Unsupervised Video Summarization Based on Spatiotemporal Semantic Graph and Enhanced Attention Mechanism

Xin ChengLei YangRui Li

Journal:   IEEE Transactions on Computational Social Systems Year: 2025 Vol: 12 (5)Pages: 3751-3764
JOURNAL ARTICLE

Unsupervised video summarization with adversarial graph-based attention network

Jeshmitha GunugantiZhi-Ting YehJenq‐Haur WangMehdi Norouzi

Journal:   Journal of Visual Communication and Image Representation Year: 2024 Vol: 102 Pages: 104200-104200
© 2026 ScienceGate Book Chapters — All rights reserved.