JOURNAL ARTICLE

Video summarization with a graph convolutional attention network

Ping LiChao TangXianghua Xu

Year: 2021 Journal:   Frontiers of Information Technology & Electronic Engineering Vol: 22 (6)Pages: 902-913   Publisher: Springer Science+Business Media

Abstract

Video summarization has established itself as a fundamental technique for generating compact and concise video, which alleviates managing and browsing large-scale video data. Existing methods fail to fully consider the local and global relations among frames of video, leading to a deteriorated summarization performance. To address the above problem, we propose a graph convolutional attention network (GCAN) for video summarization. GCAN consists of two parts, embedding learning and context fusion, where embedding learning includes the temporal branch and graph branch. In particular, GCAN uses dilated temporal convolution to model local cues and temporal self-attention to exploit global cues for video frames. It learns graph embedding via a multi-layer graph convolutional network to reveal the intrinsic structure of frame samples. The context fusion part combines the output streams from the temporal branch and graph branch to create the context-aware representation of frames, on which the importance scores are evaluated for selecting representative frames to generate video summary. Experiments are carried out on two benchmark databases, SumMe and TVSum, showing that the proposed GCAN approach enjoys superior performance compared to several state-of-the-art alternatives in three evaluation settings.

Keywords:
Automatic summarization Computer science Graph Artificial intelligence Embedding Exploit Attention network Convolutional neural network Context (archaeology) Pattern recognition (psychology) Theoretical computer science

Metrics

20
Cited By
1.53
FWCI (Field Weighted Citation Impact)
45
Refs
0.84
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Unsupervised video summarization with adversarial graph-based attention network

Jeshmitha GunugantiZhi-Ting YehJenq‐Haur WangMehdi Norouzi

Journal:   Journal of Visual Communication and Image Representation Year: 2024 Vol: 102 Pages: 104200-104200
JOURNAL ARTICLE

Dynamic graph convolutional network for multi-video summarization

Jiaxin WuSheng-hua ZhongYan Liu

Journal:   Pattern Recognition Year: 2020 Vol: 107 Pages: 107382-107382
JOURNAL ARTICLE

Adaptive Graph Convolutional Adjacency Matrix Network for Video Summarization

Jing ZhangGuangli WuShanshan Song

Journal:   Computers, materials & continua/Computers, materials & continua (Print) Year: 2024 Vol: 80 (2)Pages: 1947-1965
© 2026 ScienceGate Book Chapters — All rights reserved.