JOURNAL ARTICLE

Video Summarization With Attention-Based Encoder–Decoder Networks

Zhong JiKailin XiongYanwei PangXuelong Li

Year: 2019 Journal:   IEEE Transactions on Circuits and Systems for Video Technology Vol: 30 (6)Pages: 1709-1717   Publisher: Institute of Electrical and Electronics Engineers

Abstract

This paper addresses the problem of supervised video summarization by formulating it as a sequence-to-sequence learning problem, where the input is a sequence of original video frames, and the output is a keyshot sequence. Our key idea is to learn a deep summarization network with attention mechanism to mimic the way of selecting the keyshots of human. To this end, we propose a novel video summarization framework named attentive encoder-decoder networks for video summarization (AVS), in which the encoder uses a bidirectional long short-term memory (BiLSTM) to encode the contextual information among the input video frames. As for the decoder, two attention-based LSTM networks are explored by using additive and multiplicative objective functions, respectively. Extensive experiments are conducted on two video summarization benchmark datasets, i.e., SumMe and TVSum. The results demonstrate the superiority of the proposed AVS-based approaches against the state-of-the-art approaches, with remarkable improvements on both datasets.

Keywords:
Automatic summarization Computer science Encoder Benchmark (surveying) Artificial intelligence ENCODE Encoding (memory) Key (lock) Sequence (biology) Deep learning Decoding methods Recurrent neural network Pattern recognition (psychology) Artificial neural network Algorithm

Metrics

354
Cited By
19.88
FWCI (Field Weighted Citation Impact)
58
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.