JOURNAL ARTICLE

Query-Biased Self-Attentive Network for Query-Focused Video Summarization

Shuwen XiaoZhou ZhaoZijian ZhangZiyu GuanDeng Cai

Year: 2020 Journal:   IEEE Transactions on Image Processing Vol: 29 Pages: 5889-5899   Publisher: Institute of Electrical and Electronics Engineers

Abstract

This paper addresses the task of query-focused video summarization, which takes user queries and long videos as inputs and generates query-focused video summaries. Compared to video summarization, which mainly concentrates on finding the most diverse and representative visual contents as a summary, the task of query-focused video summarization considers the user's intent and the semantic meaning of generated summary. In this paper, we propose a method, named query-biased self-attentive network (QSAN) to tackle this challenge. Our key idea is to utilize the semantic information from video descriptions to generate a generic summary and then to combine the information from the query to generate a query-focused summary. Specifically, we first propose a hierarchical self-attentive network to model the relative relationship at three levels, which are different frames from a segment, different segments of the same video, textual information of video description and its related visual contents. We train the model on video caption dataset and employ a reinforced caption generator to generate a video description, which can help us locate important frames or shots. Then we build a query-aware scoring module to compute the query-relevant score for each shot and generate the query-focused summary. Extensive experiments on the benchmark dataset demonstrate the competitive performance of our approach compared to some methods.

Keywords:
Automatic summarization Computer science Query expansion Information retrieval Web search query Query optimization Artificial intelligence Search engine

Metrics

63
Cited By
3.78
FWCI (Field Weighted Citation Impact)
73
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

BOOK-CHAPTER

Query-Focused Extractive Video Summarization

Aidean SharghiBoqing GongMubarak Shah

Lecture notes in computer science Year: 2016 Pages: 3-19
JOURNAL ARTICLE

Convolutional Hierarchical Attention Network for Query-Focused Video Summarization

Shuwen XiaoZhou ZhaoZijian ZhangXiaohui YanMin Yang

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2020 Vol: 34 (07)Pages: 12426-12433
JOURNAL ARTICLE

MOFNet: Memory Optimization Fusion Network for Query-Focused Video Summarization

K. X. ZhaoRan MaMin SuHongwen YuPing AnGongyang Li

Journal:   IEEE Sensors Journal Year: 2025 Vol: 25 (14)Pages: 27642-27652
BOOK-CHAPTER

Regression Augmented Global Attention Network for Query-Focused Video Summarization

Min SuRan MaBing ZhangKai LiPing An

Communications in computer and information science Year: 2023 Pages: 326-338
© 2026 ScienceGate Book Chapters — All rights reserved.