Shuwen Xiao, Zhou Zhao, Zijian Zhang, Ziyu Guan, Deng Cai
This paper addresses the task of query-focused video summarization, which takes user queries and long videos as inputs and generates query-focused video summaries. Compared to generic video summarization, which mainly concentrates on finding the most diverse and representative visual content as a summary, query-focused video summarization also considers the user's intent and the semantic meaning of the generated summary. In this paper, we propose a method named query-biased self-attentive network (QSAN) to tackle this challenge. Our key idea is to utilize the semantic information from video descriptions to generate a generic summary and then to combine it with information from the query to generate a query-focused summary. Specifically, we first propose a hierarchical self-attentive network to model relationships at three levels: among different frames within a segment, among different segments of the same video, and between the textual information of a video description and its related visual content. We train the model on a video captioning dataset and employ a reinforced caption generator to produce a video description, which helps us locate important frames or shots. We then build a query-aware scoring module to compute a query-relevance score for each shot and generate the query-focused summary. Extensive experiments on the benchmark dataset demonstrate the competitive performance of our approach compared to existing methods.
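Since the abstract describes the architecture only at a high level, the following is a minimal PyTorch sketch of how a query-aware shot-scoring module of this kind could be wired up. It is not the authors' implementation: the module name QueryAwareScorer, all dimensions, the single self-attention layer standing in for the hierarchical network, and the cosine-similarity scoring are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class QueryAwareScorer(nn.Module):
    """Sketch of a query-aware shot-scoring module (assumed design).

    Shot features are first contextualized with self-attention
    (a simplified stand-in for the paper's hierarchical
    self-attentive network), then scored against an encoded
    user query. All sizes and the scoring function are
    illustrative assumptions, not the QSAN implementation.
    """

    def __init__(self, feat_dim=512, query_dim=300, hidden=256, heads=4):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(feat_dim, heads, batch_first=True)
        self.shot_proj = nn.Linear(feat_dim, hidden)
        self.query_proj = nn.Linear(query_dim, hidden)

    def forward(self, shots, query):
        # shots: (batch, num_shots, feat_dim) visual features per shot
        # query: (batch, query_dim) pooled embedding of the user query
        ctx, _ = self.self_attn(shots, shots, shots)  # shot-level self-attention
        s = self.shot_proj(ctx)                       # (batch, num_shots, hidden)
        q = self.query_proj(query).unsqueeze(1)       # (batch, 1, hidden)
        # cosine-style relevance between each shot and the query
        scores = F.cosine_similarity(s, q.expand_as(s), dim=-1)
        return scores                                 # (batch, num_shots)

if __name__ == "__main__":
    scorer = QueryAwareScorer()
    shots = torch.randn(2, 20, 512)    # 2 videos, 20 shots each
    query = torch.randn(2, 300)        # pooled query embeddings
    print(scorer(shots, query).shape)  # torch.Size([2, 20])
```

Given such per-shot relevance scores, a query-focused summary would typically be assembled by selecting the highest-scoring shots subject to a summary-length budget.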