Hierarchical Sequence Representation with Graph Network

Da Chen; Xiang Wu; Jianfeng Dong; Yuan He; Hui Xue; Feng Mao

doi:10.1109/icassp40776.2020.9054195

JOURNAL ARTICLE

Year: 2020 Pages: 2288-2292

Abstract

Video classification is a challenging task in computer vision. Performance on this task relies heavily on the scale of the training data and on the effectiveness of the video embedding produced by a robust embedding network. Unsupervised solutions such as feature average pooling, a simple, label-independent, and parameter-free method, cannot represent video sequences effectively. Supervised methods such as RNNs can improve recognition accuracy. The performance of RNN-based methods, however, degrades as video length increases and in the presence of hierarchical relationships between frames across events in the video. In this paper, we propose a novel video classification method based on a deep convolutional graph neural network (DCGN). The proposed method exploits the hierarchical structure of the video: it performs multi-level feature embedding extraction on the video frame sequence through the graph network and obtains a video representation that reflects the event semantics hierarchically. Experiments on the YouTube-8M Large-Scale Video Understanding dataset show that the proposed model outperforms commonly used RNN-based models, verifying its effectiveness for video classification.
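
The contrast the abstract draws between flat average pooling and a multi-level, graph-based embedding can be illustrated with a small sketch. The abstract does not specify DCGN's layers, so everything below is an assumption made for illustration: frames are treated as nodes of a chain graph, an unweighted neighbour average stands in for a learned graph convolution, and adjacent nodes are merged by fixed-size local pooling. The function names (`average_pool`, `chain_graph_conv`, `local_pool`, `hierarchical_embed`) are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

Vector = List[float]


def average_pool(frames: List[Vector]) -> Vector:
    """Parameter-free baseline: element-wise mean of the given feature vectors."""
    n = len(frames)
    return [sum(column) / n for column in zip(*frames)]


def chain_graph_conv(nodes: List[Vector]) -> List[Vector]:
    """One propagation step on a chain graph.

    Each node is replaced by the mean of itself and its temporal neighbours.
    Assumption: no learned weights; a real graph convolution would have them.
    """
    return [average_pool(nodes[max(0, i - 1): i + 2]) for i in range(len(nodes))]


def local_pool(nodes: List[Vector], size: int) -> List[Vector]:
    """Merge each run of `size` consecutive nodes into one coarser node."""
    return [average_pool(nodes[i:i + size]) for i in range(0, len(nodes), size)]


def hierarchical_embed(
    frames: List[Vector], size: int = 2
) -> Tuple[Vector, List[List[Vector]]]:
    """Alternate propagation and local pooling until a single node remains.

    Returns the final video-level vector and every intermediate level,
    from the frame level up to the single-node level.
    """
    if not frames:
        raise ValueError("frames must be non-empty")
    if size < 2:
        raise ValueError("size must be at least 2 so each level gets coarser")
    levels = [frames]
    nodes = frames
    while len(nodes) > 1:
        nodes = local_pool(chain_graph_conv(nodes), size)
        levels.append(nodes)
    return nodes[0], levels


# Toy input: 8 frames with 2-dimensional features (made-up values).
frames = [[float(t), float(t % 2)] for t in range(8)]
flat_vec = average_pool(frames)
video_vec, levels = hierarchical_embed(frames)
print([len(level) for level in levels])  # [8, 4, 2, 1]
```

In this sketch the intermediate levels play the role of coarser temporal units built from frames, whereas `average_pool` collapses the whole sequence in a single step and keeps no intermediate structure.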

Keywords:

Computer science; Artificial intelligence; Embedding; Pattern recognition (psychology); Pooling; Recurrent neural network; Feature extraction; Graph; Convolutional neural network; Feature (linguistics); Feature learning; Machine learning; Artificial neural network; Theoretical computer science

Metrics

FWCI (Field Weighted Citation Impact): 0.21

Citation Normalized Percentile: 0.47

Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Analysis and Summarization

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Hierarchical Video Frame Sequence Representation with Deep Convolutional Graph Network

Self-supervised Hierarchical Graph Neural Network for Graph Representation

Learning Effective Road Network Representation with Hierarchical Graph Neural Networks

HireGC: Hierarchical inductive network representation learning via graph coarsening

StarGAT: Star-Shaped Hierarchical Graph Attentional Network for Heterogeneous Network Representation Learning