VLAD-SSTA: VLAD with Soft Spatio-Temporal Assignment for Action Recognition

Shilei Cheng; Guoyi Qin; Siqi Li; Mei Xie; Zheng Ma

doi:10.1109/iccwamtip47768.2019.9067588

JOURNAL ARTICLE

Year: 2019; Vol: 2; Pages: 217-221

Abstract

Characterizing videos with both spatial and temporal information is important, especially for human action recognition: spatial cues model human appearance, while dynamic motion needs to be represented by temporal cues. The vector of locally aggregated descriptors (VLAD), whose assignment step lacks temporal information, can be regarded as a suboptimal solution for action recognition. In this paper, VLAD with a soft spatio-temporal assignment, named VLAD-SSTA, is proposed to further boost action recognition performance by giving the soft assignment a spatio-temporal characteristic. Specifically, a Spatio-Temporal Aware module is devised with a series of 3D convolutions to capture the spatio-temporal characteristic. Experimental results show that the proposed approach yields state-of-the-art performance on challenging datasets.
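The abstract does not specify the architecture, so the following is only a minimal sketch of the general idea: VLAD aggregation in which the soft assignment of each local descriptor depends on its spatio-temporal neighborhood. Everything in it is an assumption made for illustration. The function names, the per-location linear scoring, and in particular the fixed 3x3x3 box filter (a stand-in for the learned 3D convolutions of the Spatio-Temporal Aware module) are not taken from the paper.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def box_smooth_3d(scores, T, H, W, K):
    """Average per-cluster scores over each 3x3x3 (t, y, x) neighborhood.

    Assumption: a fixed box filter stands in for learned 3D convolutions.
    Border cells average only over neighbors that exist.
    """
    out = {}
    for t in range(T):
        for y in range(H):
            for x in range(W):
                acc, count = [0.0] * K, 0
                for dt in (-1, 0, 1):
                    for dy in (-1, 0, 1):
                        for dx in (-1, 0, 1):
                            key = (t + dt, y + dy, x + dx)
                            if key in scores:
                                count += 1
                                for k in range(K):
                                    acc[k] += scores[key][k]
                out[(t, y, x)] = [a / count for a in acc]
    return out

def vlad_soft_st(features, centers, weights, biases):
    """features: dict (t, y, x) -> D-dim descriptor on a dense grid.

    Returns a flat, L2-normalized vector of length K * D.
    """
    K, D = len(centers), len(centers[0])
    T = 1 + max(t for t, _, _ in features)
    H = 1 + max(y for _, y, _ in features)
    W = 1 + max(x for _, _, x in features)
    # Per-location linear scores (equivalent to a 1x1x1 convolution).
    raw = {p: [dot(weights[k], f) + biases[k] for k in range(K)]
           for p, f in features.items()}
    # Mix the scores across space and time before the softmax.
    smooth = box_smooth_3d(raw, T, H, W, K)
    vlad = [[0.0] * D for _ in range(K)]
    for p, f in features.items():
        a = softmax(smooth[p])  # soft assignment over the K clusters
        for k in range(K):
            for d in range(D):
                # Accumulate assignment-weighted residuals to each center.
                vlad[k][d] += a[k] * (f[d] - centers[k][d])
    flat = [v for row in vlad for v in row]
    norm = math.sqrt(sum(v * v for v in flat)) or 1.0
    return [v / norm for v in flat]

# Toy input: 3 frames of a 2x2 grid, 2-dim descriptors, 2 clusters.
features = {(t, y, x): [float(t), float(y + x)]
            for t in range(3) for y in range(2) for x in range(2)}
centers = [[0.0, 0.0], [2.0, 2.0]]
weights = [[-1.0, -1.0], [1.0, 1.0]]
biases = [0.0, -2.0]
encoding = vlad_soft_st(features, centers, weights, biases)
print(len(encoding))  # K * D values
```

In plain VLAD or NetVLAD-style encoding, each descriptor's assignment depends only on that descriptor. Here the scores are mixed across neighboring frames and positions before the softmax, so the assignment of a descriptor can change with its temporal context. That is the property the abstract attributes to the soft spatio-temporal assignment.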

Keywords:

Computer science; Action recognition; Artificial intelligence; Action (physics); Economic shortage; Pattern recognition (psychology); Motion (physics); Temporal database; Dynamics (music); Computer vision; Data mining

Metrics

FWCI (Field Weighted Citation Impact): 0.11

Citation Normalized Percentile: 0.49

Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Gait Recognition and Analysis

Physical Sciences → Engineering → Biomedical Engineering

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Encoding spatio-temporal distribution by generalized VLAD for action recognition

Spatio-Temporal VLAD Encoding for Human Action Recognition in Videos

Spatio-Temporal Self-Attention Weighted VLAD Neural Network for Action Recognition

Action Recognition Using Spatio-temporal Co-occurrence Features and Improved VLAD

Action Recognition with Uncertain VLAD