Cascaded temporal spatial features for video action recognition

Tingzhao Yu; Huxiang Gu; Lingfeng Wang; Shiming Xiang; Chunhong Pan

doi:10.1109/icip.2017.8296542

JOURNAL ARTICLE

Year: 2017 Pages: 1552-1556

Abstract

Extracting spatial-temporal descriptors is a challenging task in video-based human action recognition. We decouple the 3D volume of video frames directly into cascaded temporal and spatial domains via a new convolutional architecture. The motivation behind this design is to obtain deep nonlinear feature representations with fewer network parameters. First, a 1D temporal network with shared parameters is constructed to map the video sequence along the time axis into feature maps in the temporal domain. These feature maps are then organized into channels like those of an RGB image (referred to here as a Motion Image for short), which is intended to preserve both temporal and spatial information. Second, the Motion Image serves as the input to the subsequent cascaded 2D spatial network. Combining the 1D temporal network with the 2D spatial network substantially reduces the total number of network parameters. Thanks to the Motion Image, our network is an end-to-end system for action recognition that can be trained with standard backpropagation. Extensive comparative experiments on two benchmark datasets demonstrate the effectiveness of the new architecture.
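The first stage described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the function names (`temporal_conv_pixel`, `motion_image`, `kernel_weights`), the ReLU nonlinearity, the max pooling used to collapse the time axis, and the kernel sizes are all assumptions made for illustration. The sketch applies the same bank of 1D temporal kernels at every pixel and stacks the responses as image channels, then compares the weight count of one 3D kernel with that of a 1D temporal kernel plus a 2D spatial kernel.

```python
def temporal_conv_pixel(series, kernel):
    """Valid 1D correlation of one pixel's time series with one kernel, then ReLU."""
    k = len(kernel)
    return [
        max(0.0, sum(series[t + i] * kernel[i] for i in range(k)))
        for t in range(len(series) - k + 1)
    ]


def motion_image(video, kernels):
    """Map a grayscale video[t][y][x] to a multi-channel image[c][y][x].

    The same kernels are reused at every pixel (shared parameters). Each
    kernel produces one channel. Collapsing the time axis with a max is an
    assumption of this sketch, not something stated in the abstract.
    """
    T, H, W = len(video), len(video[0]), len(video[0][0])
    image = []
    for kernel in kernels:
        channel = []
        for y in range(H):
            row = []
            for x in range(W):
                series = [video[t][y][x] for t in range(T)]
                row.append(max(temporal_conv_pixel(series, kernel)))
            channel.append(row)
        image.append(channel)
    return image


def kernel_weights(kt, kh, kw):
    """Weights in one 3D kernel vs. one 1D temporal plus one 2D spatial kernel."""
    return kt * kh * kw, kt + kh * kw


if __name__ == "__main__":
    # Four 2x2 frames with a bright spot moving along the top row.
    video = [
        [[0, 0], [0, 0]],
        [[1, 0], [0, 0]],
        [[0, 1], [0, 0]],
        [[0, 0], [0, 0]],
    ]
    # Hypothetical kernels: a temporal difference and a temporal average.
    kernels = [[-1.0, 1.0], [0.5, 0.5]]
    print(motion_image(video, kernels))
    print(kernel_weights(3, 3, 3))
```

The resulting channel stack could then be passed to any ordinary 2D convolutional network, which is the role the abstract assigns to the cascaded spatial stage. The weight comparison covers a single kernel pair only; the savings for a full network depend on channel counts and depth.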

Keywords:

Computer science; Artificial intelligence; Feature (linguistics); RGB color model; Benchmark (surveying); Pattern recognition (psychology); Computer vision; Feature extraction; Task (project management); Convolutional neural network; Motion (physics); Geography

Metrics

FWCI (Field Weighted Citation Impact): 0.89

Citation Normalized Percentile: 0.79

Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Gait Recognition and Analysis

Physical Sciences → Engineering → Biomedical Engineering

Anomaly Detection Techniques and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Spatial-Temporal Separable Attention for Video Action Recognition

Video Based Action Recognition Using Spatial and Temporal Feature

Efficient Temporal-Spatial Feature Grouping For Video Action Recognition

Integrating Temporal and Spatial Attention for Video Action Recognition

Human action recognition using mid-level spatial-temporal features