Most recent work on action recognition in video employs action parts, attributes, and similar mid- and high-level features to represent an action. However, these parts and attributes suffer from weak discriminative power and are difficult to obtain. In this paper, we present an approach that uses mid-level discriminative spatial-temporal volumes to recognize human actions. Each spatial-temporal volume is represented by a feature graph built on the collection of local feature points (e.g., cuboids, STIPs) located within it. We first densely sample spatial-temporal volumes from training videos and construct a feature graph for each volume. All feature graphs are then clustered with a spectral clustering method, where the distance between two feature graphs is computed by an efficient spectral method. Treating the resulting feature-graph clusters as video words, we characterize each video within the bag-of-features framework, which we call the bag-of-feature-graphs framework. Final recognition is performed with a linear SVM classifier. We evaluate our algorithm on a publicly available human action dataset, and the experimental results demonstrate the effectiveness of our method.
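The pipeline above can be illustrated with a minimal sketch. This is not the authors' implementation: the Gaussian affinity, the sorted-Laplacian-eigenvalue graph signature (a common stand-in for spectral graph comparison), the use of k-means over those signatures in place of full spectral clustering, and all names and parameters (`graph_signature`, `video_histogram`, the synthetic data) are illustrative assumptions.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.svm import LinearSVC

def graph_signature(points, k=4):
    """Spectral signature of a feature graph: the k smallest eigenvalues of
    the graph Laplacian built from pairwise distances of the volume's
    feature points (an assumed, simplified spectral representation)."""
    d = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)
    w = np.exp(-d ** 2)              # Gaussian affinity (assumed kernel)
    np.fill_diagonal(w, 0.0)
    lap = np.diag(w.sum(axis=1)) - w  # unnormalized graph Laplacian
    eig = np.sort(np.linalg.eigvalsh(lap))
    return eig[:k]

def video_histogram(volumes, codebook):
    """Quantize each volume's feature graph to a video word and return the
    normalized bag-of-feature-graphs histogram for the video."""
    sigs = np.array([graph_signature(v) for v in volumes])
    words = codebook.predict(sigs)
    hist = np.bincount(words, minlength=codebook.n_clusters).astype(float)
    return hist / max(hist.sum(), 1.0)

# Synthetic stand-in data: each "video" is a list of densely sampled volumes,
# each volume a set of feature points in (x, y, t); the two classes differ
# only in the spatial spread of their feature points.
rng = np.random.default_rng(0)
def make_video(scale):
    return [rng.normal(scale=scale, size=(6, 3)) for _ in range(10)]

videos = [make_video(1.0) for _ in range(5)] + [make_video(3.0) for _ in range(5)]
labels = np.array([0] * 5 + [1] * 5)

# Cluster all feature-graph signatures into video words (k-means here,
# substituting for the paper's spectral clustering).
all_sigs = np.array([graph_signature(v) for vid in videos for v in vid])
codebook = KMeans(n_clusters=8, n_init=10, random_state=0).fit(all_sigs)

# Bag-of-feature-graphs histogram per video, then a linear SVM.
X = np.array([video_histogram(vid, codebook) for vid in videos])
clf = LinearSVC().fit(X, labels)
```

Representing each graph by its Laplacian spectrum keeps the comparison permutation-invariant across volumes with differently ordered feature points, which is one reason spectral methods are a natural fit for matching feature graphs.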