Spatial-temporal histograms of gradients and HOD-VLAD encoding for human action recognition

Lin Bo; Bin Fang

doi:10.1109/spac.2017.8304361


JOURNAL ARTICLE


Year: 2017
Journal: 2017 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)
Pages: 678-683


Abstract

Automatic human action recognition is a core function of systems for video surveillance and human-object interaction. In a recognition system, feature description and encoding are two crucial steps, and a powerful action recognition framework requires both to perform reliably. In this paper, we propose a new human action feature descriptor called spatial-temporal histograms of gradients (SPHOG). SPHOG is based on the spatial and temporal derivative signal, which captures the gradient changes between consecutive frames. Compared to the traditional histograms of optical flow descriptor, SPHOG requires fewer computational resources. The Vector of Locally Aggregated Descriptors (VLAD) is a popular encoding approach for Bag-of-Features representations. Its main drawback is that it considers only the differences between local descriptors and their centroids. To address this weakness, we propose an improved VLAD method called HOD-VLAD, which supplements VLAD with the distribution information of local descriptors by computing a weighted histogram of distances. We validate the proposed algorithm for human action recognition on three publicly available datasets: KTH, UCF Sports, and HMDB51. The experimental results indicate that the proposed descriptor and encoding method improve both the efficiency and the accuracy of human action recognition.
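The encoding idea in the abstract can be illustrated with a minimal sketch. The residual accumulation below is standard VLAD. The histogram-of-distances part is only an assumption about what such an extension might look like, since the abstract does not give the HOD-VLAD formula or its weighting scheme. The function names, the fixed-width binning, and the parameters `n_bins` and `max_dist` are hypothetical.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit Euclidean norm (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(a * a for a in v))
    return [a / norm for a in v] if norm > 0 else list(v)


def nearest_centroid(x, centroids):
    """Return (index, Euclidean distance) of the centroid closest to x."""
    best_k, best_d = 0, float("inf")
    for k, c in enumerate(centroids):
        d = math.sqrt(sum((a - b) ** 2 for a, b in zip(x, c)))
        if d < best_d:
            best_k, best_d = k, d
    return best_k, best_d


def vlad_with_distance_histogram(descriptors, centroids, n_bins=4, max_dist=1.0):
    """Concatenate a VLAD vector with a per-centroid histogram of distances.

    The VLAD part is the standard sum of residuals per centroid.
    The histogram part is an illustrative assumption, not the paper's method.
    """
    n_clusters, dim = len(centroids), len(centroids[0])
    residuals = [[0.0] * dim for _ in range(n_clusters)]
    hist = [[0.0] * n_bins for _ in range(n_clusters)]

    for x in descriptors:
        k, d = nearest_centroid(x, centroids)
        # Standard VLAD: accumulate the residual x - c_k for the assigned centroid.
        for j in range(dim):
            residuals[k][j] += x[j] - centroids[k][j]
        # Assumed extension: count the distance in a fixed-width bin,
        # placing distances at or beyond max_dist in the last bin.
        b = min(int(d / max_dist * n_bins), n_bins - 1)
        hist[k][b] += 1.0

    vlad = [v for row in residuals for v in row]
    hod = [v for row in hist for v in row]
    # Normalize each part separately so neither dominates the concatenation.
    return l2_normalize(vlad) + l2_normalize(hod)


# Toy example: two 2-D centroids and three local descriptors.
centroids = [[0.0, 0.0], [1.0, 1.0]]
descriptors = [[0.1, 0.0], [0.9, 1.0], [0.0, 0.2]]
encoding = vlad_with_distance_histogram(descriptors, centroids)
```

With K centroids of dimension D and B bins, the sketch returns a vector of length K·D + K·B. In practice the centroids would come from a clustering step such as k-means over training descriptors, and the bin range would be chosen from the observed distance distribution.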

Keywords:

Histogram; Computer science; Artificial intelligence; Pattern recognition (psychology); Encoding (memory); Centroid; Feature (linguistics); Representation (politics); Action recognition; Computation; Optical flow; Feature extraction; Action (physics); Key (lock); Computer vision; Image (mathematics); Algorithm; Class (philosophy)

Metrics

FWCI (Field Weighted Citation Impact): 0.39

Citation Normalized Percentile: 0.68


Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Anomaly Detection Techniques and Applications

Physical Sciences → Computer Science → Artificial Intelligence


Related Documents

A new spatial-temporal histograms of gradients descriptor and HOD-VLAD encoding for human action recognition

Spatio-Temporal VLAD Encoding for Human Action Recognition in Videos

Efficient human action recognition using histograms of motion gradients and VLAD with descriptor shape information

Encoding spatio-temporal distribution by generalized VLAD for action recognition

Agglomerative Clustering and Residual-VLAD Encoding for Human Action Recognition