JOURNAL ARTICLE

Action Recognition Using Spatial-Temporal Context

Abstract

The spatial-temporal local features and the bag of words representation have been widely used in the action recognition field. However, this framework usually neglects the internal spatial-temporal relations between video-words, resulting in ambiguity in action recognition task, especially for videos "in the wild". In this paper, we solve this problem by utilizing the volumetric context around a video-word. Here, a local histogram of video-words distribution is calculated, which is referred as the "context" and further clustered into contextual words. To effectively use the contextual information, the descriptive video-phrases (ST-DVPs) and the descriptive video-cliques (ST-DVCs) are proposed. A general framework for ST-DVP and ST-DVC generation is described, and then action recognition can be done based on all these representations and their combinations. The proposed method is evaluated on two challenging human action datasets: the KTH dataset and the YouTube dataset. Experiment results confirm the validity of our approach.

Keywords:
Computer science Histogram Artificial intelligence Action (physics) Context (archaeology) Ambiguity Representation (politics) Task (project management) Pattern recognition (psychology) Spatial contextual awareness Word (group theory) Field (mathematics) Spatial relation Natural language processing Action recognition Context model Spatial analysis Object (grammar) Mathematics Image (mathematics) Geography

Metrics

20
Cited By
3.84
FWCI (Field Weighted Citation Impact)
17
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Learning Heterogeneous Spatial–Temporal Context for Skeleton-Based Action Recognition

Xuehao GaoYang YangYang WuShaoyi Du

Journal:   IEEE Transactions on Neural Networks and Learning Systems Year: 2023 Vol: 35 (9)Pages: 12130-12141
JOURNAL ARTICLE

Human action recognition using mid-level spatial-temporal features

Taiqing WangShengjin Wang

Journal:   Journal of Image and Graphics Year: 2015 Vol: 20 (4)Pages: 520-526
JOURNAL ARTICLE

Temporal–Spatial Mapping for Action Recognition

Xiaolin SongCuiling LanWenjun ZengJunliang XingXiaoyan SunJingyu Yang

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2019 Vol: 30 (3)Pages: 748-759
© 2026 ScienceGate Book Chapters — All rights reserved.