JOURNAL ARTICLE

Mining Spatial Temporal Saliency Structure for Action Recognition

Yinan LiuQingbo WuLinfeng XuBo Wu

Year: 2016 Journal:   IEICE Transactions on Information and Systems Vol: E99.D (10)Pages: 2643-2646   Publisher: Institute of Electronics, Information and Communication Engineers

Abstract

Traditional action recognition approaches use pre-defined rigid areas to process the space-time information, e.g. spatial pyramids, cuboids. However, most action categories happen in an unconstrained manner, that is, the same action in different videos can happen at different places. Thus we need a better video representation to deal with the space-time variations. In this paper, we introduce the idea of mining spatial temporal saliency. To better handle the uniqueness of each video, we use a space-time over-segmentation approach, e.g. supervoxel. We choose three different saliency measures that take not only the appearance cues, but also the motion cues into consideration. Furthermore, we design a category-specific mining process to find the discriminative power in each action category. Experiments on action recognition datasets such as UCF11 and HMDB51 show that the proposed spatial temporal saliency video representation can match or surpass some of the state-of-the-art alternatives in the task of action recognition.

Keywords:
Computer science Discriminative model Artificial intelligence Action (physics) Representation (politics) Segmentation Pattern recognition (psychology) Process (computing) Action recognition Task (project management) Motion (physics) Space (punctuation) Machine learning Class (philosophy)

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
21
Refs
0.09
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Spatial-temporal saliency action mask attention network for action recognition

Min JiangNa PanJun Kong

Journal:   Journal of Visual Communication and Image Representation Year: 2020 Vol: 71 Pages: 102846-102846
JOURNAL ARTICLE

Temporal–Spatial Mapping for Action Recognition

Xiaolin SongCuiling LanWenjun ZengJunliang XingXiaoyan SunJingyu Yang

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2019 Vol: 30 (3)Pages: 748-759
BOOK-CHAPTER

Spatial-Temporal Attention for Action Recognition

Dengdi SunHanqing WuZhuanlian DingBin LuoJin Tang

Lecture notes in computer science Year: 2018 Pages: 854-864
© 2026 ScienceGate Book Chapters — All rights reserved.