JOURNAL ARTICLE

Hierarchical Explanations for Video Action Recognition

Abstract

To interpret deep neural networks, one main approach is to dissect the visual input and find the prototypical parts responsible for the classification. However, existing methods often ignore the hierarchical relationship between these prototypes, and thus can not explain semantic concepts at both higher level (e.g., water sports) and lower level (e.g., swimming). In this paper inspired by human cognition system, we leverage hierarchal information to deal with uncertainty. To this end, we propose HIerarchical Prototype Explainer (HIPE) to build hierarchical relations between prototypes and classes. The faithfulness of our method is verified by reducing accuracy-explainability trade-off on UCF-101 while providing multi-level explanations.

Keywords:
Leverage (statistics) Computer science Action recognition Artificial intelligence Hierarchical database model Action (physics) Deep neural networks Cognition Machine learning Artificial neural network Data mining Class (philosophy) Psychology

Metrics

13
Cited By
3.32
FWCI (Field Weighted Citation Impact)
67
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Explainable Artificial Intelligence (XAI)
Physical Sciences →  Computer Science →  Artificial Intelligence
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Learning hierarchical video representation for action recognition

Qing LiZhaofan QiuTing YaoTao MeiYong RuiJiebo Luo

Journal:   International Journal of Multimedia Information Retrieval Year: 2017 Vol: 6 (1)Pages: 85-98
JOURNAL ARTICLE

Hierarchical Semantics Interaction for Compressed Video Action Recognition

Jinxin GuoYang YangHuaiwen ZhangS-B QianChunqiang Xu

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2026 Pages: 1-1
BOOK-CHAPTER

Video Action Recognition

Kevin Ashley

Apress eBooks Year: 2020 Pages: 179-197
© 2026 ScienceGate Book Chapters — All rights reserved.