JOURNAL ARTICLE

Memory-Token Transformer for Unsupervised Video Anomaly Detection

Youyu LiXiaoning SongTianyang XuZhenhua Feng

Year: 2022 Journal:   2022 26th International Conference on Pattern Recognition (ICPR) Pages: 3325-3332

Abstract

Video anomaly detection is crucial for behavior analysis, which has witnessed continuous progress in recent years with the auto-encoder based reconstruction framework. However, in some cases, abnormal frames may also be reconstructed well due to the strong representation ability of deep networks, increasing missed detection. To mitigate this issue, the existing methods usually the memory bank method. This method records normal patterns and assigns high errors for the reconstruction of abnormal frames into normal frames. In this paper, to better use the semantic information of normal videos recorded in the memory module, we introduce the Memory-Token Transformer (MTT) to boost the reconstruction performance on normal frames. We assume that the anomalies in a video mainly concentrate on the regions containing people and relevant objects. Therefore, during the decoding stage, we first extract the semantic concepts of a feature map and generate the corresponding semantic tokens. Then the tokens are combined with the proposed memory module. Last, we introduce a transformer to fuse the complex relationship among different tokens, and use 3D convolution with the pooling operator in our encoder to enhance spatio-temporal feature extraction as compared with 2D models. The experimental results obtained on various benchmarks demonstrate the effectiveness of the proposed method.

Keywords:
Computer science Security token Artificial intelligence Transformer Anomaly detection Pattern recognition (psychology) Feature extraction Decoding methods Encoder Computer vision Autoencoder Deep learning Algorithm Voltage

Metrics

6
Cited By
0.71
FWCI (Field Weighted Citation Impact)
55
Refs
0.69
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Artificial Immune Systems Applications
Physical Sciences →  Engineering →  Biomedical Engineering

Related Documents

JOURNAL ARTICLE

Memory-guided representation matching for unsupervised video anomaly detection

Yiran TaoYaosi HuZhenzhong Chen

Journal:   Journal of Visual Communication and Image Representation Year: 2024 Vol: 101 Pages: 104185-104185
JOURNAL ARTICLE

GCN-Transformer Autoencoder with Knowledge Distillation for Unsupervised Video Anomaly Detection

Mingchao YanYonghua XiongJinhua She

Journal:   Journal of Advanced Computational Intelligence and Intelligent Informatics Year: 2025 Vol: 29 (3)Pages: 659-667
© 2026 ScienceGate Book Chapters — All rights reserved.