Vision transformer embedded video anomaly detection using attention driven recurrence

Ummay Maria Muna; Shanta Biswas; Syed Abu Ammar Muhammad Zarif; Philip Jefferson Deori; Tauseef Tajwar; Swakkhar Shatabda

doi:10.1016/j.array.2025.100471

ScienceGate Book Chapters

JOURNAL ARTICLE

Vision transformer embedded video anomaly detection using attention driven recurrence

Ummay Maria Muna Shanta Biswas Syed Abu Ammar Muhammad Zarif Philip Jefferson Deori Tauseef Tajwar Swakkhar Shatabda

Year: 2025 Journal: Array Vol: 27 Pages: 100471-100471 Publisher: Elsevier BV

DOI: 10.1016/j.array.2025.100471

Get Full-Text PDF Get Analytical Report

Abstract

Automated video anomaly detection (VAD) is a challenging task due to its context-dependent and sporadic nature. However, recent deep learning advancements offer promising solutions. In this paper, we propose a novel framework for detecting anomalies in videos by uniquely analyzing spatial and temporal (spatio-temporal) features. We address challenges such as the processing of lengthy videos and the sparse occurrence of anomalies by segmenting and labeling anomalous parts within videos. We employ a modified pre-trained vision transformer for video feature extraction, leveraging its ability to capture complex spatio-temporal patterns and the global context. Additionally, we incorporate a parameter-efficient recurrent model, the Simple Recurrent Unit Plus Plus (SRU++), which processes long sequential video embeddings efficiently by reducing computational costs by ten times compared to traditional methods. To further enhance the multiclass prediction performance, we develop a cluster-based weighting mechanism that assigns weights to classification scores based on feature similarity. We extensively evaluated our approach on three popular datasets — UCF-Crime, RWF-2000, and Smart City CCTV Violence Detection (SCVD) — achieving superior performance compared to state-of-the-art methods, making it well-suited for real-world surveillance applications.

Keywords:

Anomaly detection Transformer Computer science Computer vision Artificial intelligence Anomaly (physics) Engineering Electrical engineering Physics Voltage

Metrics

Cited By

4.82

FWCI (Field Weighted Citation Impact)

Refs

0.94

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Anomaly Detection Techniques and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Network Security and Intrusion Detection

Physical Sciences → Computer Science → Computer Networks and Communications

Digital Media Forensic Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Vision transformer embedded video anomaly detection using attention driven recurrence

Abstract

Metrics

Citation History

Topics

Related Documents

TransAnomaly: Video Anomaly Detection Using Video Vision Transformer

Video Anomaly Detection using Factorized Self-Attention Transformer

Video Anomaly Detection with Video Vision Transformer

Video Anomaly Detection Using Encoder-Decoder Networks with Video Vision Transformer and Channel Attention Blocks

Unsupervised Video Anomaly Detection Using Video Vision Transformer and Adversarial Training