JOURNAL ARTICLE

Multi-Modal Anomaly Detection by Using Audio and Visual Cues

Ata Ur RehmanHafiz Sami UllahHaroon FarooqMuhammad Salman KhanTayyeb MahmoodHafiz Owais Ahmed Khan

Year: 2021 Journal:   IEEE Access Vol: 9 Pages: 30587-30603   Publisher: Institute of Electrical and Electronics Engineers

Abstract

This paper considers the problem of anomaly detection in an outdoor environment where surveillance cameras are usually installed to monitor activities of general public. A novel solution is proposed which combines audio and visual data to automatically detect abnormal activities. The proposed anomaly detection algorithm makes use of both visual and audio features to automatically detect anomalous activities in scenes. Visual features such as optical flow technique combined with particle swam optimization and social force model are used, whereas, acoustic features such as, energy, zero crossing rate, volume, spectral-centroid, spectral spread, spectral roll-off, spectral flux, cross correlation and the mel-frequency cepstral coefficients (MFCCs) are used. An anomaly inference is developed which is based on both visual and audio features. The performance of the proposed algorithm is evaluated by testing it on the publicly available UMN datasets combined with the audio recordings. The proposed algorithm is compared with state-of-the-art techniques and is shown to achieve improved performance in terms of accuracy.

Keywords:
Computer science Anomaly detection Centroid Artificial intelligence Mel-frequency cepstrum Pattern recognition (psychology) Speech recognition Anomaly (physics) Ground truth Optical flow Computer vision Cepstrum Feature extraction Image (mathematics)

Metrics

35
Cited By
3.39
FWCI (Field Weighted Citation Impact)
60
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.