Zero-Shot Anomalous Sound Detection in Domestic Environments Using Large-Scale Pretrained Audio Pattern Recognition Models

Alessandro Ilic Mezza; Giulio Zanetti; Máximo Cobos; Fabio Antonacci

doi:10.1109/icassp49357.2023.10095736

ScienceGate Book Chapters

JOURNAL ARTICLE

Zero-Shot Anomalous Sound Detection in Domestic Environments Using Large-Scale Pretrained Audio Pattern Recognition Models

Alessandro Ilic Mezza Giulio Zanetti Máximo Cobos Fabio Antonacci

Year: 2023 Vol: 20 Pages: 1-5

DOI: 10.1109/icassp49357.2023.10095736

Get Full-Text PDF Get Analytical Report

Abstract

Anomalous sound detection is central to audio-based surveillance and monitoring. In a domestic environment, however, the classes of sounds to be considered anomalous are situation-dependent and cannot be determined in advance. At the same time, it is not feasible to expect a demanding labeling effort from the end user. To address these problems, we present a novel zero-shot method relying on an auxiliary large-scale pretrained audio neural network in support of an unsupervised anomaly detector. The auxiliary module is tasked to generate a fingerprint for each sound occasionally registered by the user. These fingerprints are then compared with those extracted from the input audio stream, and the resulting similarity score is used to increase or reduce the sensitivity of the base detector. Experimental results on synthetic data show that the proposed method substantially improves upon the unsupervised base detector and is capable of outperforming existing few-shot learning systems developed for machine condition monitoring without involving additional training.

Keywords:

Computer science Detector Shot (pellet) Fingerprint (computing) Similarity (geometry) Anomaly detection Speech recognition Base (topology) Artificial intelligence Data stream Sound (geography) Pattern recognition (psychology) Image (mathematics) Acoustics

Metrics

Cited By

1.07

FWCI (Field Weighted Citation Impact)

Refs

0.71

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Anomaly Detection Techniques and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Zero-Shot Anomalous Sound Detection in Domestic Environments Using Large-Scale Pretrained Audio Pattern Recognition Models

Abstract

Metrics

Citation History

Topics

Related Documents

PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition (Pretrained Models)

PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition

Slides for: PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition

Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models

Zero-Shot Document Classification Using Pretrained Models