Abstract

As the labor force decreases, the demand for labor-saving automatic anomalous sound detection technology that conducts maintenance of industrial equipment has grown. Conventional approaches detect anomalies based on the reconstruction errors of an autoencoder. However, when the target machine sound is non-stationary, a reconstruction error tends to be large independent of an anomaly, and its variations increased because of the difficulty of predicting the edge frames. To solve the issue, we propose an approach to anomalous detection in which the model utilizes multiple frames of a spectrogram whose center frame is removed as an input, and it predicts an interpolation of the removed frame as an output. Rather than predicting the edge frames, the proposed approach makes the reconstruction error consistent with the anomaly. Experimental results showed that the proposed approach achieved 27% improvement based on the standard AUC score, especially against non-stationary machinery sounds.

Keywords:
Spectrogram Autoencoder Interpolation (computer graphics) Computer science Anomaly detection Anomaly (physics) Frame (networking) Artificial intelligence Enhanced Data Rates for GSM Evolution Artificial neural network Pattern recognition (psychology) Speech recognition Computer vision Image (mathematics) Telecommunications

Metrics

153
Cited By
11.60
FWCI (Field Weighted Citation Impact)
21
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Water Systems and Optimization
Physical Sciences →  Engineering →  Civil and Structural Engineering
Digital Media Forensic Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.