JOURNAL ARTICLE

A Deep Learning Based Sound Event Location and Detection Algorithm Using Convolutional Recurrent Neural Network

Abstract

With the application of sound event detection in more and more fields, an accurate sound event location and detection system has attracted wide attention. In this paper, we propose a sound event location and detection algorithm based on convolutional recurrent neural network (CRNN). In the offline phase, complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) algorithm is used to remove the noise of unknown distribution of the collected data set. Then, we extract filter banks (FBANK) features and cross correlation (GCC) features of each channel and fuse them. Finally, the features are input to CRNN which combined with soft attention mechanism to train the model. The CRNN is a multi-task learning framework. For sound category and sound location, it is realized by classification task and regression task respectively. Experimental results show that the algorithm is effective and can provide accurate category estimation and location estimation.

Keywords:
Computer science Noise (video) Convolutional neural network Recurrent neural network Event (particle physics) Artificial intelligence Fuse (electrical) Task (project management) Algorithm Set (abstract data type) Pattern recognition (psychology) Filter (signal processing) Artificial neural network Speech recognition Computer vision

Metrics

6
Cited By
1.17
FWCI (Field Weighted Citation Impact)
19
Refs
0.72
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.