Sound Event Detection (SED) is a core task in machine listening that aims to mimic the human auditory system's ability to recognize sounds. Recently, convolutional recurrent neural networks (CRNNs) have attained state-of-the-art SED performance. In a CRNN, the convolution module extracts local time-frequency information from the audio; however, the limited size of the convolution kernel prevents it from capturing global context. To address this shortcoming, the convolution module is replaced with a conformer block, which combines the advantages of transformers and convolutional neural networks to model both the local and global dependencies of audio sequences. Compared with CNN, RNN, and CRNN models on the TUT-SED 2017 dataset, the proposed method improves the F1-score by 9.86% and reduces the error rate (ER) by 0.1235 on the development set, and improves the F1-score by 9.13% and reduces the ER by 0.0836 on the evaluation set. Experimental results demonstrate the effectiveness and superiority of the proposed approach.
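The contrast between the two operations can be made concrete. The sketch below is not the paper's implementation; it is a minimal numpy illustration, under the assumption of a toy 100-frame feature sequence, of why a convolution sees only a kernel-sized neighbourhood of frames while self-attention (the global half of a conformer block) lets every frame attend to every other frame.

```python
import numpy as np

def conv1d(x, kernel):
    """Local context: each output frame depends only on kernel-sized neighbours."""
    k = len(kernel)
    pad = k // 2
    xp = np.pad(x, (pad, pad))
    return np.array([np.dot(xp[i:i + k], kernel) for i in range(len(x))])

def self_attention(X):
    """Global context: every frame attends to every other frame in the sequence."""
    d = X.shape[1]
    scores = X @ X.T / np.sqrt(d)            # (frames, frames) similarity matrix
    scores -= scores.max(axis=1, keepdims=True)  # numerical stability for softmax
    w = np.exp(scores)
    w /= w.sum(axis=1, keepdims=True)        # rows are attention distributions
    return w @ X

rng = np.random.default_rng(0)
X = rng.standard_normal((100, 8))            # toy "spectrogram": 100 frames, 8 features

y_conv = conv1d(X[:, 0], np.ones(3) / 3)     # frame t sees only frames t-1..t+1
y_attn = self_attention(X)                   # frame t sees all 100 frames
```

A conformer block interleaves both: a self-attention sub-layer for long-range structure and a depthwise-convolution sub-layer for fine local time-frequency patterns, which is the combination the abstract credits for the improvement over a plain CRNN.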