JOURNAL ARTICLE

Polyphonic Sound Event Detection Based on Transfer Learning Convolutional Retentive Network

CHEN Pengfei, XIA Xiuyu

Year: 2025
Journal: DOAJ (Directory of Open Access Journals)

Abstract

Polyphonic sound event detection suffers from a shortage of strongly annotated datasets and from sharp performance degradation in real-world scenarios. To address these problems, a polyphonic sound event detection method based on a transfer learning convolutional retentive network is proposed. First, convolutional blocks with pre-trained weights extract local features from the audio data. These local features, together with orientation features, are then passed to a residual feature enhancement module for feature fusion and channel dimension reduction. Finally, the fused features are fed into a retentive network with regularization to further learn the temporal information in the audio data. Experimental results show that, compared with the champion system of the DCASE challenge, the method reduces the error rate by 0.277 and 0.106 and increases the F1 score by 22.6% and 6.6% on the development and evaluation sets of the DCASE 2016 Task 3 dataset, respectively. On the development and evaluation sets of the DCASE 2017 Task 3 dataset, the error rate is reduced by 0.22 and 0.123 and the F1 score is increased by 17.2% and 14.4%, respectively.
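
The abstract reports error rate and F1 score without defining them. The following is a minimal sketch, assuming they are the segment-based metrics commonly used to evaluate DCASE Task 3 systems (an assumption; the abstract does not state the metric variant). The function name `segment_metrics` and the toy labels are hypothetical and serve only as illustration.

```python
def segment_metrics(reference, predicted):
    """Segment-based error rate and micro-averaged F1 (assumed metric definitions).

    reference, predicted: lists of sets, one set of active event labels
    per fixed-length segment (hypothetical input format).
    """
    tp = fp = fn = 0
    subs = dels = ins = 0
    n_ref = 0
    for ref, pred in zip(reference, predicted):
        seg_tp = len(ref & pred)   # events active in both reference and output
        seg_fp = len(pred - ref)   # events output but not in the reference
        seg_fn = len(ref - pred)   # reference events that were missed
        tp += seg_tp
        fp += seg_fp
        fn += seg_fn
        # Within a segment, a false positive paired with a false negative
        # counts as one substitution; the remainder is a deletion or insertion.
        subs += min(seg_fp, seg_fn)
        dels += max(0, seg_fn - seg_fp)
        ins += max(0, seg_fp - seg_fn)
        n_ref += len(ref)
    error_rate = (subs + dels + ins) / n_ref if n_ref else 0.0
    denom = 2 * tp + fp + fn
    f1_score = 2 * tp / denom if denom else 0.0
    return error_rate, f1_score


# Toy example with four segments and made-up labels.
reference = [{"car", "speech"}, {"car"}, set(), {"speech"}]
predicted = [{"car"}, {"car", "dog"}, {"dog"}, {"car"}]
er, f1 = segment_metrics(reference, predicted)
print(f"ER = {er:.3f}, F1 = {f1:.3f}")
```

Under these definitions a lower error rate and a higher F1 score are better. The error rate can exceed 1 when the system output contains many insertions.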

Keywords:
Pattern recognition (psychology); Convolutional neural network; Transfer of learning; Residual; Feature (linguistics); Channel (broadcasting); Event (particle physics); Annotation

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 0
Citation Normalized Percentile: 0.70

Topics

Music and Audio Processing: Physical Sciences → Computer Science → Signal Processing
Speech and Audio Processing: Physical Sciences → Computer Science → Signal Processing
Voice and Speech Disorders: Health Sciences → Medicine → Physiology