JOURNAL ARTICLE

Semi-Supervised Sound Event Detection with Local and Global Consistency Regularization

Abstract

Learning meaningful frame-wise features from a partially labeled dataset is crucial to semi-supervised sound event detection. Prior works either enforce consistency on frame-level predictions or seek feature-level similarity among neighboring frames, and so cannot fully exploit the potential of unlabeled data. In this work, we design a Local and Global Consistency (LGC) regularization scheme that enhances the model at both the label and feature levels. Audio CutMix is introduced to change the contextual information of clips. Local consistency is then adopted to encourage the model to leverage local features for frame-level predictions, and global consistency is applied to force features to align with global prototypes through a specially designed contrastive loss. Experiments on the DESED dataset indicate the superiority of LGC, which surpasses its respective competitors by a large margin under the same settings. Moreover, combining LGC with existing methods yields further improvements. The code is available at https://github.com/Ming-er/LGC-SED.
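As a rough illustration of the audio CutMix and local consistency ideas summarized in the abstract, the sketch below swaps a contiguous span of frames between two clips and scores a frame-level prediction against the equally mixed target. It is a minimal toy under stated assumptions, not the authors' implementation: the function names (`sample_segment`, `cutmix_frames`, `mean_squared_error`), the use of mean squared error, the segment-length bounds, and the toy teacher predictions are all hypothetical choices made for illustration.

```python
import random


def sample_segment(num_frames, min_len, max_len, rng):
    """Pick a contiguous [start, end) frame span to swap (hypothetical helper)."""
    length = rng.randint(min_len, max_len)        # inclusive bounds
    start = rng.randint(0, num_frames - length)   # span must fit in the clip
    return start, start + length


def cutmix_frames(seq_a, seq_b, start, end):
    """Return a copy of seq_a whose frames [start, end) are taken from seq_b."""
    if len(seq_a) != len(seq_b):
        raise ValueError("clips must have the same number of frames")
    return seq_a[:start] + seq_b[start:end] + seq_a[end:]


def mean_squared_error(pred, target):
    """Frame-wise MSE between two [frames][classes] nested lists."""
    total, count = 0.0, 0
    for p_frame, t_frame in zip(pred, target):
        for p, t in zip(p_frame, t_frame):
            total += (p - t) ** 2
            count += 1
    return total / count


rng = random.Random(0)
num_frames = 8

# Toy frame-level predictions for two clips with one event class each.
# Assumption: these stand in for predictions made on the unmixed clips.
teacher_a = [[0.9] for _ in range(num_frames)]
teacher_b = [[0.1] for _ in range(num_frames)]

# The same span would be applied to the input features and to the targets,
# so that each frame's target follows the clip the frame came from.
start, end = sample_segment(num_frames, min_len=2, max_len=4, rng=rng)
mixed_target = cutmix_frames(teacher_a, teacher_b, start, end)

# A prediction that ignores the pasted span is penalized,
# while one that tracks the local content is not.
loss_ignoring = mean_squared_error(teacher_a, mixed_target)
loss_matching = mean_squared_error(mixed_target, mixed_target)
```

In this toy setting the loss is nonzero only on the pasted frames, which is the sense in which a consistency term of this kind pushes frame-level predictions to depend on local content rather than on the surrounding context of the clip. The global, prototype-based contrastive term mentioned in the abstract is not sketched here.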

Keywords:
Regularization (linguistics); Computer science; Consistency (knowledge bases); Event (particle physics); Local consistency; Artificial intelligence; Speech recognition; Physics

Metrics

Cited By: 5
FWCI (Field Weighted Citation Impact): 3.56
Refs: 33
Citation Normalized Percentile: 0.87

Topics

Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

RCT: Random consistency training for semi-supervised sound event detection

Nian Shao, Erfan Loweimi, Xiaofei Li

Journal:   Interspeech 2022 Year: 2022 Pages: 1541-1545
JOURNAL ARTICLE

Semi-supervised learning with local and global consistency

Jie Gui, Rong-Xiang Hu, Zhong‐Qiu Zhao, Jia Wei

Journal:   International Journal of Computer Mathematics Year: 2013 Vol: 91 (11) Pages: 2389-2402
JOURNAL ARTICLE

On Local Temporal Embedding for Semi-Supervised Sound Event Detection

Lijian Gao, Qirong Mao, Ming Dong

Journal:   IEEE/ACM Transactions on Audio Speech and Language Processing Year: 2024 Vol: 32 Pages: 1687-1698