HODGEPODGE: Sound Event Detection Based on Ensemble of Semi-Supervised Learning Methods

Ziqiang Shi; Liu Liu; Huibin Lin; Rujie Liu; Anyan Shi

doi:10.33682/9kcj-bq06

ScienceGate Book Chapters

JOURNAL ARTICLE

HODGEPODGE: Sound Event Detection Based on Ensemble of Semi-Supervised Learning Methods

Ziqiang Shi Liu Liu Huibin Lin Rujie Liu Anyan Shi

Year: 2019 Pages: 224-228

DOI: 10.33682/9kcj-bq06

Get Full-Text PDF Get Analytical Report

Abstract

In this paper, we present a method called HODGEPODGE\\footnotemark[1] for large-scale detection of sound events using weakly labeled, synthetic, and unlabeled data proposed in the Detection and Classification of Acoustic Scenes and Events (DCASE) 2019 challenge Task 4: Sound event detection in domestic environments. To perform this task, we adopted the convolutional recurrent neural networks (CRNN) as our backbone network. In order to deal with a small amount of tagged data and a large amounts of unlabeled in-domain data, we aim to focus primarily on how to apply semi-supervise learning methods efficiently to make full use of limited data. Three semi-supervised learning principles have been used in our system, including: 1) Consistency regularization applies data augmentation; 2) MixUp regularizer requiring that the predictions for a interpolation of two inputs is close to the interpolation of the prediction for each individual input; 3) MixUp regularization applies to interpolation between data augmentations. We also tried an ensemble of various models, which are trained by using different semi-supervised learning principles. Our proposed approach significantly improved the performance of the baseline, achieving the event-based f-measure of 42.0\\% compared to 25.8\\% event-based f-measure of the baseline in the provided official evaluation dataset. Our submissions ranked third among 18 teams in the task 4.

Keywords:

Computer science Regularization (linguistics) Machine learning Artificial intelligence Task (project management) Interpolation (computer graphics) Consistency (knowledge bases) Event (particle physics) Labeled data Convolutional neural network Focus (optics) Baseline (sea) Pattern recognition (psychology)

Metrics

Cited By

2.46

FWCI (Field Weighted Citation Impact)

Refs

0.90

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music Technology and Sound Studies

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

HODGEPODGE: Sound Event Detection Based on Ensemble of Semi-Supervised Learning Methods

Abstract

Metrics

Citation History

Topics

Related Documents

Regression-based Sound Event Detection with Semi-supervised Learning

Semi-supervised Sound Event Detection Based on Meta Learning

Couple learning for semi-supervised sound event detection

An Effective Perturbation Based Semi-Supervised Learning Method for Sound Event Detection

Guided Learning for Weakly-Labeled Semi-Supervised Sound Event Detection