Zhor Diffallah, Hadjer Ykhlef, Hafida Bouarfa
Sound event detection refers to the task of categorizing the types of events occurring in an audio recording, in addition to pinpointing the start and end times of each occurrence. This task has recently grown in popularity as a result of its potential to enhance a myriad of applications. Building sound event detection systems relies heavily on the representational power of deep neural network architectures. Such architectures require a large amount of strongly annotated audio data, in which the exact temporal location of each sound event is indicated. However, manually annotating audio recordings with the types of events present and their corresponding time boundaries is both costly and laborious. To mitigate this, learning from weak labels has been adopted in an attempt to bypass the labeling barrier. In this paper, we examine the effect of incorporating weakly-labeled data into the training process of sound event detection systems. Moreover, we analyze the behavior of the Mean Teacher framework under various deep learning configurations. Our experimental results reveal that training a well-calibrated Mean Teacher architecture on weakly-labeled data can improve the predictive performance of sound event detection systems.
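The Mean Teacher framework mentioned above maintains a teacher model whose weights are an exponential moving average (EMA) of the student's weights, and adds a consistency term that pulls the student's predictions toward the teacher's. The following is a minimal NumPy sketch of those two ingredients only; the function names, the dictionary-of-arrays weight representation, and the default smoothing factor are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def ema_update(teacher_weights, student_weights, alpha=0.999):
    """Update the teacher as an exponential moving average of the student.

    Both arguments are dicts mapping parameter names to NumPy arrays;
    alpha is the EMA smoothing factor (an assumed default).
    """
    return {name: alpha * teacher_weights[name] + (1 - alpha) * student_weights[name]
            for name in teacher_weights}

def consistency_loss(student_pred, teacher_pred):
    """Mean squared error between student and teacher class probabilities."""
    return float(np.mean((student_pred - teacher_pred) ** 2))
```

In a training loop, the student is updated by gradient descent on the supervised loss (computed on the weakly-labeled clips) plus a weighted `consistency_loss`, after which `ema_update` refreshes the teacher; only the student receives gradients.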