Sound event detection is important for many applications, such as audio content retrieval, intelligent monitoring, and scene-based interaction. Traditional studies on this topic mainly focus on identifying a single sound event class. In real applications, however, several sound events usually occur concurrently and with different durations, which leads to a new detection task: polyphonic sound event classification together with event time boundaries. In this paper, we propose an augmented strategy for this task, which faces the challenges of a large amount of unbalanced and weakly labelled training data. Specifically, the strategy includes data augmentation that enriches the training set to reduce data imbalance, a new loss function that combines cross entropy and F-score, and model fusion that integrates the strengths of different classifiers. The performance of the strategy is validated on the DCASE2019 dataset, and both event and segment detection are significantly improved over the baseline system.
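The abstract does not specify how cross entropy and F-score are combined, so the following is only a minimal sketch of one common approach: a differentiable "soft" F1 term (computed from predicted probabilities rather than hard decisions) added to binary cross entropy with a weighting factor `lam`. All function names and the weighting scheme are illustrative assumptions, not the paper's actual loss.

```python
import numpy as np

def soft_f1_loss(p, y, eps=1e-8):
    """Soft F1 loss: 1 - F1, with counts computed from probabilities
    so the expression stays differentiable w.r.t. p."""
    tp = np.sum(p * y)            # soft true positives
    fp = np.sum(p * (1.0 - y))    # soft false positives
    fn = np.sum((1.0 - p) * y)    # soft false negatives
    f1 = 2.0 * tp / (2.0 * tp + fp + fn + eps)
    return 1.0 - f1

def bce_loss(p, y, eps=1e-8):
    """Binary cross entropy averaged over all frame/class cells."""
    p = np.clip(p, eps, 1.0 - eps)
    return float(np.mean(-(y * np.log(p) + (1.0 - y) * np.log(1.0 - p))))

def combined_loss(p, y, lam=1.0):
    """Hypothetical combined objective: cross entropy plus a
    weighted soft F-score term (lam is an assumed hyperparameter)."""
    return bce_loss(p, y) + lam * soft_f1_loss(p, y)

# p: predicted per-frame, per-class probabilities; y: binary labels
y = np.array([1.0, 0.0, 1.0, 0.0])
good = np.array([0.9, 0.1, 0.8, 0.2])
bad = np.array([0.2, 0.8, 0.3, 0.7])
print(combined_loss(good, y), combined_loss(bad, y))
```

Under class imbalance, the F-score term directly rewards precision/recall on the rare positive class, which plain cross entropy does not.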