Weakly-Supervised Sound Event Detection with Self-Attention

Koichi Miyazaki; Tatsuya Komatsu; Tomoki Hayashi; Shinji Watanabe; Tomoki Toda; Kazuya Takeda

doi:10.1109/icassp40776.2020.9053609

ScienceGate Book Chapters

JOURNAL ARTICLE

Weakly-Supervised Sound Event Detection with Self-Attention

Koichi Miyazaki Tatsuya Komatsu Tomoki Hayashi Shinji Watanabe Tomoki Toda Kazuya Takeda

Year: 2020 Pages: 66-70

DOI: 10.1109/icassp40776.2020.9053609

Get Full-Text PDF Get Analytical Report

Abstract

In this paper, we propose a novel sound event detection (SED) method that incorporates a self-attention mechanism of the Transformer for a weakly-supervised learning scenario. The proposed method utilizes the Transformer encoder, which consists of multiple self-attention modules, allowing to take both local and global context information of the input feature sequence into account. Furthermore, inspired by the great success of BERT in the natural language processing field, the proposed method introduces a special tag token into the input sequence for weak label prediction, which enables the aggregation of the whole sequence information. To demonstrate the performance of the proposed method, we conduct the experimental evaluation using the DCASE2019 Task4 dataset. The experimental results demonstrate that the proposed method outperforms the DCASE2019 Task4 baseline method, which is based on the convolutional recurrent neural network, and the self-attention mechanism effectively works for SED.

Keywords:

Computer science Transformer Artificial intelligence Encoder Recurrent neural network Security token Sequence labeling Convolutional neural network Pattern recognition (psychology) Speech recognition Machine learning Artificial neural network Task (project management) Voltage Engineering

Metrics

Cited By

7.39

FWCI (Field Weighted Citation Impact)

Refs

0.98

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music Technology and Sound Studies

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Weakly-Supervised Sound Event Detection with Self-Attention

Abstract

Metrics

Citation History

Topics

Related Documents

Weakly Labeled Semi-Supervised Sound Event Detection with Multi-Scale Residual Attention

Sparse Self-Attention for Semi-Supervised Sound Event Detection

Duration Robust Weakly Supervised Sound Event Detection

Improving Weakly Supervised Sound Event Detection with Causal Intervention

Towards Duration Robust Weakly Supervised Sound Event Detection