JOURNAL ARTICLE

Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network

Abstract

In this paper, we present a gated convolutional neural network and a temporal attention-based localization method for audio classification, which won the 1st place in the large-scale weakly supervised sound event detection task of Detection and Classification of Acoustic Scenes and Events (DCASE) 2017 challenge. The audio clips in this task, which are extracted from YouTube videos, are manually labelled with one or more audio tags, but without time stamps of the audio events, hence referred to as weakly labelled data. Two subtasks are defined in this challenge including audio tagging and sound event detection using this weakly labelled data. We propose a convolutional recurrent neural network (CRNN) with learnable gated linear units (GLUs) non-linearity applied on the log Mel spectrogram. In addition, we propose a temporal attention method along the frames to predict the locations of each audio event in a chunk from the weakly labelled data. The performances of our systems were ranked the 1st and the 2nd as a team in these two sub-tasks of DCASE 2017 challenge with F value 55.6% and Equal error 0.73, respectively.

Keywords:
Spectrogram Computer science Convolutional neural network Event (particle physics) Artificial intelligence Task (project management) Speech recognition Pattern recognition (psychology) Recurrent neural network Scale (ratio) Audio signal processing Audio signal Artificial neural network Speech coding

Metrics

202
Cited By
24.74
FWCI (Field Weighted Citation Impact)
25
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music Technology and Sound Studies
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Image Classification using Supervised Convolutional Neural Network

Saripalli Sri SravyaK. Sri Rama KrishnaPallikonda Sarah Suhasini

Journal:   International Journal of Recent Technology and Engineering (IJRTE) Year: 2019 Vol: 9 (2)Pages: 4505-4507
JOURNAL ARTICLE

Weakly Supervised Bilinear Convolutional Neural Network for Fine-Grained Vehicle Classification

Linhao LiHan ZangXiaojuan FanHao ChengYongfeng Dong

Journal:   IEEE Transactions on Intelligent Transportation Systems Year: 2025 Vol: 26 (12)Pages: 22470-22481
JOURNAL ARTICLE

Audio classification using attention-augmented convolutional neural network

Yu WuHua MaoYi Zhang

Journal:   Knowledge-Based Systems Year: 2018 Vol: 161 Pages: 90-100
© 2026 ScienceGate Book Chapters — All rights reserved.