Audio Event Detection Using Deep Neural Networks

Minkyu Lim; Dong‐Hyun Lee; Ho-Sung Park; Ji‐Hwan Kim

doi:10.9728/dcs.2017.18.1.183

ScienceGate Book Chapters

JOURNAL ARTICLE

Audio Event Detection Using Deep Neural Networks

Minkyu Lim Dong‐Hyun Lee Ho-Sung Park Ji‐Hwan Kim

Year: 2017 Journal: Journal of Digital Contents Society Vol: 18 (1)Pages: 183-190 Publisher: Digital Contents Society

DOI: 10.9728/dcs.2017.18.1.183

Get Full-Text PDF Get Analytical Report

Abstract

본 논문에서는 깊은 신경망을 이용한 오디오 이벤트 검출 방법을 제안한다. 오디오 입력의 매 프레임에 대한 오디오 이벤트 확률을 feed-forward 신경망을 적용하여 생성한다. 매 프레임에 대하여 멜 스케일 필터 뱅크 특징을 추출한 후, 해당 프레임의 전후 프레임으로부터의 특징벡터들을 하나의 특징벡터로 결합하고 이를 feed-forward 신경망의 입력으로 사용한다. 깊은 신경망의 출력층은 입력 프레임 특징값에 대한 오디오 이벤트 확률값을 나타낸다. 연속된 5개 이상의 프레임에서의 이벤트 확률값이 임계값을 넘을 경우 해당 구간이 오디오 이벤트로 검출된다. 검출된 오디오 이벤트는 1초 이내에 동일 이벤트로 검출되는 동안 하나의 오디오 이벤트로 유지된다. 제안된 방법으로 구현된 오디오 이벤트 검출기는 UrbanSound8K와 BBC Sound FX자료에서의 20개 오디오 이벤트에 대하여 71.8%의 검출 정확도를 보였다. This paper proposes an audio event detection method using Deep Neural Networks (DNN). The proposed method applies Feed Forward Neural Network (FFNN) to generate output probabilities of twenty audio events for each frame. Mel scale filter bank (FBANK) features are extracted from each frame, and its five consecutive frames are combined as one vector which is the input feature of the FFNN. The output layer of FFNN produces audio event probabilities for each input feature vector. More than five consecutive frames of which event probability exceeds threshold are detected as an audio event. An audio event continues until the event is detected within one second. The proposed method achieves as 71.8% accuracy for 20 classes of the UrbanSound8K and the BBC Sound FX dataset.

Keywords:

Computer science Event (particle physics) Artificial neural network Artificial intelligence Deep neural networks

Metrics

Cited By

0.20

FWCI (Field Weighted Citation Impact)

Refs

0.44

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music Technology and Sound Studies

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Audio Event Detection Using Deep Neural Networks

Abstract

Metrics

Citation History

Topics

Related Documents

Audio-based multimedia event detection using deep recurrent neural networks

Audio Event Classification Using Deep Neural Networks

Audio event classification using deep neural networks

Disrupting Audio Event Detection Deep Neural Networks with White Noise

Sound event detection using deep neural networks