JOURNAL ARTICLE

Robust sound event recognition using convolutional neural networks

Abstract

Traditional sound event recognition methods based on informative front-end features such as MFCCs, with back-end sequencing methods such as HMMs, tend to perform poorly in the presence of interfering acoustic noise. Since noise corruption may be unavoidable in practical situations, it is important to develop more robust features and classifiers. Recent advances in this field use powerful machine learning techniques with high-dimensional input features such as spectrograms or auditory images. These improve robustness largely thanks to the discriminative capabilities of the back-end classifiers. We extend this further by proposing novel features derived from spectrogram energy triggering, allied with the powerful classification capabilities of a convolutional neural network (CNN). The proposed method demonstrates excellent performance under noise-corrupted conditions when compared against state-of-the-art approaches on standard evaluation tasks. To the authors' knowledge, this is the first application of CNNs in this field.
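The abstract does not detail the spectrogram energy-triggering front end, so the sketch below is only one plausible reading, not the paper's actual method: threshold per-frame spectrogram energy to locate an acoustic event, then cut a fixed-size time-frequency patch around it to serve as 2-D input to a CNN. The function name `energy_trigger_patch` and the median-relative threshold `ratio` are illustrative assumptions.

```python
import numpy as np

def energy_trigger_patch(spec, patch_frames=32, ratio=3.0):
    """Locate the highest-energy region of a spectrogram and cut a
    fixed-size patch around it, suitable as 2-D input to a CNN.
    The median-relative threshold `ratio` is an illustrative choice,
    not the criterion used in the paper."""
    energy = spec.sum(axis=0)                       # per-frame energy
    active = energy > ratio * np.median(energy)     # "triggered" frames
    if active.any():
        centre = int(np.flatnonzero(active).mean()) # centre of the event
    else:
        centre = int(energy.argmax())               # fall back to peak frame
    half = patch_frames // 2
    start = min(max(centre - half, 0), spec.shape[1] - patch_frames)
    return spec[:, start:start + patch_frames]

# Synthetic example: 40 mel bands x 100 frames of low-level noise,
# with a louder "event" spanning frames 50-70.
rng = np.random.default_rng(0)
spec = rng.random((40, 100)) * 0.1
spec[:, 50:70] += 1.0
patch = energy_trigger_patch(spec)
print(patch.shape)  # (40, 32)
```

The fixed patch size matters because a CNN expects constant input dimensions; the triggering step supplies translation invariance in time by centring the patch on the event rather than on an arbitrary window position.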

Keywords:
Spectrogram, Computer science, Convolutional neural network, Robustness, Discriminative model, Artificial intelligence, Speech recognition, Mel-frequency cepstrum, Pattern recognition, Noise, Hidden Markov model, Feature extraction, Image

Metrics

Cited by: 260
FWCI (Field-Weighted Citation Impact): 16.88
References: 25
Citation Normalized Percentile: 1.00 (in top 1%, in top 10%)

Topics

Music and Audio Processing (Physical Sciences → Computer Science → Signal Processing)
Speech and Audio Processing (Physical Sciences → Computer Science → Signal Processing)
Speech Recognition and Synthesis (Physical Sciences → Computer Science → Artificial Intelligence)
© 2026 ScienceGate Book Chapters — All rights reserved.