JOURNAL ARTICLE

Ambient Sound Recognition using Convolutional Neural Networks

Abstract

Due to its many uses in areas including voice recognition, music analysis, and security systems, sound recognition has attracted a lot of attention. Convolutional neural networks (CNNs) have become a potent tool for sound recognition, producing cutting-edge outcomes in a variety of challenges. In this study, we will look at the architecture of CNNs, several training methods used to enhance their performance, and accuracy testing. The performance of the proposed sound recognition technique has been tested using 1000 audio files from the UrbanSounds8K dataset. The accuracy results obtained by using a CNN and Support Vector Machine (SVM) models were 95.6% and 93% respectively. These results portray the efficiency of using an advanced CNN architecture with five convolution layers and a versatile dataset like Urbansoundsd8K.

Keywords:
Convolutional neural network Computer science Speech recognition Convolution (computer science) Support vector machine Pattern recognition (psychology) Artificial intelligence Architecture Enhanced Data Rates for GSM Evolution Variety (cybernetics) Sound (geography) Artificial neural network

Metrics

5
Cited By
1.34
FWCI (Field Weighted Citation Impact)
17
Refs
0.77
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.