Abstract

Abstract: We propose a model to classify environmental sounds such as People Sounds, Vehicles Sounds, Siren Sounds, Horn, Engine Sounds. We perform Data Augmentation techniques to extract best features from the given audio to classify which class of sound. Our deep convolutional neural network architecture uses stacked convolutional and pooling layers to extract highlevel feature representations from spectrogram-like features from the given input.

Keywords:
Spectrogram Convolutional neural network Pooling Computer science Speech recognition Sound (geography) Feature (linguistics) Artificial intelligence Pattern recognition (psychology) Deep learning Acoustics Linguistics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
15
Refs
0.08
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Vehicle Noise and Vibration Control
Physical Sciences →  Engineering →  Automotive Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.