JOURNAL ARTICLE

Sound Events Recognition and Retrieval Using Multi-Convolutional-Channel Sparse Coding Convolutional Neural Networks

Chien-Yao WangTzu-Chiang TaiJia‐Ching WangAndri SantosoSeksan MathulaprangsanChin-Chin ChiangChung‐Hsien Wu

Year: 2020 Journal:   IEEE/ACM Transactions on Audio Speech and Language Processing Vol: 28 Pages: 1875-1887   Publisher: Institute of Electrical and Electronics Engineers

Abstract

This article proposes two novel deep convolutional neural networks (CNN), which are called the sparse coding convolutional neural network (SC-CNN) and the multi-convolutional-channel SC-CNN (MSC-CNN), to address the sound event recognition and retrieval problem. Unlike the general framework of a CNN, in which the feature learning process is performed hierarchically, the proposed framework models the whole memorization process in the human brain, including encoding, storage, and recollection. In particular, the MSC-CNN is designed to recognize multiple sound events that occur simultaneously. The experimental results indicate that the proposed SC-CNN and MSC-CNN outperforms the state-of-the-art systems in sound event recognition and retrieval.

Keywords:
Convolutional neural network Computer science Artificial intelligence Pattern recognition (psychology) Convolutional code Neural coding Speech recognition Deep learning Encoding (memory) Feature (linguistics) Decoding methods Algorithm

Metrics

17
Cited By
1.48
FWCI (Field Weighted Citation Impact)
40
Refs
0.82
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music Technology and Sound Studies
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.