Sound Events Recognition and Retrieval Using Multi-Convolutional-Channel Sparse Coding Convolutional Neural Networks

Chien-Yao Wang; Tzu-Chiang Tai; Jia‐Ching Wang; Andri Santoso; Seksan Mathulaprangsan; Chin-Chin Chiang; Chung‐Hsien Wu

doi:10.1109/taslp.2020.2964959

ScienceGate Book Chapters

JOURNAL ARTICLE

Sound Events Recognition and Retrieval Using Multi-Convolutional-Channel Sparse Coding Convolutional Neural Networks

Chien-Yao Wang Tzu-Chiang Tai Jia‐Ching Wang Andri Santoso Seksan Mathulaprangsan Chin-Chin Chiang Chung‐Hsien Wu

Year: 2020 Journal: IEEE/ACM Transactions on Audio Speech and Language Processing Vol: 28 Pages: 1875-1887 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/taslp.2020.2964959

Get Full-Text PDF Get Analytical Report

Abstract

This article proposes two novel deep convolutional neural networks (CNN), which are called the sparse coding convolutional neural network (SC-CNN) and the multi-convolutional-channel SC-CNN (MSC-CNN), to address the sound event recognition and retrieval problem. Unlike the general framework of a CNN, in which the feature learning process is performed hierarchically, the proposed framework models the whole memorization process in the human brain, including encoding, storage, and recollection. In particular, the MSC-CNN is designed to recognize multiple sound events that occur simultaneously. The experimental results indicate that the proposed SC-CNN and MSC-CNN outperforms the state-of-the-art systems in sound event recognition and retrieval.

Keywords:

Convolutional neural network Computer science Artificial intelligence Pattern recognition (psychology) Convolutional code Neural coding Speech recognition Deep learning Encoding (memory) Feature (linguistics) Decoding methods Algorithm

Metrics

Cited By

1.48

FWCI (Field Weighted Citation Impact)

Refs

0.82

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music Technology and Sound Studies

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Sound Events Recognition and Retrieval Using Multi-Convolutional-Channel Sparse Coding Convolutional Neural Networks

Abstract

Metrics

Citation History

Topics

Related Documents

Recognition and retrieval of sound events using sparse coding convolutional neural network

Convolutional neural networks analyzed via convolutional sparse coding

Ambient Sound Recognition using Convolutional Neural Networks

Robust sound event recognition using convolutional neural networks

Lightweight Environmental Sound Recognition Using Convolutional Neural Networks