JOURNAL ARTICLE

Low-Latency Convolutional Recurrent Neural Network for Keyword Spotting

Abstract

A Low-latency Convolutional Recurrent Neural Network (L-CRNN) is proposed to reduce the complexity of a Keyword Spotting (KWS) system with high accuracy. The L-CRNN reduces a number of parameters between RNN layer and Full-Connected (FC) layer, which saves at least 1/2 memory for on-hands device compared with Convolutional Recurrent Neural Network (CRNN) depending on the number of FC units. Furthermore, it learns valid deep audio features to classify the keywords and garbage words with high accuracy. Results of experiments on the Google's Speech Commands Datasets show that the L-CRNN achieves 96.17% accuracy with less than 1/4 number of parameters and fewer float operations compared with Convolutional Neural Network (CNN) and CRNN.

Keywords:
Recurrent neural network Computer science Latency (audio) Keyword spotting Convolutional neural network Artificial intelligence Spotting Speech recognition Artificial neural network Pattern recognition (psychology) Telecommunications

Metrics

3
Cited By
0.40
FWCI (Field Weighted Citation Impact)
13
Refs
0.70
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.