Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting

Iván López‐Espejo; Ram C. M. C. Shekar; Zheng‐Hua Tan; Jesper Jensen; John H. L. Hansen

doi:10.1109/icassp49357.2023.10095436

JOURNAL ARTICLE

Year: 2023 Pages: 1-5

Abstract

In the context of keyword spotting (KWS), the replacement of handcrafted speech features by learnable features has not yielded superior KWS performance. In this study, we demonstrate that filterbank learning outperforms handcrafted speech features for KWS whenever the number of filterbank channels is severely decreased. Reducing the number of channels might cause some drop in KWS performance, but it also substantially reduces energy consumption, which is key when deploying common always-on KWS on low-resource devices. Experimental results on a noisy version of the Google Speech Commands Dataset show that filterbank learning adapts to noise characteristics to provide a higher degree of robustness to noise, especially when dropout is integrated. Thus, switching from the typically used 40-channel log-Mel features to 8-channel learned features leads to a relative KWS accuracy loss of only 3.5% while simultaneously achieving a 6.3× energy consumption reduction.
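To make the contrast between the two front-ends concrete, the following is a minimal NumPy sketch, not the authors' implementation. It builds a conventional 40-channel triangular Mel filterbank and an 8-channel one, and applies each to a power spectrogram followed by log compression. The function names, the 512-point FFT, the 16 kHz sampling rate, the random stand-in spectrogram, and the choice to clamp weights to be non-negative are all assumptions made for illustration. The abstract does not specify how the learned filterbank is parameterized or initialized. In a learned front-end, the weight matrix would be a trainable parameter updated jointly with the KWS model rather than a fixed array.

```python
import numpy as np


def hz_to_mel(f):
    """Convert frequency in Hz to the Mel scale (common HTK-style formula)."""
    return 2595.0 * np.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_filterbank(n_channels, n_fft=512, sample_rate=16000):
    """Triangular Mel filterbank of shape (n_channels, n_fft // 2 + 1)."""
    n_bins = n_fft // 2 + 1
    bin_freqs = np.linspace(0.0, sample_rate / 2.0, n_bins)
    # n_channels + 2 edge frequencies, equally spaced on the Mel scale.
    edges = mel_to_hz(
        np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_channels + 2)
    )
    fb = np.zeros((n_channels, n_bins))
    for k in range(n_channels):
        lo, mid, hi = edges[k], edges[k + 1], edges[k + 2]
        rising = (bin_freqs - lo) / (mid - lo)
        falling = (hi - bin_freqs) / (hi - mid)
        # Triangle: minimum of the two slopes, floored at zero.
        fb[k] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb


def log_filterbank_features(power_spec, weights, eps=1e-6):
    """Apply a filterbank to a power spectrogram and take the log.

    power_spec: (n_frames, n_bins); weights: (n_channels, n_bins).
    """
    # Assumption: clamp weights to be non-negative so the log stays defined
    # even if training were to push some weights below zero.
    w = np.maximum(weights, 0.0)
    return np.log(power_spec @ w.T + eps)


rng = np.random.default_rng(0)
power_spec = rng.random((100, 257))  # stand-in for |STFT|^2, not real speech

# Handcrafted baseline: fixed 40-channel log-Mel features.
handcrafted = log_filterbank_features(power_spec, mel_filterbank(40))

# Compact front-end: 8 channels. Here the weights are only a Mel-shaped
# starting point; a learned filterbank would update them during training.
learned_init = mel_filterbank(8)
compact = log_filterbank_features(power_spec, learned_init)

print(handcrafted.shape, compact.shape)
```

Going from 40 to 8 channels shrinks the per-frame feature vector by a factor of five, which is one reason a smaller filterbank can reduce downstream computation. The 6.3× energy figure reported in the abstract is the authors' own result and is not derived from this sketch.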

Keywords: Keyword spotting; Computer science; Filter bank; Speech recognition; Robustness (evolution); Energy consumption; Artificial intelligence; Noise reduction; Filter (signal processing); Computer vision; Engineering

Metrics

FWCI (Field Weighted Citation Impact): 0.51

Citation Normalized Percentile: 0.64

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Related Documents

Joint Framework of Curriculum Learning and Knowledge Distillation for Noise-Robust and Small-Footprint Keyword Spotting

Exploring representation learning for small-footprint keyword spotting

Deep Residual Learning for Small-Footprint Keyword Spotting

DCCRN-KWS: An Audio Bias Based Model for Noise Robust Small-Footprint Keyword Spotting

Small Footprint Multi-channel Keyword Spotting