Accented Speech Recognition With Accent-specific Codebooks

Darshan Prabhu; Preethi Jyothi; Sriram Ganapathy; V. S. Unni

doi:10.18653/v1/2023.emnlp-main.444

ScienceGate Book Chapters

JOURNAL ARTICLE

Accented Speech Recognition With Accent-specific Codebooks

Darshan Prabhu Preethi Jyothi Sriram Ganapathy V. S. Unni

Year: 2023 Pages: 7175-7188

DOI: 10.18653/v1/2023.emnlp-main.444

Get Full-Text PDF Get Analytical Report

Abstract

Speech accents pose a significant challenge to state-of-the-art automatic speech recognition (ASR) systems. Degradation in performance across underrepresented accents is a severe deterrent to the inclusive adoption of ASR. In this work, we propose a novel accent adaptation approach for end-to-end ASR systems using cross-attention with a trainable set of codebooks. These learnable codebooks capture accent-specific information and are integrated within the ASR encoder layers. The model is trained on accented English speech, while the test data also contained accents which were not seen during training. On the Mozilla Common Voice multi-accented dataset, we show that our proposed approach yields significant performance gains not only on the seen English accents (up to 37% relative improvement in word error rate) but also on the unseen accents (up to 5% relative improvement in WER). Further, we illustrate benefits for a zero-shot transfer setup on the L2Artic dataset. We also compare the performance with other approaches based on accent adversarial training.

Keywords:

Stress (linguistics) Computer science Speech recognition Word error rate Encoder Set (abstract data type) Adaptation (eye) Artificial intelligence Training set Natural language processing Psychology

Metrics

Cited By

1.02

FWCI (Field Weighted Citation Impact)

Refs

0.77

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Accented Speech Recognition With Accent-specific Codebooks

Abstract

Metrics

Citation History

Topics

Related Documents

Fast accent identification and accented speech recognition

Exploring Accent Similarity for Cross-Accented Speech Recognition

Accent detection and speech recognition for Shanghai-accented Mandarin

Partial change accent models for accented Mandarin speech recognition

Accented Speech and Accent Addition Training