JOURNAL ARTICLE

Improving Transformer Based End-to-End Code-Switching Speech Recognition Using Language Identification

Zheying HuangPei WangJian WangHaoran MiaoXu JiPengyuan Zhang

Year: 2021 Journal:   Applied Sciences Vol: 11 (19)Pages: 9106-9106   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

A Recurrent Neural Networks (RNN) based attention model has been used in code-switching speech recognition (CSSR). However, due to the sequential computation constraint of RNN, there are stronger short-range dependencies and weaker long-range dependencies, which makes it hard to immediately switch languages in CSSR. Firstly, to deal with this problem, we introduce the CTC-Transformer, relying entirely on a self-attention mechanism to draw global dependencies and adopting connectionist temporal classification (CTC) as an auxiliary task for better convergence. Secondly, we proposed two multi-task learning recipes, where a language identification (LID) auxiliary task is learned in addition to the CTC-Transformer automatic speech recognition (ASR) task. Thirdly, we study a decoding strategy to combine the LID into an ASR task. Experiments on the SEAME corpus demonstrate the effects of the proposed methods, achieving a mixed error rate (MER) of 30.95%. It obtains up to 19.35% relative MER reduction compared to the baseline RNN-based CTC-Attention system, and 8.86% relative MER reduction compared to the baseline CTC-Transformer system.

Keywords:
Computer science Transformer Connectionism End-to-end principle Recurrent neural network Speech recognition Decoding methods Artificial intelligence Language model Artificial neural network Algorithm Engineering

Metrics

8
Cited By
0.56
FWCI (Field Weighted Citation Impact)
25
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.