JOURNAL ARTICLE

Towards Language-Universal End-to-End Speech Recognition

Abstract

Building speech recognizers in multiple languages typically involves replicating a monolingual training recipe for each language, or utilizing a multi-task learning approach where models for different languages have separate output labels but share some internal parameters. In this work, we exploit recent progress in end-to-end speech recognition to create a single multilingual speech recognition system capable of recognizing any of the languages seen in training. To do so, we propose the use of a universal character set that is shared among all languages. We also create a language-specific gating mechanism within the network that can modulate the network's internal representations in a language-specific way. We evaluate our proposed approach on the Microsoft Cortana task across three languages and show that our system outperforms both the individual monolingual systems and systems built with a multi-task learning approach. We also show that this model can be used to initialize a monolingual speech recognizer, and can be used to create a bilingual model for use in code-switching scenarios.
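The two ideas named in the abstract, a character set shared across languages and a gate conditioned on the language, can be illustrated with a minimal sketch. It is not the authors' implementation. The function names, the toy transcripts, and the specific gate form (a sigmoid over a linear function of the hidden vector plus a per-language offset, multiplied elementwise into the hidden vector) are assumptions made for illustration only.

```python
import math


def build_universal_charset(transcripts_by_lang):
    """Map every character seen in any language to one shared output index."""
    chars = set()
    for transcripts in transcripts_by_lang.values():
        for text in transcripts:
            chars.update(text)
    return {ch: i for i, ch in enumerate(sorted(chars))}


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def language_gate(h, lang_id, U, V, b):
    """Gate one hidden vector h (length H) for the language with index lang_id.

    U is H x H, V is H x L (one column per language), b has length H.
    Assumed form: g = sigmoid(U h + V[:, lang_id] + b), output = g * h.
    """
    out = []
    for j in range(len(h)):
        pre = sum(U[j][k] * h[k] for k in range(len(h))) + V[j][lang_id] + b[j]
        out.append(sigmoid(pre) * h[j])
    return out


# Toy data (hypothetical, not taken from the paper).
charset = build_universal_charset(
    {"en": ["hello"], "de": ["grüße"], "es": ["señor"]}
)

h = [1.0, -2.0]
U = [[0.0, 0.0], [0.0, 0.0]]
V = [[0.0, 5.0, -5.0], [0.0, 5.0, -5.0]]  # columns: language 0, 1, 2
b = [0.0, 0.0]

gated_lang0 = language_gate(h, 0, U, V, b)  # gate is 0.5 for every unit
gated_lang1 = language_gate(h, 1, U, V, b)  # gate near 1, h passes almost unchanged
gated_lang2 = language_gate(h, 2, U, V, b)  # gate near 0, h is mostly suppressed
```

In this sketch the same hidden vector is scaled differently depending on the language index, which is one simple way a shared network could modulate its internal representations per language while still emitting labels from a single character inventory.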

Keywords:
Computer science; Task (project management); Natural language processing; Set (abstract data type); Artificial intelligence; Exploit; Language model; Speech recognition; End-to-end principle; Code (set theory); Programming language

Metrics

Cited By: 53
FWCI (Field Weighted Citation Impact): 6.75
Refs: 29
Citation Normalized Percentile: 0.97
Is in top 1%
Is in top 10%

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and dialogue systems
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Towards End-to-End Unsupervised Speech Recognition

Alexander H. Liu, Wei-Ning Hsu, Michael Auli, Alexei Baevski

Journal:   2022 IEEE Spoken Language Technology Workshop (SLT) Year: 2023 Pages: 221-228
JOURNAL ARTICLE

End-to-End Speech Recognition of Tamil Language

Mohamed Hashim Changrampadi, A. Shahina, Menaka Narayanan, A. Nayeemulla Khan

Journal:   Intelligent Automation & Soft Computing Year: 2021 Vol: 32 (2) Pages: 1309-1323
JOURNAL ARTICLE

Towards End-to-End Speech Recognition System for Pashto Language Using Transformer Model

Munazza Sher, Nasir Ahmad, Madiha Sher

Journal:   International Journal of Innovations in Science and Technology Year: 2024 Pages: 115-131
JOURNAL ARTICLE

Towards end-to-end speech recognition with transfer learning

Chu-Xiong Qin, Dan Qu, Lianhai Zhang

Journal:   EURASIP Journal on Audio Speech and Music Processing Year: 2018 Vol: 2018 (1)