JOURNAL ARTICLE

A Transformer-Based End-to-End Automatic Speech Recognition Algorithm

Fang DongYiyang QianTianlei WangPeng LiuJiuwen Cao

Year: 2023 Journal:   IEEE Signal Processing Letters Vol: 30 Pages: 1592-1596   Publisher: Institute of Electrical and Electronics Engineers

Abstract

End-to-End (E2E) automatic speech recognition (ASR) becomes popular recent years and has been widely used in many applications. However, current ASR algorithms are usually less effective when applied in specific applications with terminologies such as medical and economic fields. To address this issue, we propose a powerful Transformer based ASR decoding method for beam searching, called soft beam pruning algorithm (SBPA). SBPA can dynamically adjust the width of beam search. Meanwhile, a prefix module (PM) is added to access the contextual information and avoid removing professional words in the beam search. Combining SBPA and PM, the proposed ASR can achieve promising recognition performance on professional terminologies. To verify the effectiveness, experiments are conducted on real-world conversation data with medical terminology. It is shown that the proposed ASR achieved significant performance on both professional and regular words.

Keywords:
Computer science Decoding methods End-to-end principle Conversation Transformer Prefix Terminology Beam search Speech recognition Artificial intelligence Algorithm Search algorithm Engineering

Metrics

3
Cited By
0.77
FWCI (Field Weighted Citation Impact)
37
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.