JOURNAL ARTICLE

Integrating Source-Channel and Attention-Based Sequence-to-Sequence Models for Speech Recognition

Qiujia LiChao ZhangPhilip C. Woodland

Year: 2019 Journal:   2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) Pages: 39-46

Abstract

This paper proposes a novel automatic speech recognition (ASR) framework called Integrated Source-Channel and Attention (ISCA) that combines the advantages of traditional systems based on the noisy source-channel model (SC) and end-to-end style systems using attention-based sequence-to-sequence models. The traditional SC system framework includes hidden Markov models and connectionist temporal classification (CTC) based acoustic models, language models (LMs), and a decoding procedure based on a lexicon, whereas the end-to-end style attention-based system jointly models the whole process with a single model. By rescoring the hypotheses produced by traditional systems using end-to-end style systems based on an extended noisy source-channel model, ISCA allows structured knowledge to be easily incorporated via the SC-based model while exploiting the complementarity of the attention-based model. Experiments on the AMI meeting corpus show that ISCA is able to give a relative word error rate reduction up to 21% over an individual system, and by 13% over an alternative method which also involves combining CTC and attention-based models.

Keywords:
Computer science Connectionism Hidden Markov model Language model Speech recognition Decoding methods Channel (broadcasting) Sequence (biology) Artificial intelligence Word error rate Sequence labeling Lexicon Complementarity (molecular biology) Natural language processing Artificial neural network Algorithm Task (project management)

Metrics

16
Cited By
2.00
FWCI (Field Weighted Citation Impact)
90
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.