Explicit Alignment Learning for Neural Machine Translation

Zuchao Li; Hai Zhao; Fengshun Xiao; Masao Utiyama; Eiichiro Sumita

doi:10.24963/ijcai.2022/587

JOURNAL ARTICLE

Year: 2022; Journal: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence; Pages: 4230-4237

Abstract

Even though neural machine translation (NMT) has become the state-of-the-art solution for end-to-end translation, it still suffers from a lack of translation interpretability, which may be conveniently enhanced by explicit alignment learning (EAL), as performed in traditional statistical machine translation (SMT). To provide the benefits of both NMT and SMT, this paper presents a novel model design that enhances NMT with an additional training process for EAL alongside the end-to-end translation training. We propose two approaches: an explicit alignment learning approach, and a variant in which we further remove the need for the additional alignment model and perform embedding mixup with the alignment based on encoder-decoder attention weights in the NMT model. We conducted experiments on both small-scale (IWSLT14 De->En and IWSLT13 Fr->En) and large-scale (WMT14 En->De, En->Fr, and WMT17 Zh->En) benchmarks. Evaluation results show that our EAL methods significantly outperformed strong baseline methods, demonstrating the effectiveness of EAL. Further explorations show that the translation improvements are due to a better spatial alignment of the source and target language embeddings. Our method improves translation performance without increasing model parameters or training data, which verifies that incorporating SMT techniques into NMT is worthwhile.
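As a rough illustration of the embedding mixup idea mentioned in the abstract, the following Python sketch derives a hard alignment from an attention matrix and interpolates each target embedding with its aligned source embedding. The function names, the argmax alignment rule, and the mixing coefficient lam are assumptions made for illustration only; the abstract does not specify these details, and this is not the authors' implementation.

```python
def align_from_attention(attn):
    """Assumed rule: align each target position to the source position
    that receives the highest attention weight (argmax per row)."""
    return [max(range(len(row)), key=row.__getitem__) for row in attn]


def mixup_embeddings(tgt_emb, src_emb, alignment, lam=0.5):
    """Interpolate each target embedding with its aligned source embedding.

    lam is a hypothetical mixing coefficient: lam=1.0 keeps the target
    embedding unchanged, lam=0.0 replaces it with the source embedding.
    """
    mixed = []
    for t_vec, s_idx in zip(tgt_emb, alignment):
        s_vec = src_emb[s_idx]
        mixed.append([lam * t + (1.0 - lam) * s for t, s in zip(t_vec, s_vec)])
    return mixed


# Toy data: 2 target tokens, 3 source tokens, 2-dimensional embeddings.
# Rows of attn are target positions, columns are source positions.
attn = [[0.1, 0.7, 0.2],
        [0.6, 0.3, 0.1]]
src_emb = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
tgt_emb = [[2.0, 2.0], [4.0, 0.0]]

alignment = align_from_attention(attn)
mixed = mixup_embeddings(tgt_emb, src_emb, alignment, lam=0.5)
```

In this toy setup the first target token aligns to the second source token and the second target token to the first. In a real NMT model the attention weights would come from the encoder-decoder attention layers, and how they are aggregated across heads and layers is a further design choice not covered here.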

Keywords:

Machine translation; Computer science; Interpretability; Translation (biology); Artificial intelligence; Embedding; Machine learning; Encoder; Process (computing); Natural language processing; Programming language

Metrics

FWCI (Field Weighted Citation Impact): 0.12

Citation Normalized Percentile: 0.26

Topics

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Neural Machine Translation With Explicit Phrase Alignment

Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance

Improving neural machine translation with sentence alignment learning

Alignment-Based Neural Machine Translation

Explicit Sentence Compression for Neural Machine Translation