JOURNAL ARTICLE

Cross-Lingual Pre-Training Based Transfer for Zero-Shot Neural Machine Translation

Baijun Ji, Zhirui Zhang, Xiangyu Duan, Min Zhang, Boxing Chen, Weihua Luo

Year: 2020   Journal: Proceedings of the AAAI Conference on Artificial Intelligence   Vol: 34 (01)   Pages: 115-122   Publisher: Association for the Advancement of Artificial Intelligence

Abstract

Transfer learning between different language pairs has shown its effectiveness for Neural Machine Translation (NMT) in low-resource scenarios. However, existing transfer methods involving a common target language are far from successful in the extreme scenario of zero-shot translation, due to the language space mismatch problem between the transferor (the parent model) and the transferee (the child model) on the source side. To address this challenge, we propose an effective transfer learning approach based on cross-lingual pre-training. Our key idea is to make all source languages share the same feature space and thus enable a smooth transition for zero-shot translation. To this end, we introduce one monolingual pre-training method and two bilingual pre-training methods to obtain a universal encoder for different languages. Once the universal encoder is constructed, the parent model built on this encoder is trained with large-scale annotated data and then directly applied in the zero-shot translation scenario. Experiments on two public datasets show that our approach significantly outperforms a strong pivot-based baseline and various multilingual NMT approaches.

Keywords:
Computer science; Machine translation; Encoder; Artificial intelligence; Transfer of learning; Zero (linguistics); Translation (biology); Natural language processing; Language model; Transfer (computing); Space (punctuation); Key (lock); Baseline (sea); Machine learning; Linguistics

Metrics

Cited By: 54
FWCI (Field Weighted Citation Impact): 5.51
Refs: 49
Citation Normalized Percentile: 0.96

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Zero-Shot Cross-Lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders

Guanhua Chen, Shuming Ma, Yun Chen, Li Dong, Dongdong Zhang, Jia Pan, Wenping Wang, Furu Wei

Journal: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing   Year: 2021   Pages: 15-26

JOURNAL ARTICLE

Towards Making the Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation

Guanhua Chen, Shuming Ma, Yun Chen, Dongdong Zhang, Jia Pan, Wenping Wang, Furu Wei

Journal: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)   Year: 2022

JOURNAL ARTICLE

Zero-Shot Neural Transfer for Cross-Lingual Entity Linking

Shruti Rijhwani, Jiateng Xie, Graham Neubig, Jaime Carbonell

Journal: Proceedings of the AAAI Conference on Artificial Intelligence   Year: 2019   Vol: 33 (01)   Pages: 6924-6931