Lexical-Constraint-Aware Neural Machine Translation via Data Augmentation

GuanHua Chen; Yun Chen; Yong Wang; Victor O. K. Li

doi:10.24963/ijcai.2020/496

JOURNAL ARTICLE

Year: 2020 Pages: 3587-3593

Abstract

Leveraging lexical constraints is important in domain-specific machine translation and interactive machine translation. Previous studies mainly focus on extending the beam search algorithm or augmenting the training corpus by replacing source phrases with their corresponding target translations. These methods either suffer from heavy computation cost during inference or depend on the quality of a bilingual dictionary that is pre-specified by the user or constructed with statistical machine translation. In response to these problems, we present a conceptually simple and empirically effective data augmentation approach for lexically constrained neural machine translation. Specifically, we build constraint-aware training data by first randomly sampling phrases of the reference as constraints and then packing them together into the source sentence with a separation symbol. Extensive experiments on several language pairs demonstrate that our approach achieves superior translation results over existing systems, improving translation of constrained sentences without hurting unconstrained ones.
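The augmentation step described in the abstract (sample phrases from the reference, then append them to the source behind a separation symbol) can be illustrated with a minimal sketch. This is not the authors' implementation: the separator token `<sep>`, the function names, the limits on the number and length of sampled phrases, and the choice to skip overlapping phrases are all assumptions made for illustration, since the abstract does not specify them.

```python
import random

SEP = "<sep>"  # hypothetical separator token; the abstract does not name the symbol


def sample_constraints(target_tokens, max_constraints=3, max_phrase_len=3, rng=random):
    """Randomly sample non-overlapping phrases from the reference translation.

    max_constraints and max_phrase_len are illustrative assumptions.
    """
    n = len(target_tokens)
    # Drawing 0 constraints leaves some training examples unconstrained.
    num = rng.randint(0, max_constraints)
    taken = set()
    phrases = []
    for _ in range(num):
        if n == 0:
            break
        length = rng.randint(1, min(max_phrase_len, n))
        start = rng.randint(0, n - length)
        span = range(start, start + length)
        if any(i in taken for i in span):
            continue  # assumption: drop candidates that overlap an earlier phrase
        taken.update(span)
        phrases.append(target_tokens[start:start + length])
    return phrases


def augment(source_tokens, target_tokens, **kwargs):
    """Pack sampled constraints into the source sentence, each behind SEP."""
    constraints = sample_constraints(target_tokens, **kwargs)
    packed = list(source_tokens)
    for phrase in constraints:
        packed.append(SEP)
        packed.extend(phrase)
    return packed, constraints


if __name__ == "__main__":
    src = "das ist ein kleines Haus".split()
    tgt = "this is a small house".split()
    packed, constraints = augment(src, tgt, rng=random.Random(0))
    print(" ".join(packed))
```

In this sketch the target side of the training pair is left unchanged, so the model would see the constraint phrases in the input and the full reference as the output. At inference time, user-supplied constraints would be appended to the source in the same format.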

Keywords:

Machine translation; Computer science; Transfer-based machine translation; Artificial intelligence; Example-based machine translation; Constraint (computer-aided design); Translation (biology); Natural language processing; Inference; Sentence; Lexical choice; Lexical item; Mathematics

Metrics

FWCI (Field Weighted Citation Impact): 5.58

Citation Normalized Percentile: 0.96

Topics

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Syntax-Aware Data Augmentation for Neural Machine Translation

Data augmentation using back-translation for context-aware neural machine translation

Importance-Aware Data Augmentation for Document-Level Neural Machine Translation

Improving Lexical-Constraint-Aware Machine Translation by Factoring Encoders

Uncertainty-Aware Semantic Augmentation for Neural Machine Translation