JOURNAL ARTICLE

Boosting Neural Machine Translation with Similar Translations

Abstract

This paper explores data augmentation methods for training Neural Machine Translation models to make use of similar translations, in a way comparable to how a human translator employs fuzzy matches. In particular, we show how we can simply present the neural model with information from both the source and target sides of the fuzzy matches. We also extend the notion of similarity to include semantically related translations retrieved using distributed sentence representations. We show that translations based on fuzzy matching provide the model with "copy" information, while translations based on embedding similarities tend to extend the translation "context". Results indicate that the effects of both kinds of similar sentences add up to further boost accuracy, combine naturally with model fine-tuning, and provide dynamic adaptation for unseen translation pairs. Tests on multiple data sets and domains show consistent accuracy improvements. To foster research around these techniques, we also release an open-source toolkit with an efficient and flexible fuzzy-match implementation.
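
As a rough illustration of the fuzzy-match augmentation idea described in the abstract, the sketch below retrieves the most similar source sentence from a small translation memory and appends its target side to the input. It is not the released toolkit: the `<sep>` token, the 0.5 threshold, the function names, the choice to append only the matched target, and the use of Python's `difflib` ratio as the similarity score are all assumptions made for the example. Retrieval by sentence embeddings is not covered here.

```python
from difflib import SequenceMatcher

# Hypothetical separator token; the abstract does not specify the input format.
SEP = "<sep>"


def fuzzy_score(a, b):
    """Token-level similarity in [0, 1].

    Uses difflib's ratio as a stand-in for an edit-distance-based
    fuzzy match score (an assumption, not the paper's definition).
    """
    return SequenceMatcher(None, a.split(), b.split()).ratio()


def best_fuzzy_match(source, memory, threshold=0.5):
    """Return (score, tm_source, tm_target) for the best match, or None.

    `memory` is a list of (source, target) pairs; matches scoring
    below `threshold` are ignored.
    """
    best = None
    for tm_src, tm_tgt in memory:
        score = fuzzy_score(source, tm_src)
        if score >= threshold and (best is None or score > best[0]):
            best = (score, tm_src, tm_tgt)
    return best


def augment(source, memory, threshold=0.5):
    """Append the target side of the best fuzzy match to the source.

    The source side of the match is used only for retrieval here.
    If no match passes the threshold, the source is returned unchanged.
    """
    match = best_fuzzy_match(source, memory, threshold)
    if match is None:
        return source
    _, _, tm_tgt = match
    return f"{source} {SEP} {tm_tgt}"


memory = [
    ("how are you ?", "comment allez-vous ?"),
    ("the cat sleeps", "le chat dort"),
]
augmented = augment("how are you today ?", memory)
unchanged = augment("completely unrelated sentence", memory)
```

In this toy setup, `augmented` carries the retrieved French translation after the separator, which is the kind of "copy" signal the abstract refers to, while `unchanged` is left as the plain source because nothing in the memory is similar enough.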

Keywords:
Computer science, Machine translation, Artificial intelligence, Embedding, Boosting (machine learning), Sentence, Translation (biology), Fuzzy logic, Matching (statistics), Transfer-based machine translation, Natural language processing, Similarity (geometry), Approximate string matching, Context (archaeology), Machine learning, Example-based machine translation, Pattern matching, Mathematics

Metrics

Cited By: 58
FWCI (Field Weighted Citation Impact): 6.61
Refs: 29
Citation Normalized Percentile: 0.97

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Evaluating the Impact of Integrating Similar Translations into Neural Machine Translation

Arda Tezcan, Bram Bulté

Journal: Information Year: 2022 Vol: 13 (1) Pages: 19-19

BOOK-CHAPTER

Learning to Reuse Translations: Guiding Neural Machine Translation with Examples

Qian Cao, Shaohui Kuang, Deyi Xiong

Frontiers in artificial intelligence and applications Year: 2020

JOURNAL ARTICLE

Explicitly Modeling Word Translations in Neural Machine Translation

Dong Seog Han, Junhui Li, Yachao Li, Min Zhang, Guodong Zhou

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing Year: 2019 Vol: 19 (1) Pages: 1-17