Machine translation (MT) involving Indigenous languages, including endangered ones, is challenging primarily due to the lack of sufficient parallel data. We describe an approach that exploits bilingual and multilingual pretrained MT models in a transfer learning setting to translate from Spanish into ten South American Indigenous languages. Our models set a new state of the art (SOTA) on five of the ten language pairs we consider, even doubling performance on one of these five pairs. Unlike previous SOTA systems, which perform data augmentation to enlarge the training sets, we retain the low-resource setting to test the effectiveness of our models under this constraint. Despite the scarcity of linguistic information available about these Indigenous languages, we offer a number of quantitative and qualitative analyses (e.g., of morphology, tokenization, and orthography) to contextualize our results.