Zero-shot Cross-lingual Transfer is Under-specified Optimization

Shijie Wu; Benjamin Van Durme; Mark Dredze

doi:10.18653/v1/2022.repl4nlp-1.25

ScienceGate Book Chapters

JOURNAL ARTICLE

Zero-shot Cross-lingual Transfer is Under-specified Optimization

Shijie Wu Benjamin Van Durme Mark Dredze

Year: 2022 Pages: 236-248

DOI: 10.18653/v1/2022.repl4nlp-1.25

Get Full-Text PDF Get Analytical Report

Abstract

Pretrained multilingual encoders enable zero-shot cross-lingual transfer, but often produce unreliable models that exhibit high performance variance on the target language. We postulate that this high variance results from zero-shot cross-lingual transfer solving an under-specified optimization problem. We show that any linear-interpolated model between the source language monolingual model and source + target bilingual model has equally low source language generalization error, yet the target language generalization error reduces smoothly and linearly as we move from the monolingual to bilingual model, suggesting that the model struggles to identify good solutions for both source and target languages using the source language alone. Additionally, we show that zero-shot solution lies in non-flat region of target language error generalization surface, causing the high variance.

Keywords:

Generalization Zero (linguistics) Computer science Variance (accounting) Transfer (computing) Encoder Language model Artificial intelligence Shot (pellet) Algorithm Natural language processing Speech recognition Mathematics Linguistics Mathematical analysis

Metrics

Cited By

1.57

FWCI (Field Weighted Citation Impact)

Refs

0.81

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Zero-shot Cross-lingual Transfer is Under-specified Optimization

Abstract

Metrics

Citation History

Topics

Related Documents

Generalization Measures for Zero-Shot Cross-Lingual Transfer

Self-Augmentation Improves Zero-Shot Cross-Lingual Transfer

Zero-Shot Cross-Lingual Transfer using Prefix-Based Adaptation

Zero-Shot Emotion Transfer for Cross-Lingual Speech Synthesis

Evaluating morphological typology in zero-shot cross-lingual transfer