Yixin Ji, Jikai Wang, Juntao Li, Hai Ye, Min Zhang
With the development of multilingual pre-trained language models (mPLMs), zero-shot cross-lingual transfer shows great potential. To further improve cross-lingual transfer performance, many studies have explored the representation misalignment caused by morphological differences, but have neglected the misalignment caused by the anisotropic distribution of contextual representations. In this work, we propose enhanced isotropy and constrained code-switching for zero-shot cross-lingual transfer, which alleviate the misalignment caused by anisotropic representations while preserving syntactic structural knowledge. Extensive experiments on three zero-shot cross-lingual transfer tasks demonstrate that our method achieves significant improvements over strong mPLM backbones and further improves on state-of-the-art methods.
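The anisotropy the abstract refers to can be made concrete with a standard diagnostic: the average pairwise cosine similarity of contextual representations, which is near 1 when embeddings collapse into a narrow cone and near 0 when the space is isotropic. The sketch below is illustrative only, not the authors' method: it uses synthetic NumPy vectors as a stand-in for mPLM encoder outputs, and mean-centering as a simple, well-known isotropy-enhancing baseline.

```python
import numpy as np

def avg_cosine_similarity(embeddings: np.ndarray) -> float:
    """Mean pairwise cosine similarity; values near 1 indicate a
    highly anisotropic (narrow-cone) space, values near 0 an
    isotropic one."""
    normed = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sims = normed @ normed.T
    n = len(embeddings)
    # Exclude the diagonal (self-similarity) from the average.
    return (sims.sum() - n) / (n * (n - 1))

# Stand-in for contextual representations from an mPLM encoder;
# a shared mean offset mimics the anisotropy observed in practice.
rng = np.random.default_rng(0)
reps = rng.normal(size=(512, 768)) + 5.0  # common offset -> anisotropy
print(f"anisotropy (avg cos-sim): {avg_cosine_similarity(reps):.3f}")

# A simple isotropy-enhancing baseline: mean-centering removes the
# shared offset, pushing the average cosine similarity toward 0.
centered = reps - reps.mean(axis=0, keepdims=True)
print(f"after mean-centering:     {avg_cosine_similarity(centered):.3f}")
```

Mean-centering is only a baseline calibration; the paper's contribution lies in stronger isotropy enhancement combined with constrained code-switching, which this sketch does not attempt to reproduce.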