Unsupervised Domain Adaptation of a Pretrained Cross-Lingual Language Model

Juntao Li; Ruidan He; Hai Ye; Hwee Tou Ng; Lidong Bing; Rui Yan

doi:10.24963/ijcai.2020/508

JOURNAL ARTICLE

Year: 2020; Pages: 3672-3678

Abstract

Recent research indicates that pretraining cross-lingual language models on large-scale unlabeled texts yields significant performance improvements over various cross-lingual and low-resource tasks. Through training on one hundred languages and terabytes of texts, cross-lingual language models have proven to be effective in leveraging high-resource languages to enhance low-resource language processing and outperform monolingual models. In this paper, we further investigate the cross-lingual and cross-domain (CLCD) setting when a pretrained cross-lingual language model needs to adapt to new domains. Specifically, we propose a novel unsupervised feature decomposition method that can automatically extract domain-specific features and domain-invariant features from the entangled pretrained cross-lingual representations, given unlabeled raw texts in the source language. Our proposed model leverages mutual information estimation to decompose the representations computed by a cross-lingual model into domain-invariant and domain-specific parts. Experimental results show that our proposed method achieves significant performance improvements over the state-of-the-art pretrained cross-lingual language model in the CLCD setting.
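The abstract describes splitting a pretrained representation into domain-invariant and domain-specific parts with the help of mutual information estimation, but it does not name the estimator or the network layout. The following is a minimal sketch under stated assumptions, not the authors' implementation: it assumes two linear projections for the split and a Jensen-Shannon style estimate computed from critic scores. The names `decompose`, `jsd_mi_lower_bound`, `w_inv`, and `w_spec` are hypothetical.

```python
import math
import random


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def linear(weights, vec):
    """Apply a weight matrix (a list of rows) to a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in weights]


def dot(a, b):
    """Dot product, used here as a very simple critic."""
    return sum(x * y for x, y in zip(a, b))


def decompose(h, w_inv, w_spec):
    """Project one representation into two parts.

    Assumption: plain linear maps stand in for whatever feature
    extractors the paper actually uses.
    """
    return linear(w_inv, h), linear(w_spec, h)


def jsd_mi_lower_bound(pos_scores, neg_scores):
    """Jensen-Shannon style mutual information estimate.

    pos_scores: critic outputs on pairs taken from the same example.
    neg_scores: critic outputs on pairs taken from different examples.
    The value is at most 0 and equals -2*log(2) for an uninformative critic.
    """
    pos = sum(-softplus(-s) for s in pos_scores) / len(pos_scores)
    neg = sum(softplus(s) for s in neg_scores) / len(neg_scores)
    return pos - neg


if __name__ == "__main__":
    random.seed(0)
    dim, out_dim, n = 8, 4, 16
    # Random stand-ins for pretrained cross-lingual sentence representations.
    reps = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n)]
    w_inv = [[random.gauss(0, 0.3) for _ in range(dim)] for _ in range(out_dim)]
    w_spec = [[random.gauss(0, 0.3) for _ in range(dim)] for _ in range(out_dim)]

    parts = [decompose(h, w_inv, w_spec) for h in reps]
    # Positive pairs: both parts come from the same input.
    pos = [dot(inv, spec) for inv, spec in parts]
    # Negative pairs: the specific part comes from the next input.
    neg = [dot(parts[i][0], parts[(i + 1) % n][1]) for i in range(n)]
    print("MI estimate between the two parts:", jsd_mi_lower_bound(pos, neg))
```

In a trainable version, an estimate like this could serve as a loss term over the projection weights; whether it is maximized or minimized, and between which pairs of features, depends on details the abstract does not give.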

Keywords:

Computer science; Domain adaptation; Terabyte; Language model; Artificial intelligence; Natural language processing; Domain (mathematical analysis); Adaptation (eye); Feature (linguistics); Linguistics

Metrics

FWCI (Field Weighted Citation Impact): 2.64

Citation Normalized Percentile: 0.91

Topics

Topic Modeling (Physical Sciences → Computer Science → Artificial Intelligence)

Natural Language Processing Techniques (Physical Sciences → Computer Science → Artificial Intelligence)

Multimodal Machine Learning Applications (Physical Sciences → Computer Science → Computer Vision and Pattern Recognition)
