BOOK-CHAPTER

Refinement of Unsupervised Cross-Lingual Word Embeddings

Abstract

Cross-lingual word embeddings aim to bridge the gap between high-resource and low-resource languages by allowing to learn multilingual word representations even without using any direct bilingual signal.The lion's share of the methods are projectionbased approaches that map pre-trained embeddings into a shared latent space.These methods are mostly based on the orthogonal transformation, which assumes language vector spaces to be isomorphic.However, this criterion does not necessarily hold, especially for morphologically-rich languages.In this paper, we propose a selfsupervised method to refine the alignment of unsupervised bilingual word embeddings.The proposed model moves vectors of words and their corresponding translations closer to each other as well as enforces length-and center-invariance, thus allowing to better align cross-lingual embeddings.The experimental results demonstrate the effectiveness of our approach, as in most cases it outperforms stateof-the-art methods in a bilingual lexicon induction task.

Keywords:
Word (group theory) Computer science Natural language processing Artificial intelligence Linguistics Philosophy

Metrics

1
Cited By
0.26
FWCI (Field Weighted Citation Impact)
29
Refs
0.55
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and dialogue systems
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

BOOK-CHAPTER

Unsupervised Learning of Cross-Lingual Word Embeddings

Anders SøgaardIvan VulićSebastian RuderManaal Faruqui

Synthesis lectures on human language technologies Year: 2019 Pages: 67-74
BOOK

Cross-Lingual Word Embeddings

Anders SøgaardIvan VulićSebastian RuderManaal Faruqui

Synthesis lectures on human language technologies Year: 2019
JOURNAL ARTICLE

Cross-Lingual Word Embeddings

Anders SøgaardIvan VulićSebastian RuderManaal Faruqui

Journal:   Synthesis lectures on human language technologies Year: 2019 Vol: 12 (2)Pages: 1-132
© 2026 ScienceGate Book Chapters — All rights reserved.