JOURNAL ARTICLE

Bilingual Word Embeddings for Phrase-Based Machine Translation

Abstract

We introduce bilingual word embeddings: semantic embeddings associated across two languages in the context of neural language models. We propose a method to learn bilingual embeddings from a large unlabeled corpus, while utilizing MT word alignments to constrain translational equivalence. The new embeddings significantly out-perform baselines in word semantic similarity. A single semantic similarity feature induced with bilingual embeddings adds near half a BLEU point to the results of NIST08 Chinese-English machine translation task.

Keywords:
Computer science Natural language processing Machine translation Artificial intelligence Phrase Word (group theory) Equivalence (formal languages) Similarity (geometry) Semantic similarity Context (archaeology) Task (project management) Feature (linguistics) Point (geometry) Translation (biology) Bilingual dictionary Word embedding Example-based machine translation Linguistics Embedding Mathematics

Metrics

539
Cited By
82.52
FWCI (Field Weighted Citation Impact)
42
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Text Readability and Simplification
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.