JOURNAL ARTICLE

Extracting bilingual multi-word expressions for low-resource statistical machine translation

Abstract

Improving the performance of statistical machine translation is often a significant problem, especially in low language resource scenarios such as Chinese-Mongolian SMT. In this paper, we propose a method to improve the performance of Chinese-Mongolian SMT system using multi-word expressions, which is also a pilot study for this language pair. We extract MWEs from the phrase-table then integrate the MWEs into SMT system by various strategies. Experimental results indicate our method outperforms a baseline model by 0.81 BLEU points on Test-All and 1.54 BLEU points on Test-MWE.

Keywords:
Machine translation Computer science Phrase Word (group theory) Artificial intelligence Natural language processing Table (database) BLEU Translation (biology) Bilingual dictionary Baseline (sea) Language model Resource (disambiguation) Data mining Linguistics

Metrics

1
Cited By
0.00
FWCI (Field Weighted Citation Impact)
23
Refs
0.10
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Algorithms and Data Compression
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.