JOURNAL ARTICLE

Multi-word Expressions in English-Latvian Machine Translation

Inguna Skadiņa

Year: 2016 Journal:   Baltic Journal of Modern Computing Vol: 4 (4)   Publisher: Vilnius University, University of Latvia, Latvia University of Agriculture, Institute of Mathematics and Informatics of Universi

Abstract

The paper presents series of experiments that aim to find best method how to treat multi-word expressions (MWE) in machine translation task.Methods have been investigated in a framework of statistical machine translation (SMT) for translation form English into Latvian.MWE candidates have been extracted using pattern-based and statistical approaches.Different techniques for MWE integration into SMT system are analysed.The best result -+0.59 BLEU pointshas been achieved by combining two phrase tables -bilingual MWE dictionary and phrase table created from the parallel corpus in which statistically extracted MWE candidates are treated as single tokens.Using only bilingual dictionary as additional source of information the best result (+0.36 BLEU points) is obtained by combining two phrase tables.In case of statistically obtained MWE lists, the best result (+0.51 BLEU points) is achieved with the largest list of MWE candidates.

Keywords:
Latvian Word (group theory) Translation (biology) Linguistics Natural language processing Machine translation Computer science Artificial intelligence Philosophy Biology

Metrics

5
Cited By
0.56
FWCI (Field Weighted Citation Impact)
29
Refs
0.88
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Lexicography and Language Studies
Social Sciences →  Arts and Humanities →  Language and Linguistics
© 2026 ScienceGate Book Chapters — All rights reserved.