JOURNAL ARTICLE

Predicting Machine Translation Adequacy with Document Embeddings

Abstract

This paper describes USAAR's submission to the the metrics shared task of the Workshop on Statistical Machine Translation (WMT) in 2015.The goal of our submission is to take advantage of the semantic overlap between hypothesis and reference translation for predicting MT output adequacy using language independent document embeddings.The approach presented here is learning a Bayesian Ridge Regressor using document skip-gram embeddings in order to automatically evaluate Machine Translation (MT) output by predicting semantic adequacy scores.The evaluation of our submission -measured by the correlation with human judgements -shows promising results on system-level scores.

Keywords:
Machine translation Computer science Natural language processing Evaluation of machine translation Artificial intelligence Task (project management) Translation (biology) Machine learning Bayesian probability Example-based machine translation Machine translation software usability

Metrics

14
Cited By
3.46
FWCI (Field Weighted Citation Impact)
45
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Text Readability and Simplification
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.