JOURNAL ARTICLE

Manawi: Using Multi-Word Expressions and Named Entities to Improve Machine Translation

Abstract

We describe the Manawi 1 ( ) system submitted to the 2014 WMT translation shared task.We participated in the English-Hindi (EN-HI) and Hindi-English (HI-EN) language pair and achieved 0.792 for the Translation Error Rate (TER) score 2 for EN-HI, the lowest among the competing systems.Our main innovations are (i) the usage of outputs from NLP tools, viz.billingual multi-word expression extractor and named-entity recognizer to improve SMT quality and (ii) the introduction of a novel filter method based on sentence-alignment features.The Manawi system showed the potential of improving translation quality by incorporating multiple NLP tools within the MT pipeline.

Keywords:
Computer science Machine translation Natural language processing Artificial intelligence Pipeline (software) Hindi Sentence Word error rate Translation (biology) Word (group theory) Task (project management) Named-entity recognition Named entity Extractor Entity linking Quality (philosophy) Speech recognition Linguistics Programming language Knowledge base

Metrics

23
Cited By
6.28
FWCI (Field Weighted Citation Impact)
22
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Text Readability and Simplification
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.