JOURNAL ARTICLE

Lightly supervised transliteration for machine translation

Abstract

We present a Hebrew to English transliteration method in the context of a machine translation system. Our method uses machine learning to determine which terms are to be transliterated rather than translated. The training corpus for this purpose includes only positive examples, acquired semi-automatically. Our classifier reduces more than 38% of the errors made by a baseline method. The identified terms are then transliterated. We present an SMT-based transliteration model trained with a parallel corpus extracted from Wikipedia using a fairly simple method which requires minimal knowledge. The correct result is produced in more than 76% of the cases, and in 92% of the instances it is one of the top-5 results. We also demonstrate a small improvement in the performance of a Hebrew-to-English MT system that uses our transliteration module.

Keywords:
Transliteration Computer science Hebrew Artificial intelligence Machine translation Natural language processing Classifier (UML) Context (archaeology) Evaluation of machine translation Translation (biology) Example-based machine translation Machine translation software usability Linguistics

Metrics

12
Cited By
0.76
FWCI (Field Weighted Citation Impact)
40
Refs
0.85
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Bodo to English Machine Translation through Transliteration

Saiful IslamProf. Bipul Syam Purkayastha

Journal:   International Journal of Innovative Technology and Exploring Engineering Year: 2019 Vol: 8 (12)Pages: 3825-3830
© 2026 ScienceGate Book Chapters — All rights reserved.