JOURNAL ARTICLE

Cognate Production using Character-based Machine Translation

Lisa BeinbornTorsten ZeschIryna Gurevych

Year: 2013 Journal:   TUbilio (Technical University of Darmstadt) Pages: 883-891   Publisher: Technical University of Darmstadt

Abstract

Cognates are words in different languages that are associated with each other by language learners. Thus, cognates are important indicators for the prediction of the perceived difficulty of a text. We introduce a method for automatic cognate production using character-based machine translation. We show that our approach is able to learn production patterns from noisy training data and that it works for a wide range of language pairs. It even works across different alphabets, e.g. we obtain good results on the tested language pairs English-Russian, English-Greek, and English-Farsi. Our method performs significantly better than similarity measures used in previous work on cognates.

Keywords:
Cognate Computer science Machine translation Character (mathematics) Natural language processing Artificial intelligence Similarity (geometry) Translation (biology) Production (economics) Linguistics Speech recognition Mathematics

Metrics

35
Cited By
4.24
FWCI (Field Weighted Citation Impact)
24
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Text Readability and Simplification
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.