Abstract

The Kailinese language, spoken in Indonesia's Central Sulawesi Province, faces challenges due to limited everyday use. According to an article published on the brin.go.id website, some dialects of Kailinese have only four family speakers. To address these concerns, this research introduces a dedicated machine translation model and dataset for Kailinese. Our translation system uses the IndoBART-V2 model to translate between Indonesian and Kailinese in both directions. We tested two scenarios: one with a diverse dataset of reviews and random topics, and another restricted to review-type data. With default parameters and preprocessing based on the "Colloquial Indonesian Lexicon Dictionary," the model achieves higher SacreBLEU scores than it does without preprocessing or with non-default parameters. In scenario 1, the model scores 19.8 for Indonesian to Kailinese translation and 23.0 for Kailinese to Indonesian translation. In scenario 2, where both training and testing data consist of review-type sentences, the model scores 18.4 and 22.7 for Indonesian to Kailinese and Kailinese to Indonesian translation, respectively. These results indicate that the developed model is effective. Furthermore, our analysis suggests that sentence composition influences the model's performance, although the scores for scenario 1 and scenario 2 do not differ notably. This underscores the importance of considering sentence types when building the translation model.
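The lexicon-based preprocessing mentioned in the abstract can be illustrated with a minimal sketch. It assumes the step amounts to replacing colloquial tokens with their standard forms before translation; the abstract does not describe the actual procedure. The `SAMPLE_LEXICON` entries and the `normalize` helper are hypothetical and are not taken from the "Colloquial Indonesian Lexicon Dictionary" or from the authors' code.

```python
import re

# Hypothetical mini-lexicon mapping colloquial forms to standard forms.
# Illustrative entries only; not copied from the lexicon used in the paper.
SAMPLE_LEXICON = {
    "gak": "tidak",
    "yg": "yang",
    "udah": "sudah",
    "dgn": "dengan",
}


def normalize(sentence, lexicon=SAMPLE_LEXICON):
    """Lowercase the sentence and replace every token found in the lexicon."""
    # Split into word tokens and standalone punctuation marks.
    tokens = re.findall(r"\w+|[^\w\s]", sentence.lower())
    # Tokens missing from the lexicon pass through unchanged.
    return " ".join(lexicon.get(token, token) for token in tokens)


print(normalize("Makanannya gak enak, yg ini udah dingin"))
```

In a pipeline of this kind, the normalized sentence would then be passed to the translation model in place of the raw input.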

Keywords:
Indonesian, Computer science, Machine translation, Translation (biology), Artificial intelligence, Natural language processing, Linguistics, Philosophy

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 0.26
Refs: 25
Citation Normalized Percentile: 0.57

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Indonesian-to-Javanese Machine Translation

Aji Prasetya Wibawa

Journal: International Journal of Innovation Management and Technology Year: 2013

JOURNAL ARTICLE

Interference of Kailinese on Sentence Formation in Indonesian Among Indonesian High School Students

Yasril Ananta, Rizky Anugrah Putra

Journal: Pulchra Lingua A Journal of Language Study Literature & Linguistics Year: 2024 Vol: 3 (1) Pages: 56-70

JOURNAL ARTICLE

Machine Translation Indonesian Bengkulu Malay Using Neural Machine Translation-LSTM

Bella Okta Sari Miranda, Herman Yuliansyah, Muhammad Kunta Biddinika

Journal: IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Year: 2024 Vol: 18 (3)