JOURNAL ARTICLE

Enhanced Amharic-Arabic Cross-Language Information Retrieval System using Part of Speech Tagging

Abstract

In this paper we extended our first experiment on Neural Machine Translation (NMT) based query translation for Amharic-Arabic Cross Language Information Retrieval (CLIR) task to retrieve relevant documents from Amharic and Arabic text collections in response to a query expressed in the Amharic language by modifying the ranking algorithm with Parts of speech Tags (POS). We used a pre-trained NMT model, to map a query in the source language into an equivalent query in the language of the target document collection. The relevant documents are then retrieved using a Language Modeling (LM) based retrieval algorithm by substituting lambda with POS based LM. The experimental result is compared with four conventional IR models, namely Uni-gram and Bi-gram LM, Probabilistic model and Vector Space Model (VSM). The proposed POS based LM ranking algorithm outperform all others for both Amharic and Arabic language document collections.

Keywords:
Amharic Computer science Arabic Natural language processing Artificial intelligence Information retrieval Speech recognition Linguistics

Metrics

4
Cited By
0.46
FWCI (Field Weighted Citation Impact)
37
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Improving Arabic Information Retrieval Systems Using Part of Speech Tagging

Ghassan KanaanRiyad Al–ShalabiMajdi Sawalha

Journal:   Information Technology Journal Year: 2004 Vol: 4 (1)Pages: 32-37
JOURNAL ARTICLE

Toward enhanced Arabic speech recognition using part of speech tagging

Dia AbuZeinaWasfi G. Al-KhatibMoustafa ElshafeiHusni Al-Muhtaseb

Journal:   International Journal of Speech Technology Year: 2011 Vol: 14 (4)Pages: 419-426
JOURNAL ARTICLE

Improving part-of-speech tagging in Amharic language using deep neural network

Sintayehu HirpassaGurpreet Singh Lehal

Journal:   Heliyon Year: 2023 Vol: 9 (7)Pages: e17175-e17175
JOURNAL ARTICLE

Enhanced Arabic Information Retrieval by Using Arabic Slang Language

Mustafa Abdel-Kareem AbabnehGhassan KanaanAyat Amin Al-Jarrah

Journal:   Modern Applied Science Year: 2019 Vol: 13 (6)Pages: 24-24
© 2026 ScienceGate Book Chapters — All rights reserved.