Enhancing Arabic Information Retrieval for Question Answering

Maria Al-Ghamdi; Mohammed Abushawarib; Mahmoud Ellouh; Mustafa Ghaleb; Muhamad Felemban

doi:10.1145/3644713.3644763


JOURNAL ARTICLE


Year: 2023 Pages: 366-371


Abstract

In the modern landscape of Natural Language Processing (NLP), intelligent chatbots such as ChatGPT 3.5 and Google's Bard have shown remarkable competence in generic question-answering (QA) tasks. However, their performance falters on domain-specific QA, particularly in Arabic, a language known for its complex morphology and syntax. This paper presents a comprehensive approach to address these issues, with the aim of building a chatbot tailored for a university community. We first create an extensive Arabic Q&A dataset by extracting data from academic documents with state-of-the-art Optical Character Recognition (OCR) tools. We then evaluate multiple text similarity measures, including pooled FastText word embeddings, BM25 ranking functions, and various semantic sentence embedding models. A thorough performance assessment reveals that the domain-specific model excels at both sentence-level similarity and context-relevance tasks. The resulting web application chatbot, which leverages the LangChain library and Retrieval-Augmented Generation (RAG) methods, outperforms existing chatbots in domain-specific Arabic QA scenarios.
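As an illustration of one of the similarity measures named in the abstract, the following is a minimal sketch of Okapi BM25 ranking over a handful of passages. It is not the authors' implementation: the function names `bm25_scores` and `rank`, the parameter defaults `k1=1.5` and `b=0.75`, the whitespace tokenizer, and the three toy Arabic passages are all assumptions made for this example. A real Arabic pipeline would likely add normalization and morphological preprocessing before scoring.

```python
import math
from collections import Counter


def bm25_scores(query_tokens, docs_tokens, k1=1.5, b=0.75):
    """Score each tokenized document against the query with Okapi BM25.

    k1 and b are common default values, assumed here for illustration.
    """
    n_docs = len(docs_tokens)
    avg_len = sum(len(d) for d in docs_tokens) / n_docs
    # Document frequency: number of documents that contain each term.
    df = Counter(term for d in docs_tokens for term in set(d))
    scores = []
    for doc in docs_tokens:
        tf = Counter(doc)
        score = 0.0
        for term in query_tokens:
            if term not in tf:
                continue
            # Smoothed IDF that stays non-negative for very frequent terms.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term-frequency saturation with document-length normalization.
            denom = tf[term] + k1 * (1 - b + b * len(doc) / avg_len)
            score += idf * tf[term] * (k1 + 1) / denom
        scores.append(score)
    return scores


def rank(query, docs):
    """Return document indices ordered from most to least relevant."""
    # Whitespace splitting is a placeholder tokenizer (an assumption).
    tokenized = [d.split() for d in docs]
    scores = bm25_scores(query.split(), tokenized)
    return sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)


# Hypothetical toy passages, not taken from the paper's dataset.
DOCS = [
    "موعد التسجيل في الجامعة",   # university registration date
    "رسوم السكن الجامعي",        # university housing fees
    "ساعات عمل مكتبة الجامعة",   # university library opening hours
]
QUERY = "رسوم السكن"             # housing fees

best = rank(QUERY, DOCS)[0]
print(DOCS[best])
```

In a retrieval-augmented setup like the one the abstract describes, the top-ranked passages would then be passed to the language model as context. BM25 matches surface tokens only, which is one reason to compare it against embedding-based measures for a morphologically rich language.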

Keywords:

Computer science; Natural language processing; Question answering; Artificial intelligence; Chatbot; Information retrieval; Relevance (law); Word embedding; Sentence; Domain (mathematical analysis); Natural language; Language model; Embedding

Metrics

FWCI (Field Weighted Citation Impact): 0.51

Citation Normalized Percentile: 0.69

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

AI in Service Interactions

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence


Related Documents

Enhancing Arabic Question Answering System

Arabic Question Answering System for Information Retrieval on Large-scale Image Objects

Information Retrieval techniques for Question Answering

Enhancing Question Retrieval in Community Question Answering Using Word Embeddings

Fact Retrieval and Deductive Question-Answering Information Retrieval Systems