JOURNAL ARTICLE

Named Entity Recognition on Arabic-English Code-Mixed Data

Abstract

As a result of globalization and better quality of education, a significant percentage of the population in Arab countries have become bilingual/multilingual. This has raised to the frequency of code-switching and code-mixing among Arabs in daily communication. Consequently, huge amount of Code-Mixed (CM) content can be found on different social media platforms. Such data could be analyzed and used in different Natural Language Processing (NLP) tasks to tackle the challenges emerging due to this multilingual phenomenon. Named Entity Recognition (NER) is one of the major tasks for several NLP systems. It is the process of identifying named entities in text. However, there is a lack of annotated CM data and resources for such task. This work aims at collecting and building the first annotated CM Arabic-English corpus for NER. Furthermore, we constructed a baseline NER system using deep neural networks and word embedding for Arabic-English CM text and enhanced it using a pooling technique.

Keywords:
Computer science Named-entity recognition Natural language processing Word embedding Artificial intelligence Pooling Task (project management) Code (set theory) Baseline (sea) Population Arabic Process (computing) Embedding Linguistics Programming language

Metrics

26
Cited By
2.61
FWCI (Field Weighted Citation Impact)
43
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Techniques for Named Entity Recognition on Arabic-English Code-Mixed Data

Caroline SabtyAhmed SherifMohamed ElmahdySlim Abdennadher

Journal:   International Journal of Robotic Computing Year: 2019 Pages: 44-63
JOURNAL ARTICLE

spaCy Performance on Named Entity Recognition with Code-Mixed Data

Xia, Hanxin

Journal:   Arabixiv (OSF Preprints) Year: 2024
BOOK-CHAPTER

Performance Analysis of Named Entity Recognition Approaches on Code-Mixed Data

Sreeja GaddamidiRajendra Prasath

Communications in computer and information science Year: 2021 Pages: 153-167
JOURNAL ARTICLE

Named Entity Recognition for Code Mixed Social Media Sentences

Yashvardhan SharmaRupal BhargavaBapiraju Vamsi Tadikonda

Journal:   International Journal of Software Science and Computational Intelligence Year: 2021 Vol: 13 (2)Pages: 23-36
© 2026 ScienceGate Book Chapters — All rights reserved.