Abstract

Traditional NER systems are typically trained to recognize coarse-grained categories of entities, and less attention is given to classifying entities into a hierarchy of fine-grained lower-level sub-types. This article aims to advance Arabic NER with fine-grained entities. We chose to extend Wojood (an open-source Nested Arabic Named Entity Corpus) with sub-types. In particular, four main entity types in Wojood (geopolitical entity (GPE), location (LOC), organization (ORG), and facility (FAC) are extended with 31 sub-types of entities. To do this, we first revised Wojood’s annotations of GPE, LOC, ORG, and FAC to be compatible with the LDC’s ACE guidelines, which yielded 5, 614 changes. Second, all mentions of GPE, LOC, ORG, and FAC (~ 44K) in Wojood are manually annotated with the LDC’s ACE subtypes. This extended version of Wojood is called WojoodFine. To evaluate our annotations, we measured the inter-annotator agreement (IAA) using both Cohen’s Kappa and F1 score, resulting in 0.9861 and 0.9889, respectively. To compute the baselines of WojoodFine, we fine-tune three pre-trained Arabic BERT encoders in three settings: flat NER, nested NER and nested NER with sub-types and achieved F1 score of 0.920, 0.866, and 0.885, respectively. Our corpus and models are open source and available at https://sina.birzeit.edu/wojood/.

Keywords:
Computer science Natural language processing Named-entity recognition Arabic Artificial intelligence Entity linking Modern Standard Arabic Information retrieval Linguistics Knowledge base

Metrics

4
Cited By
1.02
FWCI (Field Weighted Citation Impact)
38
Refs
0.77
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Fine-Grained Entity Recognition

Xiao LingDaniel S. Weld

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2021 Vol: 26 (1)Pages: 94-100
JOURNAL ARTICLE

FgER: Fine-Grained Entity Recognition

Abhishek Singh

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2018 Vol: 32 (1)
JOURNAL ARTICLE

Multilingual Fine-Grained Named Entity Recognition

Viorica-Camelia LupancuAdrian Iftene

Journal:   Computer Science Journal of Moldova Year: 2023 Vol: 31 (3(93))Pages: 321-339
JOURNAL ARTICLE

Fine-grained Dutch named entity recognition

Bart DesmetVéronique Hoste

Journal:   Language Resources and Evaluation Year: 2013 Vol: 48 (2)Pages: 307-343
© 2026 ScienceGate Book Chapters — All rights reserved.