JOURNAL ARTICLE

Fine-Grained Named Entity Recognition for Sinhala

Abstract

For English, Named Entity Recognition (NER) is more or less a solved problem. However, for low-resourced and morphologically rich languages such as Sinhala, minimal research has been done. In this paper, we present a novel fine-grained Named Entity (NE) tag set and an NE annotated Sinhala corpus of 70k word tokens. We trained a custom NER model for Sinhala based on Conditional Random Fields (CRF). Despite the low-resourced setting, this NER model could achieve an micro-averaged F1 score of 84.8.

Keywords:
Computer science Natural language processing Named-entity recognition Artificial intelligence Named entity Entity linking Information retrieval Engineering Task (project management)

Metrics

4
Cited By
0.29
FWCI (Field Weighted Citation Impact)
28
Refs
0.63
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Quality and Management
Social Sciences →  Decision Sciences →  Management Science and Operations Research

Related Documents

JOURNAL ARTICLE

Multilingual Fine-Grained Named Entity Recognition

Viorica-Camelia LupancuAdrian Iftene

Journal:   Computer Science Journal of Moldova Year: 2023 Vol: 31 (3(93))Pages: 321-339
JOURNAL ARTICLE

Fine-grained Dutch named entity recognition

Bart DesmetVéronique Hoste

Journal:   Language Resources and Evaluation Year: 2013 Vol: 48 (2)Pages: 307-343
JOURNAL ARTICLE

Fine Grained Named Entity Recognition via Seq2seq Framework

Huiming ZhuChunhui HeYang FangWeidong Xiao

Journal:   IEEE Access Year: 2020 Vol: 8 Pages: 53953-53961
© 2026 ScienceGate Book Chapters — All rights reserved.