Towards Bangla Named Entity Recognition

Shammur Absar Chowdhury; Firoj Alam; Naira Khan

doi:10.1109/iccitechn.2018.8631931

ScienceGate Book Chapters

JOURNAL ARTICLE

Towards Bangla Named Entity Recognition

Shammur Absar Chowdhury Firoj Alam Naira Khan

Year: 2018 Pages: 1-7

DOI: 10.1109/iccitechn.2018.8631931

Get Full-Text PDF Get Analytical Report

Abstract

Named Entity Recognition is one of the fundamental problems for Information Extraction and the task is to find the mentioned entities in text. Over the years there has been significant progress in Named Entity Recognition (NER) research for resource-rich languages such as English, Chinese, and Italian. Although, there are a number of studies for Bangla NER, however, most of these studies are conducted almost a decade ago and were focused on a single geographical location (i.e., India). Therefore, in this paper, we present a corpus annotated with seven named entities with a particular focus on Bangladeshi Bangla. It is a part of the development of the Bangla Content Annotation Bank (B-CAB). We also present baseline results, which can be useful for future research. For the baseline results, we employed word-level, POS, gazetteers and contextual features along with Conditional Random Fields (CRFs). Our study also includes the exploration of deep neural networks. Additionally, we investigated another large corpus from a different geographical location (i.e., India) and concluded on the importance of geographic-based NER for a language.

Keywords:

Bengali Named-entity recognition Computer science CRFS Natural language processing Annotation Conditional random field Artificial intelligence Task (project management) Baseline (sea) Focus (optics) Word (group theory) Named entity Information retrieval Linguistics

Metrics

Cited By

0.99

FWCI (Field Weighted Citation Impact)

Refs

0.81

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Towards Bangla Named Entity Recognition

Abstract

Metrics

Citation History

Topics

Related Documents

Towards Robust Named Entity Recognition in Bangla With LLMs Based Data Augmentation

A step towards information extraction: Named entity recognition in Bangla using deep learning

GRU based Named Entity Recognition System for Bangla Online Newspapers

Banner: A Cost-Sensitive Contextualized Model for Bangla Named Entity Recognition

BanglaMedNER: A Gold Standard Medical Named Entity Recognition Corpus for Bangla Text