JOURNAL ARTICLE

Named Entity Recognition Using Web Document Corpus

Wahiba Ben Abdessalem Karaa

Year: 2011 Journal:   International Journal of Managing Information Technology Vol: 3 (1)Pages: 46-56

Abstract

This paper introduces a named entity recognition approach in textual corpus.\nThis Named Entity (NE) can be a named: location, person, organization, date,\ntime, etc., characterized by instances. A NE is found in texts accompanied by\ncontexts: words that are left or right of the NE. The work mainly aims at\nidentifying contexts inducing the NE's nature. As such, The occurrence of the\nword "President" in a text, means that this word or context may be followed by\nthe name of a president as President "Obama". Likewise, a word preceded by the\nstring "footballer" induces that this is the name of a footballer. NE\nrecognition may be viewed as a classification method, where every word is\nassigned to a NE class, regarding the context. The aim of this study is then to\nidentify and classify the contexts that are most relevant to recognize a NE,\nthose which are frequently found with the NE. A learning approach using\ntraining corpus: web documents, constructed from learning examples is then\nsuggested. Frequency representations and modified tf-idf representations are\nused to calculate the context weights associated to context frequency, learning\nexample frequency, and document frequency in the corpus.\n

Keywords:

Metrics

2
Cited By
0.39
FWCI (Field Weighted Citation Impact)
0
Refs
0.70
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Named Entity Recognition Using Web Document Corpus

Wahiba Ben Abdessalem Karâa

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2011
JOURNAL ARTICLE

Named Entity Recognition Using Web Document Corpus

Karaa, Wahiba Ben Abdessalem

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2011
BOOK-CHAPTER

Document Theme Extraction Using Named-Entity Recognition

Deepali NagraleVaibhav KhatavkarParag Kulkarni

Advances in intelligent systems and computing Year: 2018 Pages: 499-509
JOURNAL ARTICLE

Thai Nested Named Entity Recognition Corpus

Weerayut BuaphetCan UdomcharoenchaikitPeerat LimkonchotiwatAttapol RutherfordSarana Nutanong

Journal:   Findings of the Association for Computational Linguistics: ACL 2022 Year: 2022 Pages: 1473-1486
© 2026 ScienceGate Book Chapters — All rights reserved.