JOURNAL ARTICLE

Leveraging Concept-Enhanced Pre-Training Model and Masked-Entity Language Model for Named Entity Disambiguation

Zizheng JiLin DaiJin PangTingting Shen

Year: 2020 Journal:   IEEE Access Vol: 8 Pages: 100469-100484   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Named Entity Disambiguation (NED) refers to the task of resolving multiple named entity mentions in an input-text sequence to their correct references in a knowledge graph. We tackle NED problem by leveraging two novel objectives for pre-training framework, and propose a novel pre-training NED model. Especially, the proposed pre-training NED model consists of: (i) concept-enhanced pre-training, aiming at identifying valid lexical semantic relations with the concept semantic constraints derived from external resource Probase; and (ii) masked entity language model, aiming to train the contextualized embedding by predicting randomly masked entities based on words and non-masked entities in the given input-text. Therefore, the proposed pre-training NED model could merge the advantage of pre-training mechanism for generating contextualized embedding with the superiority of the lexical knowledge (e.g., concept knowledge emphasized here) for understanding language semantic. We conduct experiments on the CoNLL dataset and TAC dataset, and various datasets provided by GERBIL platform. The experimental results demonstrate that the proposed model achieves significantly higher performance than previous models.

Keywords:
Computer science Entity linking Natural language processing Artificial intelligence Knowledge graph Language model Merge (version control) Embedding Named-entity recognition Sequence labeling Named entity Task (project management) Information retrieval Knowledge base

Metrics

11
Cited By
0.88
FWCI (Field Weighted Citation Impact)
78
Refs
0.78
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Quality and Management
Social Sciences →  Decision Sciences →  Management Science and Operations Research

Related Documents

BOOK-CHAPTER

Named Entity Identification Based Translation Disambiguation Model

Vijay Kumar SharmaNamita Mittal

Lecture notes in computer science Year: 2017 Pages: 365-372
BOOK-CHAPTER

Disambiguation Model for Bio-Medical Named Entity Recognition

Ashutosh Kumar

Studies in big data Year: 2019 Pages: 41-55
JOURNAL ARTICLE

Research on Named Entity Recognition Method Based on Language Pre-Training Model

Jiurong Fan

Journal:   Advances in Computer Signals and Systems Year: 2023 Vol: 7 (1)
© 2026 ScienceGate Book Chapters — All rights reserved.