JOURNAL ARTICLE

On Graph-Based Name Disambiguation

Xiaoming FanJianyong WangPu XuLizhu ZhouBing Lv

Year: 2011 Journal:   Journal of Data and Information Quality Vol: 2 (2)Pages: 1-23   Publisher: Association for Computing Machinery

Abstract

Name ambiguity stems from the fact that many people or objects share identical names in the real world. Such name ambiguity decreases the performance of document retrieval, Web search, information integration, and may cause confusion in other applications. Due to the same name spellings and lack of information, it is a nontrivial task to distinguish them accurately. In this article, we focus on investigating the problem in digital libraries to distinguish publications written by authors with identical names. We present an effective framework named GHOST (abbreviation for GrapHical framewOrk for name diSambiguaTion), to solve the problem systematically. We devise a novel similarity metric, and utilize only one type of attribute (i.e., coauthorship) in GHOST. Given the similarity matrix, intermediate results are grouped into clusters with a recently introduced powerful clustering algorithm called Affinity Propagation . In addition, as a complementary technique, user feedback can be used to enhance the performance. We evaluated the framework on the real DBLP and PubMed datasets, and the experimental results show that GHOST can achieve both high precision and recall .

Keywords:
Computer science Ambiguity Information retrieval Precision and recall Similarity (geometry) Focus (optics) Task (project management) Cluster analysis Graph Confusion Metric (unit) Natural language processing Artificial intelligence Theoretical computer science Image (mathematics)

Metrics

134
Cited By
14.11
FWCI (Field Weighted Citation Impact)
35
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Quality and Management
Social Sciences →  Decision Sciences →  Management Science and Operations Research
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Biomedical Text Mining and Ontologies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology

Related Documents

JOURNAL ARTICLE

Name Disambiguation Based on Graph Convolutional Network

Ya ChenHongliang YuanTingting LiuNan Ding

Journal:   Scientific Programming Year: 2021 Vol: 2021 Pages: 1-11
JOURNAL ARTICLE

Author Name Disambiguation Based on Heterogeneous Graph

Chuang Ma ChuangHelong Xia Chuang

Journal:   電腦學刊 Year: 2023 Vol: 34 (4)Pages: 041-052
BOOK-CHAPTER

Author Name Disambiguation Based on Rule and Graph Model

Lizhi ZhangZhijie Ban

Lecture notes in computer science Year: 2020 Pages: 617-628
JOURNAL ARTICLE

Author name disambiguation based on heterogeneous graph neural network

Ge WangZikai SunWei HuMisheng Cai

Journal:   PLoS ONE Year: 2025 Vol: 20 (2)Pages: e0310992-e0310992
JOURNAL ARTICLE

Graph-based methods for Author Name Disambiguation: a survey

Michele De BonisFabrizio FalchiPaolo Manghi

Journal:   PeerJ Computer Science Year: 2023 Vol: 9 Pages: e1536-e1536
© 2026 ScienceGate Book Chapters — All rights reserved.