JOURNAL ARTICLE

Knowledge Derived From Wikipedia For Computing Semantic Relatedness

Simone Paolo PonzettoMichael Strube

Year: 2007 Journal:   Journal of Artificial Intelligence Research Vol: 30 Pages: 181-212   Publisher: AI Access Foundation

Abstract

Wikipedia provides a semantic network for computing semantic relatedness in a more structured fashion than a search engine and with more coverage than WordNet. We present experiments on using Wikipedia for computing semantic relatedness and compare it to WordNet on various benchmarking datasets. Existing relatedness measures perform better using Wikipedia than a baseline given by Google counts, and we show that Wikipedia outperforms WordNet on some datasets. We also address the question whether and how Wikipedia can be integrated into NLP applications as a knowledge base. Including Wikipedia improves the performance of a machine learning based coreference resolution system, indicating that it represents a valuable resource for NLP applications. Finally, we show that our method can be easily used for languages other than English by computing semantic relatedness for a German dataset.

Keywords:
WordNet Computer science Coreference Information retrieval Semantic similarity Natural language processing Baseline (sea) Artificial intelligence Benchmarking Benchmark (surveying) Semantic computing Resource (disambiguation) Knowledge base Semantic search Resolution (logic) Semantic Web

Metrics

232
Cited By
15.13
FWCI (Field Weighted Citation Impact)
102
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Semantic Web and Ontologies
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.