JOURNAL ARTICLE

Automatic content-based categorization of Wikipedia articles

Abstract

Wikipedia's article contents and its category hierarchy are widely used to produce semantic resources which improve performance on tasks like text classification and keyword extraction. The reverse -- using text classification methods for predicting the categories of Wikipedia articles -- has attracted less attention so far. We propose to "return the favor" and use text classifiers to improve Wikipedia. This could support the emergence of a virtuous circle between the wisdom of the crowds and machine learning/NLP methods.

Keywords:
Computer science Categorization Crowds Text categorization Information retrieval Hierarchy Artificial intelligence Natural language processing Keyword extraction

Metrics

10
Cited By
2.08
FWCI (Field Weighted Citation Impact)
28
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Wikis in Education and Collaboration
Social Sciences →  Social Sciences →  Communication
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

WikiAutoCat: Information Retrieval System for Automatic Categorization of Wikipedia Articles

Nesma RefaeiElsayed E. HemayedRiham Mansour

Journal:   Arabian Journal for Science and Engineering Year: 2018 Vol: 43 (12)Pages: 8095-8109
JOURNAL ARTICLE

Content driven automatic categorization of research articles

J. Jeba EmilynNivas RavichandranC.R. SakthivelRaja Krishnan

Journal:   AIP conference proceedings Year: 2025 Vol: 3279 Pages: 020027-020027
BOOK-CHAPTER

Weakly-Supervised Neural Categorization of Wikipedia Articles

Xingyu ChenMizuho Iwaihara

Lecture notes in computer science Year: 2019 Pages: 16-22
BOOK-CHAPTER

Categorization of Wikipedia Articles with Spectral Clustering

Julian Szymański

Lecture notes in computer science Year: 2011 Pages: 108-115
© 2026 ScienceGate Book Chapters — All rights reserved.