JOURNAL ARTICLE

Chinese News Text Classification Algorithm Based on Online Knowledge Extension and Convolutional Neural Network

Abstract

Chinese text classification is an important task in data mining, which extracts category features from unstructured contents. Conventional Chinese text classification models only leverage the surface features in the original text, which omits the potential extensional knowledge of each word. To capture the semantic features of each word more comprehensive, this paper proposed a Chinese news text classification algorithm based on an online knowledge extension and convolutional neural network (OKE-CNN), which leverages both knowledge graph to extend latent semantic information and CNN to obtain the category. Compared with other baseline methods, OKE-CNN can utilize the surface and latent features, simultaneously, which can be adapted to complex scenes, e.g., sparse data and unclear topics. In our experiment, OKE-CNN exhibits superior performance and achieves 97.94% and 87.03% on THUCNews and TouTiao datasets, separately, over SOTA competitors.

Keywords:
Computer science Leverage (statistics) Convolutional neural network Artificial intelligence Natural language processing Latent semantic analysis Graph Semantics (computer science) Extension (predicate logic) Pattern recognition (psychology) Information retrieval Theoretical computer science

Metrics

7
Cited By
0.46
FWCI (Field Weighted Citation Impact)
49
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Web Data Mining and Analysis
Physical Sciences →  Computer Science →  Information Systems
© 2026 ScienceGate Book Chapters — All rights reserved.