Chinese News Text Multi Classification Based on Naive Bayes Algorithm

Fei Wang; Xin Deng; Lunqing Hou

doi:10.1145/3284557.3284704

ScienceGate Book Chapters

JOURNAL ARTICLE

Chinese News Text Multi Classification Based on Naive Bayes Algorithm

Fei Wang Xin Deng Lunqing Hou

Year: 2018 Vol: 32 Pages: 1-5

DOI: 10.1145/3284557.3284704

Get Full-Text PDF Get Analytical Report

Abstract

With the development of Internet, there are more and more text data appear, the companies face the challenge to organize the content and the users feel confused about what is useful content for them. If the text data can be classified will make a contribution to solve the problem. It has been a long time, text classification work is done by human beings, like editors. So text classification become a hot topic in nature language processing field, especially for Chinese text classification. Sentiment classification just need to classify two classes, but there are more situations where we need to do multi classification. Such as the news editors have to give an article tags manually. There are several ways to solve the text classification problem: (1) Naive Bayes algorithm (2) support vector machine algorithm (3) neural network (4) k nearest neighbors (5) decision tree [1][2][3][4][5]. Naive Bayes applies Bayes' theorem with strong(naive) independence assumptions between the features. This paper proposes to use Naive Bayes to finish a Chinese news text multi classification with nine classes.

Keywords:

Naive Bayes classifier Computer science Artificial intelligence Support vector machine Machine learning Decision tree Bayesian programming Statistical classification Field (mathematics) The Internet Natural language processing Information retrieval Bayes' theorem Bayesian probability Mathematics World Wide Web Bayes factor

Metrics

Cited By

0.20

FWCI (Field Weighted Citation Impact)

Refs

0.61

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Web Data Mining and Analysis

Physical Sciences → Computer Science → Information Systems

Chinese News Text Multi Classification Based on Naive Bayes Algorithm

Abstract

Metrics

Citation History

Topics

Related Documents

A Chinese text classification system based on Naive Bayes algorithm

Parallel naive Bayes algorithm for large-scale Chinese text classification based on spark

A New Naive Bayes Text Classification Algorithm

Chinese Web Text Classification System Model Based on Naive Bayes

Text Classification in Architecture Field Based on Naive Bayes Algorithm