Many information extraction projects today must classify large amounts of text data. The standard approach is to build a supervised classifier trained on human-labeled positive and negative examples. In many cases, however, positive examples are easy to label while negative examples are hard to obtain. In this paper, we address the problem of building a one-class classifier when only positive examples are labeled. Previous work on one-class classification mostly relies on positive examples together with unlabeled data. We show that a configurable one-class classifier, such as one-class naive Bayes, can be optimized by examining the clustering quality of its classification of the target data. We propose existing and new quality scores for measuring this clustering quality. Experimental analysis on real-world data shows that our approach generally achieves high classification accuracy, and in some cases improves accuracy by more than 10% over state-of-the-art baselines.
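The idea described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes a Bernoulli naive Bayes model fit on positive examples only, scores unlabeled target documents by log-likelihood, and tunes the decision threshold by the silhouette score (one possible clustering-quality measure) of the induced positive/negative split. The function names and the choice of silhouette score are illustrative assumptions.

```python
# Sketch of one-class naive Bayes tuned by clustering quality.
# Assumptions (not from the paper): Bernoulli features, silhouette
# score as the quality measure, threshold search over score quantiles.
import numpy as np
from sklearn.metrics import silhouette_score


def fit_one_class_nb(X_pos, alpha=1.0):
    """Estimate smoothed Bernoulli feature probabilities from positives only."""
    n, _ = X_pos.shape
    return (X_pos.sum(axis=0) + alpha) / (n + 2 * alpha)


def log_likelihood(X, p):
    """Per-document Bernoulli naive Bayes log-likelihood under model p."""
    return X @ np.log(p) + (1 - X) @ np.log(1 - p)


def tune_threshold(X_target, p):
    """Pick the log-likelihood threshold whose induced two-way split
    of the target data has the best clustering quality (silhouette)."""
    scores = log_likelihood(X_target, p)
    best_t, best_q = None, -1.0
    for t in np.quantile(scores, np.linspace(0.05, 0.95, 19)):
        labels = (scores >= t).astype(int)
        if labels.min() == labels.max():  # degenerate split, skip
            continue
        q = silhouette_score(X_target, labels)
        if q > best_q:
            best_t, best_q = t, q
    return best_t, best_q
```

Documents scoring at or above the tuned threshold are classified as positive; the key point is that the threshold is chosen without any labeled negatives, using only how well the resulting split clusters the target data.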
Wang-bin Zhu, Yaping Lin, Mu Lin, Zhiping Chen
Mayank Swarnkar, Neminath Hubballi