JOURNAL ARTICLE

Semi-supervised machine learning for textual anomaly detection

Abstract

Anomaly detection comprises the identification of observations which do not follow the expected patterns of the assumed data set. We attempt to simplify the problem of textual anomaly detection by constructing a Multinomial Naïve Bayes classifier and enhancing it with an augmented Expectation Maximization (EM) algorithm. By doing so, we utilize large amounts of unlabelled data and show how the EM algorithm could increase the accuracy of the Naïve Bayes classifier. The process is applied to a binary classification environment in order to detect anomalies in text.

Keywords:
Anomaly detection Computer science Naive Bayes classifier Artificial intelligence Pattern recognition (psychology) Classifier (UML) Expectation–maximization algorithm Bayes classifier Binary classification Machine learning Data mining Maximum likelihood Mathematics Support vector machine Statistics

Metrics

6
Cited By
0.00
FWCI (Field Weighted Citation Impact)
13
Refs
0.08
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Network Security and Intrusion Detection
Physical Sciences →  Computer Science →  Computer Networks and Communications
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.