JOURNAL ARTICLE

One-class svms for document classification

Larry M. ManevitzMalik Yousef

Year: 2002 Journal:   Journal of Machine Learning Research Vol: 2 (2)Pages: 139-154   Publisher: The MIT Press

Abstract

We implemented versions of the SVM appropriate for one-class classification in the context of information retrieval. The experiments were conducted on the standard Reuters data set. For the SVM implementation we used both a version of Schoelkopf et al. and a somewhat different version of one-class SVM based on identifying outlier data as representative of the second-class. We report on experiments with different kernels for both of these implementations and with different representations of the data, including binary vectors, tf-idf representation and a modification called Hadamard representation. Then we compared it with one-class versions of the algorithms prototype (Rocchio), nearest neighbor, naive Bayes, and finally a natural one-class neural network classification method based on bottleneck compression generated filters.The SVM approach as represented by Schoelkopf was superior to all the methods except the neural network one, where it was, although occasionally worse, essentially comparable. However, the SVM methods turned out to be quite sensitive to the choice of representation and kernel in ways which are not well understood; therefore, for the time being leaving the neural network approach as the most robust.

Keywords:
Support vector machine Computer science Artificial intelligence Pattern recognition (psychology) Artificial neural network Class (philosophy) Context (archaeology) Representation (politics) Kernel (algebra) Outlier Binary classification Machine learning Data mining Mathematics

Metrics

1221
Cited By
18.95
FWCI (Field Weighted Citation Impact)
12
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.