JOURNAL ARTICLE

Semi-Supervised Text Classification With Universum Learning

Chien‐Liang LiuWen-Hoar HsaioChia-Hoang LeeTao-Hsing ChangTsung-Hsun Kuo

Year: 2015 Journal:   IEEE Transactions on Cybernetics Vol: 46 (2)Pages: 462-473   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Universum, a collection of nonexamples that do not belong to any class of interest, has become a new research topic in machine learning. This paper devises a semi-supervised learning with Universum algorithm based on boosting technique, and focuses on situations where only a few labeled examples are available. We also show that the training error of AdaBoost with Universum is bounded by the product of normalization factor, and the training error drops exponentially fast when each weak classifier is slightly better than random guessing. Finally, the experiments use four data sets with several combinations. Experimental results indicate that the proposed algorithm can benefit from Universum examples and outperform several alternative methods, particularly when insufficient labeled examples are available. When the number of labeled examples is insufficient to estimate the parameters of classification functions, the Universum can be used to approximate the prior distribution of the classification functions. The experimental results can be explained using the concept of Universum introduced by Vapnik, that is, Universum examples implicitly specify a prior distribution on the set of classification functions.

Keywords:
Boosting (machine learning) AdaBoost Computer science Machine learning Artificial intelligence Classifier (UML) Normalization (sociology) One-class classification Bounded function Generalization error Mathematics Stability (learning theory)

Metrics

78
Cited By
8.17
FWCI (Field Weighted Citation Impact)
72
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Machine Learning in Bioinformatics
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.