JOURNAL ARTICLE

Semi-supervised Learning from General Unlabeled Data

Abstract

We consider the problem of semi-supervised learning (SSL) from general unlabeled data, which may contain irrelevant samples. Within the binary setting, our model manages to better utilize the information from unlabeled data by formulating them as a three-class (-1,+1, 0) mixture, where class 0 represents the irrelevant data. This distinguishes our work from the traditional SSL problem where unlabeled data are assumed to contain relevant samples only, either +1 or -1, which are forced to be the same as the given labeled samples. This work is also different from another family of popular models, universum learning (universum means "irrelevant" data), in that the universum need not to be specified beforehand. One significant contribution of our proposed framework is that such irrelevant samples can be automatically detected from the available unlabeled data, even though they are mixed with relevant data. This hence presents a general SSL framework that does not force "clean" unlabeled data.More importantly, we formulate this general learning framework as a Semi-definite Programming problem, making it solvable in polynomial time. A series of experiments demonstrate that the proposed framework can outperform the traditional SSL on both synthetic and real data.

Keywords:
Computer science Semi-supervised learning Class (philosophy) Labeled data Artificial intelligence Machine learning Supervised learning Binary number Binary classification Data mining Support vector machine Mathematics Artificial neural network

Metrics

23
Cited By
3.99
FWCI (Field Weighted Citation Impact)
40
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Machine Learning and Data Classification
Physical Sciences →  Computer Science →  Artificial Intelligence
Infrastructure Maintenance and Monitoring
Physical Sciences →  Engineering →  Civil and Structural Engineering

Related Documents

JOURNAL ARTICLE

Semi-supervised crowd counting from unlabeled data

Haoran DuanYawen HuangYang LongXian WuFeiyue HuangShaoxin Li

Journal:   Pattern Recognition Year: 2025 Vol: 172 Pages: 112787-112787
JOURNAL ARTICLE

AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled Data

Amin Banitalebi-DehkordiPratik GujjarYong Zhang

Journal:   2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) Year: 2022 Pages: 3998-4005
JOURNAL ARTICLE

Distributed Semi-Supervised Learning with Positive and Unlabeled Data

Ying LiuChen Zhang

Journal:   2022 3rd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE) Year: 2022 Vol: 37 Pages: 488-492
© 2026 ScienceGate Book Chapters — All rights reserved.