Leveraging Local Variance for Pseudo-Label Selection in Semi-supervised Learning

Zeping Min; Jinfeng Bai; Chengfei Li

doi:10.1609/aaai.v38i13.29350

ScienceGate Book Chapters

JOURNAL ARTICLE

Leveraging Local Variance for Pseudo-Label Selection in Semi-supervised Learning

Zeping Min Jinfeng Bai Chengfei Li

Year: 2024 Journal: Proceedings of the AAAI Conference on Artificial Intelligence Vol: 38 (13)Pages: 14370-14378 Publisher: Association for the Advancement of Artificial Intelligence

DOI: 10.1609/aaai.v38i13.29350

Get Full-Text PDF Get Analytical Report

Abstract

Semi-supervised learning algorithms that use pseudo-labeling have become increasingly popular for improving model performance by utilizing both labeled and unlabeled data. In this paper, we offer a fresh perspective on the selection of pseudo-labels, inspired by theoretical insights. We suggest that pseudo-labels with a high degree of local variance are more prone to inaccuracies. Based on this premise, we introduce the Local Variance Match (LVM) method, which aims to optimize the selection of pseudo-labels in semi-supervised learning (SSL) tasks. Our methodology is validated through a series of experiments on widely-used image classification datasets, such as CIFAR-10, CIFAR-100, and SVHN, spanning various labeled data quantity scenarios. The empirical findings show that the LVM method substantially outpaces current SSL techniques, achieving state-of-the-art results in many of these scenarios. For instance, we observed an error rate of 5.41% on CIFAR-10 with a single label for each class, 35.87% on CIFAR-100 when using four labels per class, and 1.94% on SVHN with four labels for each class. Notably, the standout error rate of 5.41% is less than 1% shy of the performance in a fully-supervised learning environment. In experiments on ImageNet with 100k labeled data, the LVM also reached state-of-the-art outcomes. Additionally, the efficacy of the LVM method is further validated by its stellar performance in speech recognition experiments.

Keywords:

Selection (genetic algorithm) Variance (accounting) Machine learning Artificial intelligence Computer science Business

Metrics

Cited By

1.21

FWCI (Field Weighted Citation Impact)

Refs

0.71

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Leveraging Local Variance for Pseudo-Label Selection in Semi-supervised Learning

Abstract

Metrics

Citation History

Topics

Related Documents

Pseudo-label Selection for Deep Semi-supervised Learning

Robust pseudo-label selection for holistic semi-supervised learning

Pseudo‐Label Selection‐Based Federated Semi‐Supervised Learning Framework for Vehicular Networks

Pseudo-Label Semi-Supervised Learning for Soybean Monitoring

Naive semi-supervised deep learning using pseudo-label