Abstract

Multi-label classification is a common supervised machine learning problem in which each instance is associated with multiple classes. The key challenge is learning the correlations between the classes. An additional challenge arises when the labels of the training instances are provided by noisy, heterogeneous crowd-workers of unknown quality. We first assume labels from a perfect source and propose a novel topic model (ML-PA-LDA) in which both the classes that are present and the classes that are absent generate the latent topics and hence the words. Extensive experiments on real-world datasets demonstrate the superior performance of the proposed model. We then non-trivially extend our topic model to the scenario where the labels are provided by noisy crowd-workers, and refer to this model as ML-PA-LDA-C. In experiments with a simulated crowd, the proposed model learns the qualities of the annotators well, even with minimal training data.

Keywords:
Computer science; Artificial intelligence; Machine learning; Key (lock); Topic model; Labeled data; Multi-label classification; Training set

Metrics

Cited By: 5
FWCI (Field Weighted Citation Impact): 0.85
Refs: 35
Citation Normalized Percentile: 0.90

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Spam and Phishing Detection
Physical Sciences →  Computer Science →  Information Systems
Web Data Mining and Analysis
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Centroid prior topic model for multi-label classification

Ximing Li, Jihong Ouyang, Xiaotang Zhou

Journal: Pattern Recognition Letters Year: 2015 Vol: 62 Pages: 8-13
JOURNAL ARTICLE

Labelset topic model for multi-label document classification

Ximing Li, Jihong Ouyang, Xiaotang Zhou

Journal: Journal of Intelligent Information Systems Year: 2014 Vol: 46 (1) Pages: 83-97
BOOK-CHAPTER

Multi-label Classification via Label-Topic Pairs

Gang Chen, Yue Peng, Chongjun Wang

Lecture Notes in Computer Science Year: 2018 Pages: 32-44