Topic Model Based Multi-Label Classification

Divya Padmanabhan; Satyanath Bhat; Shirish Shevade; Y. Narahari

doi:10.1109/ictai.2016.0154

ScienceGate Book Chapters

JOURNAL ARTICLE

Topic Model Based Multi-Label Classification

Divya Padmanabhan Satyanath Bhat Shirish Shevade Y. Narahari

Year: 2016 Pages: 996-1003

DOI: 10.1109/ictai.2016.0154

Get Full-Text PDF Get Analytical Report

Abstract

Multi-label classification is a common supervised machine learning problem where each instance is associated with multiple classes. The key challenge in this problem is learning the correlations between the classes. An additional challenge arises when the labels of the training instances are provided by noisy, heterogeneous crowd-workers with unknown qualities. We first assume labels from a perfect source and propose a novel topic model (ML-PA-LDA) where the classes that are present as well as the classes absent generate the latent topics and hence the words. Extensive experimentation on real world datasets reveals the superior performance of the proposed model. We then non-trivially extend our topic model to the scenario where the labels are provided by noisy crowd-workers and refer to this model as ML-PA-LDA-C. With experiments on simulated crowd, the proposed model learns the qualities of the annotators well, even with minimal training data.

Keywords:

Computer science Artificial intelligence Machine learning Key (lock) Topic model Labeled data Multi-label classification Training set

Metrics

Cited By

0.85

FWCI (Field Weighted Citation Impact)

Refs

0.90

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Spam and Phishing Detection

Physical Sciences → Computer Science → Information Systems

Web Data Mining and Analysis

Physical Sciences → Computer Science → Information Systems

Topic Model Based Multi-Label Classification

Abstract

Metrics

Citation History

Topics

Related Documents

A Label Distribution Topic Model for Multi-label Classification

Centroid prior topic model for multi-label classification

Labelset topic model for multi-label document classification

Multi-label Classification via Label-Topic Pairs

Multi-label topic classification model of COVID-19 literature