Ensemble-based Semi-Supervised Learning for Hate Speech Detection

Safa Alsafari

doi:10.32473/flairs.v34i1.128427

ScienceGate Book Chapters

JOURNAL ARTICLE

Ensemble-based Semi-Supervised Learning for Hate Speech Detection

Safa Alsafari

Year: 2021 Journal: Proceedings of the ... International Florida Artificial Intelligence Research Society Conference Vol: 34 (1) Publisher: George A. Smathers Libraries

DOI: 10.32473/flairs.v34i1.128427

Get Full-Text PDF Get Analytical Report

Abstract

Large and accurately labeled textual corpora are vital to developing efficient hate speech classifiers. This paper introduces an ensemble-based semi-supervised learning approach to leverage the availability of abundant social media content. Starting with a reliable hate speech dataset, we train and test diverse classifiers that are then used to label a corpus of one million tweets. Next, we investigate several strategies to select the most confident labels from the obtained pseudo labels. We assess these strategies by re-training all the classifiers with the seed dataset augmented with the trusted pseudo-labeled data. Finally, we demonstrate that our approach improves classification performance over supervised hate speech classification methods.

Keywords:

Leverage (statistics) Computer science Ensemble learning Artificial intelligence Labeled data Voice activity detection Machine learning Supervised learning Natural language processing Speech recognition Speech processing Artificial neural network

Metrics

Cited By

0.25

FWCI (Field Weighted Citation Impact)

Refs

0.47

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Hate Speech and Cyberbullying Detection

Physical Sciences → Computer Science → Artificial Intelligence

Internet Traffic Analysis and Secure E-voting

Physical Sciences → Computer Science → Artificial Intelligence

Network Security and Intrusion Detection

Physical Sciences → Computer Science → Computer Networks and Communications

Ensemble-based Semi-Supervised Learning for Hate Speech Detection

Abstract

Metrics

Citation History

Topics

Related Documents

Semi-meta-supervised hate speech detection

Semi-Supervised Self-Learning for Arabic Hate Speech Detection

Ensemble Based Hinglish Hate Speech Detection

Ensemble-Based Semi-Supervised Learning for Milling Chatter Detection

BERT-based ensemble learning for multi-aspect hate speech detection