Zero-shot cross-lingual content filtering: offensive language and hate speech detection

Andraž, Pelicon; Shekhar, Ravi; Martinc, Matej; Škrlj, Blaž; Pollak, Senja; Purver, Matthew

doi:10.5281/zenodo.4730307

ScienceGate Book Chapters

JOURNAL ARTICLE

Zero-shot cross-lingual content filtering: offensive language and hate speech detection

Andraž, Pelicon Shekhar, Ravi Martinc, Matej Škrlj, Blaž Pollak, Senja Purver, Matthew

Year: 2021 Journal: Zenodo (CERN European Organization for Nuclear Research) Publisher: European Organization for Nuclear Research

DOI: 10.5281/zenodo.4730307

Get Full-Text PDF Get Analytical Report

Abstract

We present a system for zero-shot crosslingual offensive language and hate speech classification. The system was trained on English datasets and tested on a task of detecting hate speech and offensive social media content in a number of languages without any additional training. Experiments show an impressive ability of both models to generalize from English to other languages. There is however an expected gap in performance between the tested cross-lingual models and the monolingual models. The best performing model (offensive content classifier) is available online as a REST API

Keywords:

Offensive Voice activity detection Task (project management) Content (measure theory) Language identification Factor (programming language)

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.28

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Hate Speech and Cyberbullying Detection

Physical Sciences → Computer Science → Artificial Intelligence

Spam and Phishing Detection

Physical Sciences → Computer Science → Information Systems

Bullying, Victimization, and Aggression

Social Sciences → Psychology → Social Psychology

Zero-shot cross-lingual content filtering: offensive language and hate speech detection

Abstract

Metrics

Topics

Related Documents

Zero-shot cross-lingual content filtering: offensive language and hate speech detection

Cross-Lingual Few-Shot Hate Speech and Offensive Language Detection Using Meta Learning

Exposing the limits of Zero-shot Cross-lingual Hate Speech Detection

Label modification and bootstrapping for zero-shot cross-lingual hate speech detection

Label modification and bootstrapping for zero-shot cross-lingual hate speech detection