Multilingual Hate Speech and Offensive Language Detection

Sapthak Mohajon Turjya; Rina Kumari; Sujata Swain; Anjan Bandyopadhyay

doi:10.1109/ocit59427.2023.10431222

ScienceGate Book Chapters

JOURNAL ARTICLE

Multilingual Hate Speech and Offensive Language Detection

Sapthak Mohajon Turjya Rina Kumari Sujata Swain Anjan Bandyopadhyay

Year: 2023 Pages: 660-664

DOI: 10.1109/ocit59427.2023.10431222

Get Full-Text PDF Get Analytical Report

Abstract

Internet and social media usage has skyrocketed over the past two decades, changing how people communicate with one another on a basic level. Numerous favourable results have resulted from this. The risks and harms that come with it are also there. It is impossible for humans to control the amount of damaging content, such as hate speech, that is available online. Researching automated methods for hate speech identification has drawn more attention from academics. Through the creation of a single homogeneous dataset, we investigate various publicly accessible datasets in this work. We establish a baseline model and enhance model performance scores using various optimisation strategies after classifying them into two categories: hate or non-hate. After achieving a competitive performance score, we develop a tool that, using the same feedback, quickly locates and evaluates a page with an effective measure. This tool then retrains our model using the new data. In three languages: English, German, and Spanish. We demonstrate the superior performance of our multilingual approach. In comparison to most monolingual models, this results in performance that is equal to or better.

Keywords:

Offensive Computer science Linguistics Speech recognition Natural language processing Artificial intelligence Engineering Philosophy

Metrics

Cited By

0.51

FWCI (Field Weighted Citation Impact)

Refs

0.69

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Hate Speech and Cyberbullying Detection

Physical Sciences → Computer Science → Artificial Intelligence

Swearing, Euphemism, Multilingualism

Social Sciences → Social Sciences → Communication

Freedom of Expression and Defamation

Social Sciences → Social Sciences → Law

Multilingual Hate Speech and Offensive Language Detection

Abstract

Metrics

Citation History

Topics

Related Documents

Hate Speech and Offensive Language Detection in Bengali

Multilingual Hate Speech Detection

Multilingual hate speech detection

Hate-Speech and Offensive Language Detection in Roman Urdu

Hate Speech and Offensive Language Detection from Social Media