Abstract

Most published approaches and resources for hate speech detection are tailored to the English language. As a consequence, cross-lingual and cross-cultural perspectives lack essential resources. The lack of diversity among Spanish datasets is notable: variation across Spanish-speaking countries means that existing datasets are insufficient to cover the task in the different Spanish variants. We annotated 9834 tweets from Chile to enrich the existing Spanish resources with different words and new targets of hate that have not been considered in previous studies. We conducted several cross-dataset evaluation experiments on models published in the literature, using our Chilean dataset and two others in English and Spanish. We propose a framework for quickly running comparative experiments with different previously published models. In addition, we set up a Codalab competition for further comparison of new models under a standard setup, that is, fixed data partitions and evaluation metrics. All resources can be accessed through a centralized repository, giving researchers a complete picture of progress on the multilingual hate speech and offensive language detection task.
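The cross-dataset evaluation mentioned in the abstract can be illustrated with a minimal sketch: train on each dataset, test on every dataset, and record one score per (source, target) pair. Everything below is an assumption made for illustration. The abstract does not name the models or the metric, so a toy keyword model and macro-averaged F1 stand in for them, the dataset names `A` and `B` are hypothetical, and the example texts use placeholder tokens rather than real data.

```python
from typing import Callable, Dict, List, Tuple

Example = Tuple[str, int]  # (text, label); label 1 = hateful, 0 = not hateful


def macro_f1(gold: List[int], pred: List[int]) -> float:
    """Unweighted mean of the per-class F1 scores over classes 0 and 1."""
    scores = []
    for cls in (0, 1):
        tp = sum(1 for g, p in zip(gold, pred) if g == cls and p == cls)
        fp = sum(1 for g, p in zip(gold, pred) if g != cls and p == cls)
        fn = sum(1 for g, p in zip(gold, pred) if g == cls and p != cls)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def train_keyword_model(train: List[Example]) -> Callable[[str], int]:
    """Toy stand-in for a real classifier (assumption, not the paper's model):
    flag any token seen in hateful training texts but never in neutral ones."""
    hateful, neutral = set(), set()
    for text, label in train:
        (hateful if label == 1 else neutral).update(text.lower().split())
    flagged = hateful - neutral
    return lambda text: int(any(tok in flagged for tok in text.lower().split()))


def cross_dataset_eval(
    datasets: Dict[str, Dict[str, List[Example]]]
) -> Dict[Tuple[str, str], float]:
    """Train on each source dataset and score on every target test split."""
    results = {}
    for src, src_splits in datasets.items():
        model = train_keyword_model(src_splits["train"])
        for tgt, tgt_splits in datasets.items():
            gold = [label for _, label in tgt_splits["test"]]
            pred = [model(text) for text, _ in tgt_splits["test"]]
            results[(src, tgt)] = macro_f1(gold, pred)
    return results


# Hypothetical miniature datasets; "slur_a" and "slur_b" are placeholder tokens
# standing in for vocabulary specific to each language variant.
datasets = {
    "A": {
        "train": [("you are slur_a", 1), ("you are nice", 0)],
        "test": [("total slur_a", 1), ("nice day", 0)],
    },
    "B": {
        "train": [("what a slur_b", 1), ("what a day", 0)],
        "test": [("such a slur_b", 1), ("good day", 0)],
    },
}

results = cross_dataset_eval(datasets)
for (src, tgt), score in sorted(results.items()):
    print(f"train={src} test={tgt} macro-F1={score:.2f}")
```

In this toy setup the in-dataset scores are perfect while the cross-dataset scores drop, because each model only knows the vocabulary of its own training data. That is the kind of gap a cross-dataset comparison is meant to expose, although the numbers here say nothing about the paper's actual results.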

Keywords:
Computer science; Offensive; Task (project management); Natural language processing; Artificial intelligence; Set (abstract data type); Diversity (politics); Political science; Operations research

Metrics

Cited By: 16
FWCI (Field Weighted Citation Impact): 3.13
Refs: 26
Citation Normalized Percentile: 0.89
Topics

Hate Speech and Cyberbullying Detection
Physical Sciences →  Computer Science →  Artificial Intelligence
Internet Traffic Analysis and Secure E-voting
Physical Sciences →  Computer Science →  Artificial Intelligence
