JOURNAL ARTICLE

Boosting Fake News Detection in Arabic Dialects with Consistency-Aware LLM Merging Techniques

Abstract

This work explores the use of Large Language Models (LLMs) for fake news detection in multilingual and multi-script contexts, focusing on Arabic dialects. We address the scarcity of digital data for many Arabic dialects by using LLMs pretrained on a diverse corpus that includes Modern Standard Arabic (MSA), followed by fine-tuning on dialect-specific data. We examine AraBERT, DarijaBERT, and mBERT on North African Arabic dialects, accounting for code-switching and writing styles such as Arabizi. We evaluate these models on the BOUTEF dataset, which includes fake news, fake comments, and denial categories. Our approach fine-tunes on both Arabic- and Latin-script text, with a focus on cross-script generalization. We improve accuracy with an ensemble strategy that merges predictions from AraBERT and DarijaBERT. Additionally, we introduce a new custom loss function, named CALLM, to enforce consistency between models and boost classification performance. CALLM achieves a significant improvement in F1-score (12.88 ↑) and accuracy (2.47 ↑) over the best baseline model (MarBERT).
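The abstract does not give CALLM's exact formulation. As an illustrative sketch only, a consistency-aware loss of this kind is often built from each model's supervised cross-entropy plus a penalty on the divergence between the two models' predicted class distributions; the symmetric-KL consistency term, the weight `lam`, and all function names below are assumptions, not the paper's definition:

```python
import math

def cross_entropy(probs, label):
    # Negative log-likelihood of the true class under one model's prediction.
    return -math.log(probs[label])

def sym_kl(p, q):
    # Symmetric KL divergence between two categorical distributions.
    kl = lambda a, b: sum(ai * math.log(ai / bi) for ai, bi in zip(a, b))
    return kl(p, q) + kl(q, p)

def callm_loss(probs_a, probs_b, label, lam=0.5):
    # Supervised loss for each model plus a consistency penalty that
    # encourages the two heads (e.g. AraBERT- and DarijaBERT-based
    # classifiers) to agree on the same input.
    supervised = cross_entropy(probs_a, label) + cross_entropy(probs_b, label)
    consistency = sym_kl(probs_a, probs_b)
    return supervised + lam * consistency

# Example: two 3-class predictions (fake news / fake comment / denial).
p_a = [0.7, 0.2, 0.1]
p_b = [0.6, 0.3, 0.1]
loss = callm_loss(p_a, p_b, label=0)
```

When the two models agree exactly, the consistency term vanishes and the loss reduces to the sum of the two cross-entropies, so the penalty only activates on disagreement.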


Topics

Spam and Phishing Detection
Physical Sciences →  Computer Science →  Information Systems
Misinformation and Its Impacts
Social Sciences →  Social Sciences →  Sociology and Political Science
Hate Speech and Cyberbullying Detection
Physical Sciences →  Computer Science →  Artificial Intelligence

© 2026 ScienceGate Book Chapters — All rights reserved.