JOURNAL ARTICLE

Cyberbullying Detection in Urdu Language Using Machine Learning

Abstract

Cyberbullying has become a significant problem with the surge in the use of social media. The most basic way to prevent cyberbullying on these social media platforms is to identify and remove offensive comments. However, it is hard for humans to read and remove all the comments manually. Current research work focuses on using machine learning to detect and eliminate cyberbullying. Although most of the work has been conducted on English texts to detect cyberbullying, limited to no work can be found in Urdu. This paper aims to detect cyberbullying from the users' comments posted in Urdu on Twitter using machine learning and Natural Language Processing (NLP) techniques. To the best of our knowledge, cyberbullying detection on Urdu text comments has not been performed due to the lack of a publicly available standard Urdu dataset. In this paper, we created a dataset of offensive user-generated Urdu comments from Twitter. The comments in the dataset are classified into five categories. n-gram techniques are used to extract features at character and word levels. Various supervised machine-learning techniques are applied to the dataset to detect cyberbullying. Evaluation metrics such as precision, recall, accuracy and F1 scores are used to analyse the performance of machine learning techniques.

Keywords:
Urdu Computer science Offensive Artificial intelligence Social media Machine learning Natural language processing Recall World Wide Web Engineering Psychology

Metrics

10
Cited By
1.96
FWCI (Field Weighted Citation Impact)
13
Refs
0.84
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Hate Speech and Cyberbullying Detection
Physical Sciences →  Computer Science →  Artificial Intelligence
Bullying, Victimization, and Aggression
Social Sciences →  Psychology →  Social Psychology
Advanced Malware Detection Techniques
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

BOOK-CHAPTER

Cyberbullying Detection for Urdu Language Using Machine Learning

H. MustafaKashif Zafar

Lecture notes in networks and systems Year: 2024 Pages: 244-257
JOURNAL ARTICLE

Fake news detection in Urdu language using machine learning

Muhammad Shoaib FarooqAnsar NaseemFurqan RustamImran Ashraf

Journal:   PeerJ Computer Science Year: 2023 Vol: 9 Pages: e1353-e1353
JOURNAL ARTICLE

Cyberbullying detection using machine learning

Ajay Kumar YadavHari Om Patel

Journal:   AIP conference proceedings Year: 2025 Vol: 3224 Pages: 020062-020062
JOURNAL ARTICLE

Cyberbullying Detection using Machine Learning

Aaminah AliAdeel M. Syed

Journal:   Pakistan Journal of Engineering and Technology Year: 2022 Vol: 3 (2)Pages: 45-50
© 2026 ScienceGate Book Chapters — All rights reserved.