JOURNAL ARTICLE

A Framework for Early Detection of Cyberbullying in Chinese-English Code-Mixed Social Media Text Using Natural Language Processing and Machine Learning

Abstract

This study develops a new expert system framework to address the issue of early detection of cyberbullying incidents in Chinese-English code-mixed language on social media networks. The framework covers the crawling of session-based social media texts with potential cyberbullying messages with a crowdsourcing web application to systematically retrieve and manually annotate a cyberbullying dataset, and most importantly establishes an explainable artificial intelligence model based on natural language processing algorithm for identification of targeted emotional colloquial slang phrases and machine learning method using Shapley value and transfer learning approach for automatic early detection of cyberbullying incidents in Chinese-English codemixed language.

Keywords:
Computer science Natural language processing Social media Artificial intelligence Slang Crowdsourcing Language identification Identification (biology) Natural language Code (set theory) Session (web analytics) World Wide Web Linguistics

Metrics

2
Cited By
0.51
FWCI (Field Weighted Citation Impact)
19
Refs
0.66
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Hate Speech and Cyberbullying Detection
Physical Sciences →  Computer Science →  Artificial Intelligence
Sentiment Analysis and Opinion Mining
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.