Offensive Chinese Text Detection Based on Multi-Feature Fusion

Na Li; Shaomei Li; Jiahao Hong

doi:10.1109/isceic59030.2023.10271226

ScienceGate Book Chapters

JOURNAL ARTICLE

Offensive Chinese Text Detection Based on Multi-Feature Fusion

Na Li Shaomei Li Jiahao Hong

Year: 2023 Pages: 460-465

DOI: 10.1109/isceic59030.2023.10271226

Get Full-Text PDF Get Analytical Report

Abstract

To purify the online environment, it is essential to identify objectionable content, including offensive texts. However, some offensive texts are expressed in a more subtle manner, making it difficult to detect their literal characteristics. To enhance the effectiveness of detecting offensive Chinese text, we propose a multi-feature fusion-based method. First, we combine the word vectors obtained from Wobert with the character vectors obtained from ALBERT. The attention mechanism assigns greater importance to key features within the word vectors. Next, we merge the fusion vector with the sentence vector generated by ALBERT, which encompasses contextual semantics and syntactic information. This results in a new fusion vector that captures information at the character, word, and sentence levels. Finally, we employ a fully connected layer to process the three-level fusion vector and obtain the detection outcome. Experimental results demonstrate that this approach provides a comprehensive characterization of offensive text by fusing information from multiple levels. It substantially enhances the detection performance for offensive Chinese text.

Keywords:

Offensive Computer science Sentence Artificial intelligence Natural language processing Merge (version control) Word (group theory) Character (mathematics) Fusion Feature (linguistics) Information retrieval Linguistics Mathematics

Metrics

Cited By

0.51

FWCI (Field Weighted Citation Impact)

Refs

0.67

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Hate Speech and Cyberbullying Detection

Physical Sciences → Computer Science → Artificial Intelligence

Spam and Phishing Detection

Physical Sciences → Computer Science → Information Systems

Web Application Security Vulnerabilities

Physical Sciences → Computer Science → Information Systems

Offensive Chinese Text Detection Based on Multi-Feature Fusion

Abstract

Metrics

Citation History

Topics

Related Documents

Implicit Offensive Speech Detection Based on Multi-feature Fusion

Video text detection based on multi-feature fusion

UFNet: A Multi-scale Fusion Feature based Text Detection Method

Chinese Event Detection Based on Multi-Feature Fusion and BiLSTM

Scene Text Detection Based on Multi-scale Feature Extraction and Bidirectional Feature Fusion