JOURNAL ARTICLE

Toward Cross-Lingual Social Event Detection with Hybrid Knowledge Distillation

Jiaqian RenHao PengLei JiangZhifeng HaoJia WuShengxiang GaoZhengtao YuQiang Yang

Year: 2024 Journal:   ACM Transactions on Knowledge Discovery from Data Vol: 18 (9)Pages: 1-36   Publisher: Association for Computing Machinery

Abstract

Recently published graph neural networks (GNNs) show promising performance at social event detection tasks. However, most studies are oriented toward monolingual data in languages with abundant training samples. This has left the common lesser-spoken languages relatively unexplored. Thus, in this work, we present a GNN-based framework that integrates cross-lingual word embeddings into the process of graph knowledge distillation for detecting events in low-resource language data streams. To achieve this, a novel cross-lingual knowledge distillation framework, called CLKD, exploits prior knowledge learned from similar threads in English to make up for the paucity of annotated data. Specifically, to extract sufficient useful knowledge, we propose a hybrid distillation method that consists of both feature-wise and relation-wise information. To transfer both kinds of knowledge in an effective way, we add a cross-lingual module in the feature-wise distillation to eliminate the language gap and selectively choose beneficial relations in the relation-wise distillation to avoid distraction caused by teachers’ misjudgments. Our proposed CLKD framework also adopts different configurations to suit both offline and online situations. Experiments on real-world datasets show that the framework is highly effective at detection in languages where training samples are scarce.

Keywords:
Computer science Exploit Artificial intelligence Distillation Relation (database) Natural language processing Machine learning Graph Feature (linguistics) Process (computing) Artificial neural network Data mining Theoretical computer science

Metrics

8
Cited By
5.11
FWCI (Field Weighted Citation Impact)
71
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Graph Neural Networks
Physical Sciences →  Computer Science →  Artificial Intelligence
Complex Network Analysis Techniques
Physical Sciences →  Physics and Astronomy →  Statistical and Nonlinear Physics
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.