JOURNAL ARTICLE

Chinese-Vietnamese cross-lingual event retrieval method based on knowledge distillation

Shengxiang Gao, Zhilei He, Zhengtao Yu, Enchang Zhu, Shaoyang Wu

Year: 2024   Journal: Journal of Intelligent & Fuzzy Systems   Vol: 46 (4)   Pages: 8461-8475   Publisher: IOS Press

Abstract

Cross-lingual event retrieval is an information retrieval task that aims to find texts or documents related to a specific event across multiple languages. In the Chinese-Vietnamese setting, a Chinese query is used to retrieve Vietnamese documents related to the queried event. The critical issue is how to align query and document representations efficiently with limited resources. Existing cross-lingual pre-trained models are trained on large-scale multilingual corpora, but their training objectives do not include explicit language alignment tasks. Because the training corpora are unevenly distributed across languages, these models suffer from language bias, and cross-lingual retrieval built on them inherits that bias. To solve this problem, this paper proposes a Chinese-Vietnamese cross-lingual event retrieval method based on knowledge distillation. The approach transfers knowledge from a high-resource language to a low-resource language, enabling the model to learn good query-document matching features from monolingual retrieval. By enhancing the alignment between queries and documents in different languages within a shared semantic space, the method improves the performance of Chinese-Vietnamese cross-lingual event retrieval.
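The abstract does not specify the training objective, so the following is only a minimal sketch of one common way to distill retrieval knowledge, not the authors' implementation. It assumes a teacher that scores candidate documents in a monolingual (high-resource) setting and a student that scores the same candidates cross-lingually; the student is pushed toward the teacher's ranking distribution with a KL-divergence loss. The function names, the temperature parameter, and the toy scores are all hypothetical.

```python
import math
from typing import List


def softmax(scores: List[float], temperature: float = 1.0) -> List[float]:
    """Turn raw relevance scores into a probability distribution."""
    scaled = [s / temperature for s in scores]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(
    teacher_scores: List[float],
    student_scores: List[float],
    temperature: float = 1.0,
) -> float:
    """KL(teacher || student) over the same list of candidate documents.

    Assumption: teacher_scores come from monolingual query-document
    matching, student_scores from Chinese-query / Vietnamese-document
    matching. Both lists must be aligned by candidate.
    """
    if len(teacher_scores) != len(student_scores):
        raise ValueError("teacher and student must score the same candidates")
    p = softmax(teacher_scores, temperature)
    q = softmax(student_scores, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


# Toy scores for three candidate documents (illustrative values only).
teacher = [2.0, 0.5, -1.0]
student = [1.0, 0.8, -0.5]
loss = distillation_loss(teacher, student, temperature=1.0)
print(f"distillation loss: {loss:.4f}")
```

The loss is zero when the student reproduces the teacher's distribution over candidates and grows as the two rankings diverge, which is what lets matching behavior learned in the high-resource setting guide the low-resource one.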

Keywords:
Computer science, Vietnamese, Natural language processing, Event (particle physics), Artificial intelligence, Task (project management), Information retrieval, Matching (statistics), Query expansion, Query language, Resource (disambiguation), Machine translation, Linguistics

Metrics

Cited By: 2
FWCI (Field Weighted Citation Impact): 1.28
Refs: 5
Citation Normalized Percentile: 0.74

Topics

Topic Modeling: Physical Sciences → Computer Science → Artificial Intelligence
Natural Language Processing Techniques: Physical Sciences → Computer Science → Artificial Intelligence
Text and Document Classification Technologies: Physical Sciences → Computer Science → Artificial Intelligence