Hybrid Knowledge Transfer for Improved Cross-Lingual Event Detection via Hierarchical Sample Selection

Luis Guzman Nateras; Franck Dernoncourt; Thien Huu Nguyen

doi:10.18653/v1/2023.acl-long.296

ScienceGate Book Chapters

JOURNAL ARTICLE

Hybrid Knowledge Transfer for Improved Cross-Lingual Event Detection via Hierarchical Sample Selection

Luis Guzman Nateras Franck Dernoncourt Thien Huu Nguyen

Year: 2023 Pages: 5414-5427

DOI: 10.18653/v1/2023.acl-long.296

Get Full-Text PDF Get Analytical Report

Abstract

In this paper, we address the Event Detection task under a zero-shot cross-lingual setting where a model is trained on a source language but evaluated on a distinct target language for which there is no labeled data available. Most recent efforts in this field follow a direct transfer approach in which the model is trained using language-invariant features and then directly applied to the target language. However, we argue that these methods fail to take advantage of the benefits of the data transfer approach where a cross-lingual model is trained on target-language data and is able to learn task-specific information from syntactical features or word-label relations in the target language. As such, we propose a hybrid knowledge-transfer approach that leverages a teacher-student framework where the teacher and student networks are trained following the direct and data transfer approaches, respectively. Our method is complemented by a hierarchical training-sample selection scheme designed to address the issue of noisy labels being generated by the teacher model. Our model achieves state-of-the-art results on 9 morphologically-diverse target languages across 3 distinct datasets, highlighting the importance of exploiting the benefits of hybrid transfer.

Keywords:

Computer science Artificial intelligence Language model Transfer of learning Task (project management) Selection (genetic algorithm) Sample (material) Natural language processing Word (group theory) Event (particle physics) Machine learning

Metrics

Cited By

2.04

FWCI (Field Weighted Citation Impact)

Refs

0.86

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Hybrid Knowledge Transfer for Improved Cross-Lingual Event Detection via Hierarchical Sample Selection

Abstract

Metrics

Citation History

Topics

Related Documents

Toward Cross-Lingual Social Event Detection with Hybrid Knowledge Distillation

CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer

Transfer language selection for zero-shot cross-lingual abusive language detection

TTL: transformer-based two-phase transfer learning for cross-lingual news event detection

Cross-Lingual Event Detection via Optimized Adversarial Training