JOURNAL ARTICLE

FCGCL: Fine- and Coarse-Granularity Contrastive Learning for Speech Translation

Abstract

It is notoriously difficult to implement end-to-end speech translation (E2E-ST) model because of the task complexity and data scarcity. Existing techniques often attempt to carry out implicit knowledge transfer from machine translation (MT) to ST model by imposing various constraints. However, in this transfer scenario, a significant problem is that the performance of the MT will drop significantly and the final transfer effect is also restricted. In this article, we recommend Fine and Coarse Granularity Contrastive Learning (FCGCL), which conduct explicit knowledge transfer from MT to ST model. Specially, we ensure through multi granularity contrastive learning that inputs with similar semantic between different modalities are encoded closely in the shared semantic space while inputs with different semantics are kept apart. Experiments on the MuST-C datasets on all 8 languages and further analysis show that our method can effectively improve the E2E-ST performance and achieves an average BLEU of 29.0.

Keywords:
Computer science Granularity Machine translation Natural language processing Artificial intelligence Transfer of learning Semantics (computer science) Task (project management) Translation (biology) Programming language

Metrics

2
Cited By
0.39
FWCI (Field Weighted Citation Impact)
65
Refs
0.64
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Coarse-to-Fine Contrastive Learning on Graphs

Peiyao ZhaoYuangang PanXin LiXu ChenIvor W. TsangLejian Liao

Journal:   IEEE Transactions on Neural Networks and Learning Systems Year: 2023 Vol: 35 (4)Pages: 4622-4634
JOURNAL ARTICLE

Contrastive coarse-to-fine medical segmentation with prototype guidance and dual-granularity fusion

Zekai LiuMuxi LiFei Yang

Journal:   Neurocomputing Year: 2026 Vol: 670 Pages: 132603-132603
JOURNAL ARTICLE

Cross-modal Contrastive Learning for Speech Translation

Rong YeMingxuan WangLei Li

Journal:   Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies Year: 2022 Pages: 5099-5113
© 2026 ScienceGate Book Chapters — All rights reserved.