Gloss-Free End-to-End Sign Language Translation

Kezhou Lin; Xiaohan Wang; Linchao Zhu; Ke Sun; Bang Zhang; Yi Yang

doi:10.18653/v1/2023.acl-long.722

ScienceGate Book Chapters

JOURNAL ARTICLE

Gloss-Free End-to-End Sign Language Translation

Kezhou Lin Xiaohan Wang Linchao Zhu Ke Sun Bang Zhang Yi Yang

Year: 2023 Pages: 12904-12916

DOI: 10.18653/v1/2023.acl-long.722

Get Full-Text PDF Get Analytical Report

Abstract

In this paper, we tackle the problem of sign language translation (SLT) without gloss annotations. Although intermediate representation like gloss has been proven effective, gloss annotations are hard to acquire, especially in large quantities. This limits the domain coverage of translation datasets, thus handicapping real-world applications. To mitigate this problem, we design the Gloss-Free End-to-end sign language translation framework (GloFE). Our method improves the performance of SLT in the gloss-free setting by exploiting the shared underlying semantics of signs and the corresponding spoken translation. Common concepts are extracted from the text and used as a weak form of intermediate representation. The global embedding of these concepts is used as a query for cross-attention to find the corresponding information within the learned visual features. In a contrastive manner, we encourage the similarity of query results between samples containing such concepts and decrease those that do not. We obtained state-of-the-art results on large-scale datasets, including OpenASL and How2Sign.

Keywords:

Gloss (optics) Computer science Embedding Natural language processing Artificial intelligence Sign language Linguistics

Metrics

Cited By

4.39

FWCI (Field Weighted Citation Impact)

Refs

0.92

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Hand Gesture Recognition Systems

Physical Sciences → Computer Science → Human-Computer Interaction

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Gloss-Free End-to-End Sign Language Translation

Abstract

Metrics

Citation History

Topics

Related Documents

Gloss Attention for Gloss-free Sign Language Translation

End-to-End Two-Handed Sign Language Translation

End to End Sign Language Translation via Multitask Learning

End to End Simple Indian Sign Language Sentence Translation Using Sign Transformer Network

Cross-modality Data Augmentation for End-to-End Sign Language Translation