Hao Zhang, Yixiang Sun, Zenghui Liu, Qiyuan Liu, Xiyao Liu, Ming Jiang, Gerald Schafer, Hui Fang
Sign language translation (SLT) has attracted significant interest from both research and industry, as it enables convenient communication with the deaf-mute community. While recent transformer-based models have improved sign translation performance, it remains under-explored how to design an efficient transformer-based deep network architecture that effectively extracts joint visual-text features by exploiting multi-level spatial and temporal contextual information. In this paper, we propose the heterogeneous attention-based transformer (HAT), a novel SLT model that generates attention from diverse spatial and temporal contextual levels. Specifically, the proposed lightweight dual-stream sparse attention-based module yields more effective visual-text representations than conventional transformers. Extensive experiments demonstrate that HAT achieves state-of-the-art performance on the challenging PHOENIX2014T benchmark dataset with a BLEU-4 score of 25.33 on the test set.
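To make the idea of a dual-stream sparse attention module concrete, the following is a minimal numpy sketch, not the authors' implementation: it assumes a top-k form of sparse attention (each query attends only to its k highest-scoring keys) and a simple two-stream layout in which a visual self-attention stream runs alongside a text-queries-visual cross-attention stream. All function names and the fusion choice here are illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def sparse_attention(Q, K, V, k=2):
    """Top-k sparse scaled dot-product attention (an assumed sparsity
    scheme): each query keeps only its k highest-scoring keys."""
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)                # (T_q, T_k)
    kth = np.sort(scores, axis=-1)[:, -k][:, None]  # per-row k-th largest
    masked = np.where(scores >= kth, scores, -np.inf)
    return softmax(masked, axis=-1) @ V

def dual_stream(visual, text, k=2):
    """Two attention streams over shared visual features:
    visual self-attention and text-to-visual cross-attention."""
    vis_out = sparse_attention(visual, visual, visual, k)
    cross_out = sparse_attention(text, visual, visual, k)
    return vis_out, cross_out

rng = np.random.default_rng(0)
visual = rng.normal(size=(6, 8))  # 6 video frames, feature dim 8
text = rng.normal(size=(4, 8))    # 4 text tokens, feature dim 8
vis_out, cross_out = dual_stream(visual, text, k=2)
```

In this sketch the two streams simply return separate tensors; a real model would fuse them (e.g. by concatenation or gating) before the decoder, a detail the abstract does not specify.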