JOURNAL ARTICLE

Skeleton-Aware Neural Sign Language Translation

Abstract

As an essential communication way for deaf-mutes, sign languages are expressed by human actions. To distinguish human actions for sign language understanding, the skeleton which contains position information of human pose can provide an important cue, since different actions usually correspond to different poses/skeletons. However, skeleton has not been fully studied for Sign Language Translation (SLT), especially for end-to-end SLT. Therefore, in this paper, we propose a novel end-to-end Skeleton-Aware neural Network (SANet) for video-based SLT. Specifically, to achieve end-to-end SLT, we design a self-contained branch for skeleton extraction. To efficiently guide the feature extraction from video with skeletons, we concatenate the skeleton channel and RGB channels of each frame for feature extraction. To distinguish the importance of clips, we construct a skeleton-based Graph Convolutional Network (GCN) for feature scaling, i.e., giving importance weight for each clip. The scaled features of each clip are then sent to a decoder module to generate spoken language. In our SANet, a joint training strategy is designed to optimize skeleton extraction and sign language translation jointly. Experimental results on two large scale SLT datasets demonstrate the effectiveness of our approach, which outperforms the state-of-the-art methods. Our code is available at https://github.com/SignLanguageCode/SANet.

Keywords:
Computer science Sign language Artificial intelligence Skeleton (computer programming) Feature extraction Translation (biology) Convolutional neural network Feature (linguistics) Human skeleton Natural language processing Graph Pattern recognition (psychology) Speech recognition Computer vision Theoretical computer science

Metrics

21
Cited By
2.13
FWCI (Field Weighted Citation Impact)
35
Refs
0.86
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Hand Gesture Recognition Systems
Physical Sciences →  Computer Science →  Human-Computer Interaction
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Gait Recognition and Analysis
Physical Sciences →  Engineering →  Biomedical Engineering

Related Documents

JOURNAL ARTICLE

Vision-Based Sign Language Translation via a Skeleton-Aware Neural Network

Shiwei GanYafeng YinZhiwei JiangLei XieSang-Lu Lu

Journal:   Journal of Computer Science and Technology Year: 2025 Vol: 40 (2)Pages: 378-396
JOURNAL ARTICLE

Neural Sign Language Recognition and Translation

CAMGÖZ, NECATI CIHAN

Journal:   Surrey Open Research repository (University of Surrey) Year: 2020
JOURNAL ARTICLE

Cross-modal Neural Sign Language Translation

Amanda Duarte

Year: 2019 Pages: 1650-1654
© 2026 ScienceGate Book Chapters — All rights reserved.