Skeleton-Aware Neural Sign Language Translation

Shiwei Gan; Yafeng Yin; Zhiwei Jiang; Lei Xie; Sanglu Lu

doi:10.1145/3474085.3475577

ScienceGate Book Chapters

JOURNAL ARTICLE

Skeleton-Aware Neural Sign Language Translation

Shiwei Gan Yafeng Yin Zhiwei Jiang Lei Xie Sanglu Lu

Year: 2021 Pages: 4353-4361

DOI: 10.1145/3474085.3475577

Get Full-Text PDF Get Analytical Report

Abstract

As an essential communication way for deaf-mutes, sign languages are expressed by human actions. To distinguish human actions for sign language understanding, the skeleton which contains position information of human pose can provide an important cue, since different actions usually correspond to different poses/skeletons. However, skeleton has not been fully studied for Sign Language Translation (SLT), especially for end-to-end SLT. Therefore, in this paper, we propose a novel end-to-end Skeleton-Aware neural Network (SANet) for video-based SLT. Specifically, to achieve end-to-end SLT, we design a self-contained branch for skeleton extraction. To efficiently guide the feature extraction from video with skeletons, we concatenate the skeleton channel and RGB channels of each frame for feature extraction. To distinguish the importance of clips, we construct a skeleton-based Graph Convolutional Network (GCN) for feature scaling, i.e., giving importance weight for each clip. The scaled features of each clip are then sent to a decoder module to generate spoken language. In our SANet, a joint training strategy is designed to optimize skeleton extraction and sign language translation jointly. Experimental results on two large scale SLT datasets demonstrate the effectiveness of our approach, which outperforms the state-of-the-art methods. Our code is available at https://github.com/SignLanguageCode/SANet.

Keywords:

Computer science Sign language Artificial intelligence Skeleton (computer programming) Feature extraction Translation (biology) Convolutional neural network Feature (linguistics) Human skeleton Natural language processing Graph Pattern recognition (psychology) Speech recognition Computer vision Theoretical computer science

Metrics

Cited By

2.13

FWCI (Field Weighted Citation Impact)

Refs

0.86

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Hand Gesture Recognition Systems

Physical Sciences → Computer Science → Human-Computer Interaction

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Gait Recognition and Analysis

Physical Sciences → Engineering → Biomedical Engineering

Skeleton-Aware Neural Sign Language Translation

Abstract

Metrics

Citation History

Topics

Related Documents

Vision-Based Sign Language Translation via a Skeleton-Aware Neural Network

Neural Sign Language Translation

Skeleton Aware Multi-modal Sign Language Recognition

Neural Sign Language Recognition and Translation

Cross-modal Neural Sign Language Translation