Hybrid Swin Transformer-Based Classification of Gaze Target Regions

Gongpu Wu; Changyuan Wang; Lina Gao; Jinna Xue

doi:10.1109/access.2023.3335249

ScienceGate Book Chapters

JOURNAL ARTICLE

Hybrid Swin Transformer-Based Classification of Gaze Target Regions

Gongpu Wu Changyuan Wang Lina Gao Jinna Xue

Year: 2023 Journal: IEEE Access Vol: 11 Pages: 132055-132067 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/access.2023.3335249

Get Full-Text PDF Get Analytical Report

Abstract

Inferring gaze targeting or gaze following is an effective approach for comprehending human behavior and intentions. This paper employs a non-intrusive appearance-based tracking technique, utilizing a binocular stereo vision camera to capture the face image and head pose to address errors caused by problems such as the disappearance of the eye image and head deflection occlusion in image capture. Each gaze direction is determined based on a single image frame. To improve the classification and detection of the gaze target region by effectively handling head motion and view direction, this paper proposes a hybrid structure for the Swin Transformer gaze target region classification method. The facial image features are extracted using both the ResNet50 model and the Swin Transformer model, followed by fusing head pose features to categorise the gaze target area. The study also compares the classification effects of various structural models. The analysis of the results demonstrates that the hybrid Swin Transformer model outperforms in classifying and detecting the gaze target region, achieving an accuracy rate of 90%. Finally, the research examines the gaze of flight trainees during flight missions by using a heatmap, which lays the groundwork for future analyses of pilot attention and operational intentions during flights.

Keywords:

Gaze Artificial intelligence Computer vision Computer science Transformer Engineering

Metrics

Cited By

0.73

FWCI (Field Weighted Citation Impact)

Refs

0.68

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Gaze Tracking and Assistive Technology

Physical Sciences → Computer Science → Human-Computer Interaction

Visual Attention and Saliency Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Hand Gesture Recognition Systems

Physical Sciences → Computer Science → Human-Computer Interaction

Hybrid Swin Transformer-Based Classification of Gaze Target Regions

Abstract

Metrics

Citation History

Topics

Related Documents

Hybrid Swin Transformer for Appearance Gaze Estimation

Gaze estimation based on swin transformer

Gaze-Swin: Enhancing Gaze Estimation with a Hybrid CNN-Transformer Network and Dropkey Mechanism

Swin Transformer-Based Poisonous Mushroom Classification Model

Respiratory Sound Classification Based on Swin Transformer