Changli Li, Elizabeth Tong, Kao Zhang, Ningxin Cheng, Zhongyuan Lai, Zhigeng Pan
Recently, with the widespread adoption of deep learning, appearance-based gaze estimation has made breakthrough progress. However, most methods focus on feature extraction from the facial region while neglecting the critical role of the eye region in gaze estimation, leading to insufficient representation of eye detail. To address this issue, this paper proposes an appearance-based multi-stream multi-input network (MSMI-Net). The model consists of two independent streams that extract high-dimensional eye features and low-dimensional features, integrating both eye and facial information. A parallel channel and spatial attention mechanism fuses the low-dimensional eye and facial features, while an adaptive weight adjustment mechanism (AWAM) dynamically determines the contribution ratio of the eye and facial features. The concatenated high-dimensional and fused low-dimensional features are passed through fully connected layers to predict the final gaze direction. Extensive experiments on the EYEDIAP, MPIIFaceGaze, and Gaze360 datasets validate the superiority of the proposed method.
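As a rough illustration only (the abstract does not specify the actual layer designs), the fusion pipeline described above can be sketched in NumPy. Both the parallel channel/spatial attention and the AWAM-style weighting below are hypothetical simplifications: the attention maps are plain sigmoid gates, and the adaptive weights are derived from mean feature magnitude and normalized with a softmax.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_spatial_attention(feat):
    """Toy parallel channel + spatial attention (hypothetical simplification).

    feat: (C, H, W) feature map. A channel gate and a spatial gate are
    computed in parallel, applied to the input, and summed.
    """
    chan = sigmoid(feat.mean(axis=(1, 2)))   # (C,)  channel attention
    spat = sigmoid(feat.mean(axis=0))        # (H, W) spatial attention
    return feat * chan[:, None, None] + feat * spat[None, :, :]

def adaptive_fusion(eye_feat, face_feat):
    """Hypothetical AWAM-style fusion: scalar contribution weights for the
    eye and face streams, normalized to sum to 1 via a softmax."""
    scores = np.array([np.abs(eye_feat).mean(), np.abs(face_feat).mean()])
    weights = np.exp(scores) / np.exp(scores).sum()
    return weights[0] * eye_feat + weights[1] * face_feat, weights

rng = np.random.default_rng(0)
eye = channel_spatial_attention(rng.standard_normal((8, 4, 4)))
face = channel_spatial_attention(rng.standard_normal((8, 4, 4)))
fused, weights = adaptive_fusion(eye, face)
print(fused.shape, weights)
```

In the full model, the fused low-dimensional features would then be concatenated with the high-dimensional eye features and passed through fully connected layers to regress the gaze direction.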