JOURNAL ARTICLE

Gaze Estimation with Multi-Scale Channel and Spatial Attention

Abstract

Gaze estimation is well established as a significant research topic in computer vision given its importance for different applications. Recent studies demonstrate that other regions of the face beyond the two eyes contain valuable information for gaze estimation. Motivated by these works, we propose a novel and powerful deep convolutional network with multi-scale channel and spatial attention, which only takes the full-face image as input without additional modules to detect the eyes and estimate the head pose, to handle the gaze estimation task. It uses multi-scale channel and spatial information to adaptively select and increase important features and suppress some unnecessary facial regions which may not contribute to estimate gaze. By rigorously evaluating our module, we show that our method significantly outperforms the state-of-the-art for 3D gaze estimation on multiple public datasets.

Keywords:
Gaze Computer science Artificial intelligence Channel (broadcasting) Computer vision Task (project management) Scale (ratio) Convolutional neural network Face (sociological concept) Estimation Pattern recognition (psychology) Geography Cartography

Metrics

11
Cited By
0.97
FWCI (Field Weighted Citation Impact)
71
Refs
0.75
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Gaze Tracking and Assistive Technology
Physical Sciences →  Computer Science →  Human-Computer Interaction
Retinal Imaging and Analysis
Health Sciences →  Medicine →  Radiology, Nuclear Medicine and Imaging
Advanced Computing and Algorithms
Social Sciences →  Social Sciences →  Urban Studies
© 2026 ScienceGate Book Chapters — All rights reserved.