JOURNAL ARTICLE

Head Pose Estimation with Combined 2D SIFT and 3D HOG Features

Abstract

In this paper, an approach is presented to estimate the 3D position and orientation of head from RGB and depth images captured by a commercial sensor Kinect. We use 2D Scale-invariant feature transform (SIFT) features together with 3D histogram of oriented gradients (HOG) features which are extracted in a pair of RGB and depth images captured synchronously, named SIFT-HOG features, to improve the robustness and accuracy of head pose estimation. We apply random forests to formulate pose estimation as a regression problem, due to their power for handling large training data and the high mapping speed. And then the mean-shift method is employed to refine the result obtained by the random forests. The experiment results demonstrate that our approach of head pose estimation is efficient.

Keywords:
Scale-invariant feature transform Artificial intelligence Pose Computer vision RGB color model Histogram Robustness (evolution) Computer science Random forest Pattern recognition (psychology) Histogram of oriented gradients Feature extraction 3D pose estimation Orientation (vector space) Mathematics Image (mathematics)

Metrics

41
Cited By
1.82
FWCI (Field Weighted Citation Impact)
19
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.