JOURNAL ARTICLE

Latent structured models for human pose estimation

Abstract

We present an approach for automatic 3D human pose reconstruction from monocular images, based on a discriminative formulation with latent segmentation inputs. We advanced the field of structured prediction and human pose reconstruction on several fronts. First, by working with a pool of figure-ground segment hypotheses, the prediction problem is formulated in terms of combined learning and inference over segment hypotheses and 3D human articular configurations. Beside constructing tractable formulations for the combined segment selection and pose estimation problem, we propose new augmented kernels that can better encode complex dependencies between output variables. Furthermore, we provide primal linear re-formulations based on Fourier kernel approximations, in order to scale-up the non-linear latent structured prediction methodology. The proposed models are shown to be competitive in the HumanEva benchmark and are also illustrated in a clip collected from a Hollywood movie, where the model can infer human poses from monocular images captured in complex environments.

Keywords:
Pose Artificial intelligence Computer science Inference Discriminative model Segmentation Kernel (algebra) Benchmark (surveying) Monocular ENCODE Machine learning Pattern recognition (psychology) Computer vision Mathematics

Metrics

251
Cited By
4.86
FWCI (Field Weighted Citation Impact)
36
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

BOOK-CHAPTER

Gaussian Process Latent Variable Models for Human Pose Estimation

Carl Henrik EkPhilip H. S. TorrNeil D. Lawrence

Lecture notes in computer science Year: 2008 Pages: 132-143
DISSERTATION

Structured Articulated Human Pose Estimation with Neural Networks

Jie Song

University:   Repository for Publications and Research Data (ETH Zurich) Year: 2020
BOOK-CHAPTER

Human Pose Tracking Using Multi-level Structured Models

Mun Wai LeeRam Nevatia

Lecture notes in computer science Year: 2006 Pages: 368-381
© 2026 ScienceGate Book Chapters — All rights reserved.