JOURNAL ARTICLE

Self-avatar: Monocular 3D human reconstruction from RGB image

Ruixiao Zhang

Year: 2024 Journal:   Applied and Computational Engineering Vol: 41 (1)Pages: 89-98

Abstract

3D human position and shape estimate are crucial in many computer vision applications. Despite the fact that there are numerous deep learning techniques designed to handle this problem, they frequently only use training networks with RGB images from a single point of view. In this paper, a unique approach to solve this issue is proposed by combining a regression-based multi-view picture learning loop with an optimization-based multi-view model. This is because some public datasets are collected by multi-view camera systems. A parameterized human body model's position and shape parameters are initially deduced by a convolutional neural network (CNN) from multi-view photos. This work then introduces an enhanced multi-view optimization method called MV-SMPLify, which aligns the SMPL model with multi-view images by using the regressed pose and shape as beginning values. Following that, the CNN model's training can be monitored using the optimum parameters. The Self-avatar project as a whole is a self-supervised framework that combines the advantages of both the CNN method and the optimization-based strategy. Additionally, the use of multi-view photos improves thorough supervision during training. This methodology outperforms earlier methods in a variety of ways, according to qualitative and quantitative testing using open datasets.

Keywords:
Computer science Artificial intelligence Convolutional neural network Avatar Monocular RGB color model Computer vision Deep learning Point (geometry) Position (finance) Artificial neural network Parameterized complexity Machine learning Human–computer interaction Algorithm Mathematics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
21
Refs
0.02
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video

Boyi JiangH. C. YangHujun BaoJuyong Zhang

Journal:   2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Year: 2022 Pages: 5595-5605
JOURNAL ARTICLE

3D Reconstruction of Human Motion from Monocular Image Sequences

Bastian WandtHanno AckermannBodo Rosenhahn

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2016 Vol: 38 (8)Pages: 1505-1516
JOURNAL ARTICLE

GaussianAvatar: Human avatar Gaussian splatting from monocular videos

Haoheng LinYinwei Zhan

Journal:   Computers & Graphics Year: 2024 Vol: 126 Pages: 104155-104155
© 2026 ScienceGate Book Chapters — All rights reserved.