Self-avatar: Monocular 3D human reconstruction from RGB image

Ruixiao Zhang

doi:10.54254/2755-2721/41/20230715

ScienceGate Book Chapters

JOURNAL ARTICLE

Self-avatar: Monocular 3D human reconstruction from RGB image

Ruixiao Zhang

Year: 2024 Journal: Applied and Computational Engineering Vol: 41 (1)Pages: 89-98

DOI: 10.54254/2755-2721/41/20230715

Get Full-Text PDF Get Analytical Report

Abstract

3D human position and shape estimate are crucial in many computer vision applications. Despite the fact that there are numerous deep learning techniques designed to handle this problem, they frequently only use training networks with RGB images from a single point of view. In this paper, a unique approach to solve this issue is proposed by combining a regression-based multi-view picture learning loop with an optimization-based multi-view model. This is because some public datasets are collected by multi-view camera systems. A parameterized human body model's position and shape parameters are initially deduced by a convolutional neural network (CNN) from multi-view photos. This work then introduces an enhanced multi-view optimization method called MV-SMPLify, which aligns the SMPL model with multi-view images by using the regressed pose and shape as beginning values. Following that, the CNN model's training can be monitored using the optimum parameters. The Self-avatar project as a whole is a self-supervised framework that combines the advantages of both the CNN method and the optimization-based strategy. Additionally, the use of multi-view photos improves thorough supervision during training. This methodology outperforms earlier methods in a variety of ways, according to qualitative and quantitative testing using open datasets.

Keywords:

Computer science Artificial intelligence Convolutional neural network Avatar Monocular RGB color model Computer vision Deep learning Point (geometry) Position (finance) Artificial neural network Parameterized complexity Machine learning Human–computer interaction Algorithm Mathematics

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.02

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Advanced Vision and Imaging

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Self-avatar: Monocular 3D human reconstruction from RGB image

Abstract

Metrics

Topics

Related Documents

SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video

Adaptive mesh-aligned Gaussian Splatting for monocular human avatar reconstruction

Relightable and Dynamic Gaussian Avatar Reconstruction from Monocular Video

3D Reconstruction of Human Motion from Monocular Image Sequences

GaussianAvatar: Human avatar Gaussian splatting from monocular videos