LEAD: Latent Realignment for Human Motion Diffusion

N. Andreou; Xi Wang; Victoria Fernández Abrevaya; Marie‐Paule Cani; Yiorgos Chrysanthou; Vicky Kalogeiton

doi:10.1111/cgf.70093

JOURNAL ARTICLE

Year: 2025 Journal: Computer Graphics Forum Publisher: Wiley

Abstract

Our goal is to generate realistic human motion from natural language. Modern methods often face a trade‐off between model expressiveness and text‐to‐motion (T2M) alignment. Some align text and motion latent spaces but sacrifice expressiveness; others rely on diffusion models producing impressive motions but lacking semantic meaning in their latent space. This may compromise realism, diversity and applicability. Here, we address this by combining latent diffusion with a realignment mechanism, producing a novel, semantically structured space that encodes the semantics of language. Leveraging this capability, we introduce the task of textual motion inversion to capture novel motion concepts from a few examples. For motion synthesis, we evaluate LEAD on HumanML3D and KIT‐ML and show comparable performance to the state‐of‐the‐art in terms of realism, diversity and text‐motion consistency. Our qualitative analysis and user study reveal that our synthesised motions are sharper, more human‐like and comply better with the text compared to modern methods. For motion textual inversion (MTI), our method demonstrates improvements in capturing out‐of‐distribution characteristics in comparison to traditional VAEs.

Keywords:

Computer science; Motion (physics); Diffusion; Computer graphics (images); Lead (geology); Computer vision; Artificial intelligence; Geology; Physics

Metrics

FWCI (Field Weighted Citation Impact): 0.00

Citation Normalized Percentile: 0.09

Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Human Motion and Animation

Physical Sciences → Engineering → Control and Systems Engineering

Video Analysis and Summarization

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction

FineMLD: A fine-grained motion latent diffusion for human motion prediction in Human–robot Collaboration

HuTuMotion: Human-Tuned Navigation of Latent Motion Diffusion Models with Minimal Feedback

Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction

Towards Realistic Human Motion Prediction with Latent Diffusion and Physics-Based Models