JOURNAL ARTICLE

DT-NeRF: Decomposed Triplane-Hash Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Abstract

We present decomposed triplane-hash neural radiance fields (DT-NeRF), a framework that significantly improves the photorealistic rendering of talking faces and achieves state-of-the-art results on key evaluation datasets. Our architecture decomposes the facial region into two specialized triplanes: one for the mouth and the other for the broader facial features. We introduce audio features as residual terms and integrate them into the model as query vectors through an audio-mouthface transformer. In addition, our method uses Neural Radiance Fields (NeRF) to enrich the volumetric representation of the entire face through additive volumetric rendering. Comprehensive experimental evaluations corroborate the effectiveness and superiority of the proposed approach.
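
To make the rendering step concrete, the following is a minimal sketch of standard NeRF volume-rendering quadrature along one ray, applied to two fields at once. It is not the authors' implementation. The abstract does not specify how the mouth and face branches are merged, so the sketch assumes their densities are summed and their colors are averaged with density weights. The function name `composite_ray` and all of its parameters are hypothetical.

```python
import math


def composite_ray(sigmas_face, sigmas_mouth, colors_face, colors_mouth, deltas):
    """Composite one ray from two radiance fields.

    ASSUMPTION (not from the paper): the two fields are merged per sample by
    summing densities and taking a density-weighted average of RGB colors.
    The accumulation itself is the standard NeRF quadrature:
        alpha_i = 1 - exp(-sigma_i * delta_i)
        weight_i = T_i * alpha_i,  T_{i+1} = T_i * (1 - alpha_i)
    Returns (rgb_pixel, accumulated_opacity).
    """
    transmittance = 1.0  # fraction of light that has not yet been absorbed
    pixel = [0.0, 0.0, 0.0]
    samples = zip(sigmas_face, sigmas_mouth, colors_face, colors_mouth, deltas)
    for s_face, s_mouth, c_face, c_mouth, delta in samples:
        sigma = s_face + s_mouth  # assumed additive combination of densities
        if sigma > 0.0:
            color = [(s_face * a + s_mouth * b) / sigma
                     for a, b in zip(c_face, c_mouth)]
        else:
            color = [0.0, 0.0, 0.0]  # empty space contributes nothing
        alpha = 1.0 - math.exp(-sigma * delta)
        weight = transmittance * alpha
        pixel = [p + weight * c for p, c in zip(pixel, color)]
        transmittance *= 1.0 - alpha
    return pixel, 1.0 - transmittance


if __name__ == "__main__":
    # Two samples along a ray: the first is mostly "face", the second mostly "mouth".
    rgb, opacity = composite_ray(
        sigmas_face=[2.0, 0.1],
        sigmas_mouth=[0.0, 3.0],
        colors_face=[[0.8, 0.6, 0.5], [0.8, 0.6, 0.5]],
        colors_mouth=[[0.5, 0.1, 0.1], [0.5, 0.1, 0.1]],
        deltas=[0.5, 0.5],
    )
    print(rgb, opacity)
```

Under this assumption, a sample where only one branch has non-zero density reduces to ordinary single-field NeRF compositing, so each branch can dominate the region it models.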

Keywords:
Radiance; Rendering (computer graphics); Residual; Artificial neural network; Deep neural networks; Representation (politics); Face (sociological concept)

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 0
Citation Normalized Percentile: 0.34
