Abstract

This paper addresses the challenge of reconstructing a scene with a neural radiance field (NeRF) from multiple modalities for robot vision and scene understanding. NeRF represents a scene as a 3-D radiance field that is optimized by casting rays and rendering them against 2-D RGB images, which allows novel views of complex scenes to be synthesized. However, RGB images alone leave geometric ambiguities for transparent objects and complex scenes, so the recovered 3-D shapes can be inaccurate. We address this problem by feeding multiple modalities to the same NeRF model, building a multimodal NeRF that adds point cloud and infrared image supervision to reduce this ambiguity. Unlike the RGB images, the infrared images and point clouds are typically captured by separate cameras that are not aligned with the RGB camera. We therefore align the modalities before training, using point cloud registration to estimate the relative transformation matrices between them. We evaluate our model on selected scenes from the ScanNet and M2DGR datasets and demonstrate that it outperforms existing state-of-the-art methods.
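
The ray casting and rendering step mentioned in the abstract can be illustrated with the standard NeRF volume-rendering quadrature. The sketch below is not taken from the paper: the function name `composite_ray` and its arguments are hypothetical, and it only shows how per-sample densities and colors along one ray are usually composited into a pixel color.

```python
import math

def composite_ray(densities, colors, deltas):
    """Composite samples along one ray into an RGB value.

    densities: volume density (sigma) at each sample, ordered near to far
    colors:    (r, g, b) tuple at each sample
    deltas:    distance between consecutive samples
    Hypothetical helper for illustration, not the paper's implementation.
    """
    transmittance = 1.0          # fraction of light that has survived so far
    weights = []
    pixel = [0.0, 0.0, 0.0]
    for sigma, rgb, delta in zip(densities, colors, deltas):
        # Opacity contributed by this sample's segment of the ray.
        alpha = 1.0 - math.exp(-sigma * delta)
        weight = transmittance * alpha
        weights.append(weight)
        for channel in range(3):
            pixel[channel] += weight * rgb[channel]
        # Light remaining after passing through this sample.
        transmittance *= 1.0 - alpha
    return pixel, weights
```

In this formulation an empty sample (zero density) contributes nothing, while a very dense sample absorbs all remaining transmittance, so the pixel takes the color of the first opaque surface along the ray. With RGB supervision alone, different density profiles can produce the same pixel color, which is one way to see the geometric ambiguity the abstract describes.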

Keywords:
Radiance; Computer vision; Artificial intelligence; Computer science; Point cloud; RGB color model; Rendering (computer graphics); Modalities; Modality (human–computer interaction); Computer graphics (images); Remote sensing; Geography

Metrics

Cited By: 13
FWCI (Field Weighted Citation Impact): 4.37
Refs: 50
Citation Normalized Percentile: 0.91

Topics

3D Shape Modeling and Analysis
Physical Sciences →  Engineering →  Computational Mechanics
Computer Graphics and Visualization Techniques
Physical Sciences →  Computer Science →  Computer Graphics and Computer-Aided Design
Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
