JOURNAL ARTICLE

MMNeRF: Multi-Modal and Multi-View Optimized Cross-Scene Neural Radiance Fields

Qi Zhang, Bo Han Wang, Ming Chuan Yang, Hang Zou

Year: 2023 Journal: IEEE Access Vol: 11 Pages: 27401-27413 Publisher: Institute of Electrical and Electronics Engineers

Abstract

We present MMNeRF, a simple yet powerful learning framework for highly photo-realistic novel view synthesis that learns multi-modal and multi-view features to guide neural radiance fields toward a generic model. Novel view synthesis has improved greatly with the success of NeRF-series methods. However, making such methods generalize across scenes remains a challenging task. A promising approach is to introduce 2D image features as prior knowledge for adaptive modeling, yet RGB features lack geometry and 3D spatial information, which causes shape-radiance ambiguity and leads to blurry, low-resolution synthesized images. We propose a multi-modal, multi-view method to address these shortcomings of existing methods. Specifically, we introduce depth features alongside RGB features into the model and fuse these multi-modal features effectively with modality-based attention. Furthermore, our framework adopts a transformer encoder to fuse multi-view features and uses a transformer decoder to adaptively incorporate the target view with global memory. Extensive experiments are carried out on both category-specific and category-agnostic benchmarks, and the results demonstrate that MMNeRF achieves state-of-the-art neural rendering performance.
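
The modality-based attention mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how the attention scores are computed, so the scaled dot product against a `query` vector, the function name `fuse_modalities`, and the toy feature values are all assumptions made for illustration. The sketch only shows the general idea of weighting an RGB feature vector and a depth feature vector with softmax-normalized attention weights before summing them.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def fuse_modalities(rgb_feat, depth_feat, query):
    """Fuse one RGB and one depth feature vector with attention weights.

    Assumption: each modality is scored by a scaled dot product with a
    (hypothetical) query vector; the abstract does not give the scoring rule.
    """
    scale = math.sqrt(len(query))
    scores = [dot(query, rgb_feat) / scale, dot(query, depth_feat) / scale]
    w_rgb, w_depth = softmax(scores)
    # Weighted sum of the two modalities, element by element.
    fused = [w_rgb * r + w_depth * d for r, d in zip(rgb_feat, depth_feat)]
    return fused, (w_rgb, w_depth)


# Toy example with made-up 4-dimensional features.
rgb = [0.2, 0.8, 0.1, 0.5]
depth = [0.9, 0.1, 0.4, 0.3]
query = [1.0, 0.0, 1.0, 0.0]
fused, weights = fuse_modalities(rgb, depth, query)
print(weights, fused)
```

In a real model the query and the feature extractors would be learned, and the fusion would run per sampled 3D point and per source view; the sketch fixes them by hand to keep the weighting step visible.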

Keywords:
Computer science, Artificial intelligence, Radiance, Computer vision, Encoder, Ambiguity, Modal, View synthesis, Fuse (electrical), Transformer, Rendering (computer graphics), RGB color model, Remote sensing, Voltage

Metrics

Cited By: 3
FWCI (Field Weighted Citation Impact): 0.55
Refs: 53
Citation Normalized Percentile: 0.58

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Computer Graphics and Visualization Techniques
Physical Sciences →  Computer Science →  Computer Graphics and Computer-Aided Design
3D Shape Modeling and Analysis
Physical Sciences →  Engineering →  Computational Mechanics

Related Documents

JOURNAL ARTICLE

Neural radiance fields-based multi-view endoscopic scene reconstruction for surgical simulation

Zhibao Qin, Kai Qian, Shaojun Liang, Qinhong Zheng, Jun Peng, Yonghang Tai

Journal: International Journal of Computer Assisted Radiology and Surgery Year: 2024 Vol: 19 (5) Pages: 951-960

JOURNAL ARTICLE

Multi-scene Representation Learning with Neural Radiance Fields

Bofeng Fu, Zheng Wang

Journal: Journal of Physics Conference Series Year: 2021 Vol: 1880 (1) Pages: 012034-012034

BOOK-CHAPTER

CDNeRF: A Multi-modal Feature Guided Neural Radiance Fields

Qi Zhang, Qiaoqiao Liu, Hang Zou

Lecture Notes in Computer Science Year: 2022 Pages: 204-215