JOURNAL ARTICLE

SceneFormer: Indoor Scene Generation with Transformers

Xinpeng WangChandan YeshwanthMatthias Niesner

Year: 2021 Journal:   2021 International Conference on 3D Vision (3DV) Pages: 106-115

Abstract

We address the task of indoor scene generation by generating a sequence of objects, along with their locations and orientations conditioned on a room layout. Large-scale indoor scene datasets allow us to extract patterns from user-designed indoor scenes, and generate new scenes based on these patterns. Existing methods rely on the 2D or 3D appearance of these scenes in addition to object positions, and make assumptions about the possible relations between objects. In contrast, we do not use any appearance information, and implicitly learn object relations using the self-attention mechanism of transformers. We show that our model design leads to faster scene generation with similar or improved levels of realism compared to previous methods. Our method is also flexible, as it can be conditioned not only on the room layout but also on text descriptions of the room, using only the cross-attention mechanism of transformers. Our user study shows that our generated scenes are preferred to the state-of-the-art FastSynth scenes 53.9% and 56.7% of the time for bedroom and living room scenes, respectively. At the same time, we generate a scene in 1.48 seconds on average, 20% faster than FastSynth.

Keywords:
Computer science Transformer Computer vision Artificial intelligence Bedroom Living room Object (grammar) Computer graphics (images) Engineering Architectural engineering Voltage

Metrics

93
Cited By
4.05
FWCI (Field Weighted Citation Impact)
49
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

SceneFormer: Neural Architecture Search of Transformers for Remote Sensing Scene Classification

Lyuyang TongJie LiuBo Du

Journal:   IEEE Transactions on Geoscience and Remote Sensing Year: 2025 Vol: 63 Pages: 1-15
JOURNAL ARTICLE

Context-aware Scene Graph Generation with Seq2Seq Transformers

Yichao LuHimanshu RaiJason S. ChangB. A. KnyazevGuangwei YuShashank ShekharGraham W. TaylorMaksims Volkovs

Journal:   2021 IEEE/CVF International Conference on Computer Vision (ICCV) Year: 2021 Pages: 15911-15921
JOURNAL ARTICLE

Composite Relationship Fields with Transformers for Scene Graph Generation

George AdaimiDavid MizrahiAlexandre Alahi

Journal:   2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Year: 2023 Pages: 52-64
© 2026 ScienceGate Book Chapters — All rights reserved.