This repository collects summaries of over 300 recent studies on 3D scene generation and its downstream applications, and is continuously updated.
If you have suggestions for new resources, improvements to methodologies, or corrections for broken links, please don't hesitate to open an issue or submit a pull request. Contributions of all kinds are welcome and greatly appreciated.
- Methods: A Hierarchical Taxonomy
- Datasets
- Applications and Tasks

Year | Venue | Acronym | Paper | Project | Repo@GitHub |
---|---|---|---|---|---|
2024 | SIGGRAPH | Streetscapes | Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion | ||
2024 | NeurIPS | 4Real | 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | ||
2024 | arXiv | VividDream | VividDream: Generating 3D Scene with Ambient Dynamics | ||
2024 | arXiv | DimensionX | DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion | ||
2024 | arXiv | PaintScene4D | PaintScene4D: Consistent 4D Scene Generation from Text Prompts | ||
2025 | ICLR | GenXD | GenXD: Generating Any 3D and 4D Scenes | ||
2025 | CVPR | StarGen | StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation | ||
2025 | TMM | DreamJourney | DreamJourney: Perpetual View Generation with Video Diffusion Models | ||
2025 | arXiv | Free4D | Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency | ||

Year | Type | Source | Acronym | Paper | Project |
---|---|---|---|---|---|
2017 | Nature | Real | Laval Outdoor | Deep Sky Modeling for Single Image Outdoor Lighting Estimation | |
2019 | Nature | Real | LHQ | Aligning latent and image spaces to connect the unconnectable | |
2021 | Nature | Real | ACID | Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image | |

Year | Venue | Acronym | Paper | Project | Repo@GitHub |
---|---|---|---|---|---|
2022 | CVPR | | Towards Diverse and Natural Scene-aware 3D Human Motion Synthesis | ||
2022 | ECCV | COINS | Compositional Human-Scene Interaction Synthesis with Semantic Control | ||
2023 | CVPR | SceneDiffuser | Diffusion-based Generation, Optimization, and Planning in 3D Scenes | ||
2023 | ICCV | DIMOS | Synthesizing Diverse Human Motions in 3D Indoor Scenes | ||
2023 | SIGGRAPH | InterPhys | Synthesizing Physical Character-Scene Interactions | ||
2024 | 3DV | InterScene | Synthesizing Physically Plausible Human Motions in 3D Scenes | ||
2024 | CVPR | GenZI | GenZI: Zero-Shot 3D Human-Scene Interaction Generation | ||
2024 | ICLR | UniHSI | UniHSI: Unified Human-Scene Interaction via Prompted Chain-of-Contacts | ||
2024 | arXiv | SIMS | SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation | ||
2025 | CVPR | TokenHSI | TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization | ||

Year | Venue | Acronym | Paper | Project | Repo@GitHub |
---|---|---|---|---|---|
2022 | NeurIPS | ProcTHOR | ProcTHOR: Large-Scale Embodied AI Using Procedural Generation | ||
2024 | CVPR | Holodeck | Holodeck: Language Guided Generation of 3D Embodied AI Environments | ||
2024 | CVPR | PhyScene | PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI | ||
2024 | NeurIPS | Architect | Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting | ||
2024 | arXiv | GRUtopia | GRUtopia: Dream General Robots in a City at Scale | ||
2024 | arXiv | EmbodiedCity | EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment | ||
2024 | arXiv | InfiniteWorld | InfiniteWorld: A Unified Scalable Simulation Framework for General Visual-Language Robot Interaction | ||
2025 | ICLR | MetaUrban | MetaUrban: An Embodied AI Simulation Platform for Urban Micromobility | ||