Skip to content

hzxie/Awesome-3D-Scene-Generation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

45 Commits
 
 

Repository files navigation

Awesome Logo Counter arXiv YouTube PR's Welcome

Overview

This repository collects summaries of over 300 recent studies on 3D scene generation, along with the downstream applications, and will be continuously updated.

If you have suggestions for new resources, improvements to methodologies, or corrections for broken links, please don't hesitate to open an issue or submit a pull request. Contributions of all kinds are welcome and greatly appreciated.

3D-Scene-Generation-Teaser

Table of Contents

Methods: A Hierarchical Taxonomy

Procedural Generation

Rule-based Generation

Year Venue Acronym Paper Project Repo@GitHub
1988 SIGGRAPH Terrain simulation using a model of stream erosion
1989 SIGGRAPH The synthesis and rendering of eroded fractal terrains
1993 Graphics Interface A fractal model of mountains and rivers link
1998 SIGGRAPH Realistic modeling and rendering of plant ecosystems link
2001 SIGGRAPH CityEngine Procedural modeling of cities
2005 VRST Modeling Landscapes with Ridges and Rivers
2006 TOG Procedural modeling of buildings
2007 GDTW Citygen Citygen: An Interactive System for Procedural City Generation link
2007 I3D Example-based model synthesis link GitHub
2007 TVCG Terrain Synthesis from Digital Elevation Models link
2008 CGF Real-Time Rendering and Editing of Vector-based Terrains link
2008 TOG Continuous model synthesis link
2008 TOG Interactive Procedural Street Modeling link
2009 CGF Arches: a Framework for Modeling Complex Terrains
2009 CGF Interactive Geometric Simulation of 4D Cities
2009 TOG Interactive design of urban spaces using geometrical and behavioral modeling youtube
2010 CGF Procedural Generation of Roads
2011 CGF Interactive Modeling of City Layouts using Layers of Procedural Content
2011 SI3D Urban Ecosystem Design
2011 TOG Metropolis procedural modeling link
2012 CGF Procedural Generation of Parcels in Urban Modeling
2012 TOG Inverse design of urban procedural models link
2013 TOG Terrain Generation Using Procedural Models Based on Hydrology youtube
2013 TOG Urban Pattern Urban Pattern: Layout Design by Hierarchical Domain Splitting link
2015 TOG WorldBrush WorldBrush: Interactive Example-Based Synthesis of Procedural Virtual Worlds link
2016 CGF Example-Driven Procedural Urban Roads
2016 3DV Proceduralization for Editing 3D Architectural Models
2016 TOG Interactive Sketching of Urban Procedural Models link
2017 TOG Authoring landscapes by combining ecosystem and terrain erosion simulation
2017 TOG Fast Weather Simulation for Inverse Procedural Design of 3D Urban Models link GitHub
2017 TOG Interactive Example-Based Terrain Authoring with Conditional Generative Adversarial Networks link
2019 TOG Synthetic Silviculture: Multi-scale Modeling of Plant Ecosystems link
2021 TOG Authoring Consistent Landscapes with Flora and Fauna link
2022 TOG Ecoclimates Ecoclimates: Climate-Response Modeling of Vegetation link
2022 TOG Procedural Urban Forestry link
2023 CVPR Infinigen Infinite Photorealistic Worlds using Procedural Generation link GitHub
2023 TOG Forming Terrains by Glacial Erosion link
2023 TOG Large-scale terrain authoring through interactive erosion simulation youtube GitHub
2023 TOG Authoring and Simulating Meandering Rivers link GitHub
2025 CVPRW Proc-GS Proc-GS: Procedural Building Generation for City Assembly with 3D Gaussians link GitHub

Optimization-based Generation

Year Venue Acronym Paper Project Repo@GitHub
2002 Graphics Interface Constraint-based Automatic Placement for Scene Composition link
2010 TOG Computer-Generated Residential Building Layouts link
2011 SIGGRAPH Interactive Furniture Layout Using Interior Design Guidelines link
2011 SIGGRAPH Make it home Make it home: automatic optimization of furniture arrangement link
2012 TOG Example-based synthesis of 3D object arrangements link
2015 TVCG Clutterpalette The Clutterpalette: An Interactive Tool for Detailing Indoor Scenes link
2018 CGF MIQP-based Layout Design for Building Interiors link
2018 CVPR Human-centric Indoor Scene Synthesis Using Stochastic Grammar GitHub
2018 VR Automatic Furniture Arrangement Using Greedy Cost Minimization youtube GitHub
2021 MM MageAdd MageAdd: Real-Time Interaction Simulation for Scene Synthesis GitHub
2021 TVCG Fast 3D Indoor Scene Synthesis by Learning Spatial Relation Priors of Objects
2021 arXiv LUMINOUS LUMINOUS: Indoor Scene Generation for Embodied AI Challenges GitHub
2022 NeurIPS ProcTHOR ProcTHOR: Large-Scale Embodied AI Using Procedural Generation link GitHub
2024 CVPR Infinigen Indoors Infinigen Indoors: Photorealistic Indoor Scenes using Procedural Generation link GitHub

LLM-based Generation

Year Venue Acronym Paper Project Repo@GitHub
2023 NeurIPS LayoutGPT LayoutGPT: Compositional Visual Planning and Generation with Large Language Models link GitHub
2024 CVPR GraphDreamer GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs link GitHub
2024 ECCV AnyHome AnyHome: Open-Vocabulary Generation of Structured and Textured 3D Homes link GitHub
2024 ECCV SceneTeller SceneTeller: Language-to-3D Scene Generation link GitHub
2024 ICML SceneCraft SceneCraft: An LLM Agent for Synthesizing 3D Scenes as Blender Code
2024 MM Controllable Procedural Generation of Landscapes GitHub
2024 SIGGRAPH Asia DIScene DIScene: Object Decoupling and Interaction Modeling for Complex Scene Generation link
2024 arXiv Open-Universe Indoor Scene Generation using LLM Program Synthesis and Uncurated Object Databases
2024 arXiv I-Design I-Design: Personalized LLM Interior Designer link GitHub
2024 arXiv LLplace LLplace: The 3D Indoor Scene Layout Generation and Editing via Large Language Model
2024 arXiv CityCraft CityCraft: A Real Crafter for 3D City Generation GitHub
2024 arXiv CityX CityX: Controllable Procedural Content Generation for Unbounded 3D Cities link GitHub
2024 arXiv GraphCanvas3D Graph Canvas for Controllable 3D Scene Generation
2024 arXiv UrbanWorld UrbanWorld: An Urban World Model for 3D City Generation GitHub
2025 3DV 3D-GPT 3D-GPT: Procedural 3D Modeling with Large Language Models link GitHub
2025 AAAI SceneX SceneX: Procedural Controllable Large-scale Scene Generation link GitHub
2025 AAAI Hierarchically-Structured Open-Vocabulary Indoor Scene Synthesis with Pre-trained Large Language Model GitHub
2025 CVPR Global-Local Tree Search in VLMs for 3D Indoor Scene Generation GitHub
2025 CVPR LayoutVLM LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models link GitHub
2025 CVPR The Scene Language The Scene Language: Representing Scenes with Programs, Words, and Embeddings link GitHub
2025 arXiv WorldCraft WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents
2025 arXiv Cube Cube: A Roblox View of 3D Intelligence GitHub
2025 arXiv Scenethesis Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation link
2025 arXiv Agentic 3D Scene Generation with Spatially Contextualized VLMs
2025 arXiv ReSpace ReSpace: Text-Driven 3D Scene Synthesis and Editing with Preference Alignment link GitHub
2025 arXiv DirectLayout Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning link GitHub
2025 ACL Findings UnrealLLM UnrealLLM: Towards Highly Controllable and Interactable 3D Scene Generation by LLM-powered Procedural Content Generation

Neural-3D Generation

Scene Parameters

Year Venue Acronym Paper Project Repo@GitHub
2018 SIGGRAPH DeepSynth Deep Convolutional Priors for Indoor Scene Synthesis link GitHub
2019 CVPR FastSynth Fast and Flexible Indoor Scene Synthesis via Deep Convolutional Generative Models GitHub
2020 SIGGRAPH Deep Generative Modeling for Scene Synthesis via Hybrid Representations
2021 3DV SceneFormer SceneFormer: Indoor Scene Generation with Transformers link GitHub
2021 ICCV Sync2Gen Scene Synthesis via Uncertainty-Driven Attribute Synchronization GitHub
2021 NeurIPS ATISS ATISS: Autoregressive Transformers for Indoor Scene Synthesis link GitHub
2022 ECCV Pose2Room Pose2Room: Understanding 3D Scenes from Human Activities link GitHub
2022 SIGGRAPH Asia SUMMON Scene Synthesis from Human Motion link GitHub
2023 CVPR Learning 3D Scene Priors with 2D Supervision link GitHub
2023 CVPR MIME MIME: Human-Aware 3D Scene Generation link GitHub
2023 SIGGRAPH COFS COFS: COntrollable Furniture layout Synthesis
2023 NeurIPS Language-driven Scene Synthesis using Multi-conditional Diffusion Model link GitHub
2024 3DV RoomDesigner RoomDesigner: Encoding Anchor-latents for Style-consistent and Shape-compatible Indoor Scene Generation GitHub
2024 CVPR DiffuScene DiffuScene: Denoising Diffusion Models for Generative Indoor Scene Synthesis link GitHub
2024 CVPR SceneWiz3D SceneWiz3D: Towards Text-guided 3D Scene Composition link GitHub
2024 CVPR PhyScene PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI link GitHub
2024 ECCV DreamScene DreamScene: 3D Gaussian-Based Text-to-3D Scene Generation via Formation Pattern Sampling link GitHub
2024 ICML GALA3D GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting link GitHub
2024 ICML Disentangled 3D Scene Generation with Layout Learning link
2024 MM RelScene RelScene: A Benchmark and baseline for Spatial Relations in text-driven 3D Scene Generation
2024 NeurIPS DeBaRA DeBaRA: Denoising-Based 3D Room Arrangement Generation
2024 SIGGRAPH INFERACT Physics-based Scene Layout Generation From Human Motion link
2024 arXiv Lay-A-Scene Lay-A-Scene: Personalized 3D Object Arrangement Using Text-to-Image Priors link GitHub
2025 3DV Ctrl-Room Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints link GitHub
2025 CVPR SceneFactor SceneFactor: Factored Latent 3D Diffusion for Controllable 3D Scene Generation link GitHub
2025 CVPR CASAGPT CASAGPT: Cuboid Arrangement and Scene Assembly for Interior Design GitHub
2025 arXiv Steerable Scene Generation with Post Training and Inference-Time Search link GitHub

Scene Graph

Year Venue Acronym Paper Project Repo@GitHub
2014 EMNLP Learning Spatial Knowledge for Text to 3D Scene Generation
2016 CGF Learning 3D Scene Synthesis from Annotated RGB-D Images
2017 TOG Adaptive synthesis of indoor scenes via activity-associated object relation graphs youtube
2018 TOG Language-Driven Synthesis of 3D Scenes from Scene Databases link
2019 ICCV Meta-Sim Meta-Sim: Learning to Generate Synthetic Datasets link GitHub
2019 SIGGRAPH GRAINS GRAINS: Generative Recursive Autoencoders for INdoor Scenes link GitHub
2019 SIGGRAPH PlanIT PlanIT: Planning and Instantiating Indoor Scenes with Relation Graph and Spatial Prior Networks GitHub
2020 CVPR 3D-SLN End-to-End Optimization of Scene Layout link GitHub
2020 ECCV Meta-Sim 2 Meta-Sim 2 Unsupervised Learning of Scene Structure for Synthetic Data Generation link GitHub
2021 ICCV Graph-to-3D Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs link GitHub
2023 NeurIPS CommonScenes CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion link GitHub
2023 TPAMI SceneHGN SceneHGN: Hierarchical Graph Networks for 3D Indoor Scene Generation With Fine-Grained Geometry link GitHub
2024 ECCV SEK External Knowledge Enhanced 3D Scene Generation from Sketch
2024 ECCV Forest2Seq Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
2024 ECCV EchoScene EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion link GitHub
2024 ICLR InstructScene InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior link GitHub
2025 AAAI MMGDreamer MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation link GitHub
2025 CVPR FreeScene FreeScene: Mixed Graph Diffusion for 3D Scene Synthesis from Free Prompts link GitHub
2025 arXiv Controllable 3D Outdoor Scene Generation via Scene Graphs GitHub
2025 arXiv HiScene HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation link

Semantic Layout

Year Venue Acronym Paper Project Repo@GitHub
2021 ICCV SGSDI Indoor Scene Generation from a Collection of Semantic-Segmented Depth Images link GitHub
2021 ICCV GANcraft GANcraft: Unsupervised 3D Neural Rendering of Minecraft Worlds link GitHub
2023 CVPR DisCoScene DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis link GitHub
2023 ICCV InfiniCity InfiniCity: Infinite-Scale City Synthesis link
2023 ICCV CC3D CC3D: Layout-Conditioned Generation of Compositional 3D Scenes link GitHub
2023 ICCV Set-the-Scene Set-the-Scene: Global-Local Training for Generating Controllable NeRF Scenes link GitHub
2023 ICCV UrbanGIRAFFE UrbanGIRAFFE: Representing Urban Scenes as Compositional Generative Neural Feature Fields link GitHub
2023 TPAMI SceneDreamer SceneDreamer: Unbounded 3D Scene Generation From 2D Image Collections link GitHub
2023 arXiv CompoNeRF CompoNeRF: Text-guided Multi-object Compositional NeRF with Editable 3D Scene Layout link GitHub
2024 3DV Comp3D Compositional 3D Scene Generation using Locally Conditioned Diffusion link
2024 CVPR CityDreamer CityDreamer: Compositional Generative Model of Unbounded 3D Cities link GitHub
2024 CVPR BerfScene BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation link GitHub
2024 NeurIPS SceneCraft SceneCraft: Layout-Guided 3D Scene Generation link GitHub
2024 SIGGRAPH BlockFusion BlockFusion: Expandable 3D Scene Generation Using Latent Tri-plane Extrapolation link GitHub
2024 SIGGRAPH Asia Frankenstein Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane link GitHub
2024 arXiv Urban Architect Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior link GitHub
2025 CVPR GaussianCity Generative Gaussian Splatting for Unbounded 3D City Generation link GitHub
2025 ICLR Layout-your-3D Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint link GitHub
2025 arXiv Layout2Scene Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors
2025 arXiv CityDreamer4D CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities link GitHub
2025 arXiv PrITTI PrITTI: Primitive-based Generation of Controllable and Editable 3D Semantic Scenes link GitHub
2025 ICCV Sat2City Sat2City: 3D City Generation from A Single Satellite Image with Cascaded Latent Diffusion link GitHub

Implicit Layout

Year Venue Acronym Paper Project Repo@GitHub
2021 CVPR GIRAFFE GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields GitHub
2021 ICCV GSN Unconstrained Scene Generation With Locally Conditioned Radiance Fields link GitHub
2021 ICML NeRF-VAE NeRF-VAE: A geometry aware 3d scene generative model
2022 NeurIPS GAUDI GAUDI: A Neural Architect for Immersive 3D Scene Generation GitHub
2023 CVPR Persistent Nature Persistent Nature: A generative model of unbounded 3D worlds link GitHub
2023 CVPR NeuralField-LDM NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models link
2023 arXiv Diffusion Probabilistic Models for Scene-Scale 3D Categorical Data link GitHub
2024 CVPR DiffInDScene DiffInDScene: Diffusion-based High-Quality 3D Indoor Scene Generation link GitHub
2024 CVPR XCube XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies link GitHub
2024 CVPR SemCity SemCity: Semantic Scene Generation with Triplane Diffusion link GitHub
2024 ECCV PDD Pyramid Diffusion for Fine 3D Large Scene Generation link GitHub
2024 NeurIPS Director3D Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text link GitHub
2025 CVPR LT3SD LT3SD: Latent Trees for 3D Scene Diffusion link GitHub
2025 CVPR SplatFlow SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis link GitHub
2025 CVPR Prometheus Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation link GitHub
2025 ICLR DynamicCity DynamicCity: Large-Scale Occupancy Generation from Dynamic Scenes link GitHub
2025 arXiv NuiScene NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes link GitHub

Image-based Generation

Holistic Generation

Year Venue Acronym Paper Project Repo@GitHub
2019 ICIP 360-Degree Image Completion by Two-Stage Conditional Gans
2020 CVPR Sat2Ground Geometry-Aware Satellite-to-Ground Image Synthesis for Urban Areas GitHub
2020 WACV 360 Panorama Synthesis from a Sparse Set of Images with Unknown Field of View
2021 AAAI SIG-SS Spherical Image Generation from a Single Image by Considering Scene Symmetry GitHub
2021 CVPR EnvMapNet HDR Environment Map Estimation for Real-Time Augmented Reality link GitHub
2021 ICCV Sat2vid Sat2vid: Street-view panoramic video synthesis from a single satellite image
2022 3DV ImmerseGAN Guided Co-Modulated GAN for 360° Field of View Extrapolation link
2022 CVPR OmniDreamer Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation link GitHub
2022 ECCV BIPS BIPS: Bi-modal Indoor Panorama Synthesis via Residual Depth-aided Adversarial Learning GitHub
2022 SIGGRAPH Asia Text2Light Text2Light: Zero-Shot Text-Driven HDR Panorama Generation link GitHub
2022 TMM PanoGAN Cross-View Panorama Image Synthesis GitHub
2022 TPAMI Sat2Str Geometry-Guided Street-View Panorama Synthesis from Satellite Imagery GitHub
2023 CVPR DiffCollage DiffCollage: Parallel Generation of Large Content with Diffusion Models link
2023 ICCV Sat2Density Sat2Density: Faithful Density Learning from Satellite-Ground Image Pairs link GitHub
2023 MM PanoDiff 360-Degree Panorama Generation from Few Unregistered NFoV Images GitHub
2023 NeurIPS MVDiffusion MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion link GitHub
2023 TPAMI Spherical Image Generation From a Few Normal-Field-of-View Images by Considering Scene Symmetry GitHub
2023 arXiv LDM3D LDM3D: Latent Diffusion Model for 3D
2023 arXiv Diffusion360 Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models GitHub
2024 ICLR PanoDiffusion PanoDiffusion: 360-degree Panorama Outpainting via Diffusion link GitHub
2024 CVPR ControlRoom3D ControlRoom3D 🤖Room Generation using Semantic Proxy Rooms link
2024 CVPR Sat2Scene Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion GitHub
2024 CVPR PanFusion Taming stable diffusion for text to 360â—¦ panorama image generation link GitHub
2024 ECCV DreamScene360 DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting link GitHub
2024 ECCV Geospecific View Generation - Geometry-Context Aware High-resolution Ground View Inference from Satellite Views link
2024 IJCAI FastScene FastScene: Text-Driven Fast Indoor 3D Scene Generation via Panoramic Gaussian Splatting GitHub
2024 NeurIPS DiffPano DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion link GitHub
2024 TPAMI PERF PERF: Panoramic Neural Radiance Field from a Single Panorama link GitHub
2024 TVCG Dream360 Dream360: Diverse and Immersive Outdoor Virtual Scene Creation via Transformer-Based 360° Image Outpainting
2024 WACV StitchDiffusion Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models link GitHub
2024 arXiv HoloDreamer HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions link GitHub
2024 arXiv SceneDreamer360 SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting link GitHub
2025 ICLR CubeDiff CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation link
2025 SIGGRAPH LayerPano3D LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation link GitHub
2025 arXiv A Recipe for Generating 3D Worlds From a Single Image link
2025 arXiv EmbodiedGen EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence link GitHub
2025 arXiv ImmerseGen ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies link
2025 arXiv HunyuanWorld 1.0 HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels link GitHub

Iterative Generation

Year Venue Acronym Paper Project Repo@GitHub
2019 TOG 3D Ken Burns Effect from a Single Image link GitHub
2020 CVPR SynSin SynSin: End-to-end view synthesis from a single image link GitHub
2020 CVPR 3D Photo 3D Photography Using Context-Aware Layered Depth Inpainting link GitHub
2020 CVPR Single-View View Synthesis with Multiplane Images link GitHub
2020 NeurIPS GVS Generative View Synthesis: From Single-view Semantics to Novel-view Images link GitHub
2021 ICCV Worldsheet Worldsheet: Wrapping the World in a 3D Sheet for View Synthesis from a Single Image link GitHub
2021 ICCV InfiniteNature Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image link GitHub
2021 ICCV GFVS Geometry-free view synthesis: Transformers and no 3d priors link GitHub
2021 ICCV Pathdreamer Pathdreamer: A World Model for Indoor Navigation link GitHub
2021 ICCV PixelSynth PixelSynth: Generating a 3D-Consistent Experience from a Single Image link GitHub
2022 CVPR LOTR Look outside the room: Synthesizing a consistent long-term 3d scene video from a single image link GitHub
2022 ECCV InfiniteNature-Zero InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images link GitHub
2022 NeurIPS SGAM SGAM: Building a Virtual 3D World through Simultaneous Generation and Mapping link GitHub
2023 AAAI SE3DS Simple and Effective Synthesis of Indoor 3D Scenes GitHub
2023 CVPR 3D Cinemagraphy 3D Cinemagraphy from a Single Image link GitHub
2023 CVPR Consistent View Synthesis with Pose-Guided Diffusion Models link
2023 ICCV DiffDreamer DiffDreamer: Towards Consistent Unsupervised Single-view Scene Extrapolation with Conditional Diffusion Models link GitHub
2023 ICCV Text2Room Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models link GitHub
2023 ICCV Long-Term Photometric Consistent Novel View Synthesis with Diffusion Models link GitHub
2023 MM Make-It-4D Make-It-4D: Synthesizing a Consistent Long-Term Dynamic Scene Video from a Single Image GitHub
2023 NeurIPS SceneScape SceneScape: Text-Driven Consistent Scene Generation link GitHub
2023 NeurIPS PanoGen PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation link GitHub
2023 arXiv LucidDreamer LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes link GitHub
2023 arXiv Text2Immersion Text2Immersion: Generative Immersive Scene with 3D Gaussians link
2024 AAAI AOG-Net Autoregressive Omni-Aware Outpainting for Open-Vocabulary 360-Degree Image Generation GitHub
2024 CVPR WonderJourney WonderJourney: Going from Anywhere to Everywhere link GitHub
2024 CVPR 3D-SceneDreamer 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation link
2024 ECCV PanoFree PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance link GitHub
2024 MM iControl3D iControl3D: An Interactive System for Controllable 3D Scene Generation GitHub
2024 NeurIPS ODIN From an Image to a Scene: Learning to Imagine the World from a Million 360° Videos link GitHub
2024 NeurIPS CAT3D CAT3D: Create Anything in 3D with Multi-View Diffusion Models link
2024 TVCG Text2NeRF Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields link GitHub
2024 arXiv OPa-Ma OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting GitHub
2024 arXiv Scene123 Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE link GitHub
2025 3DV RealmDreamer RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion link GitHub
2025 3DV Invisible Stitch Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting link GitHub
2025 AAAI BloomScene BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation GitHub
2025 CVPR WonderWorld WonderWorld: Interactive 3D Scene Generation from a Single Image link GitHub
2025 CVPR ArtiScene ArtiScene: Language-Driven Artistic 3D Scene Generation Through Image Intermediary link GitHub
2025 ICLR 3D-MOM Optimizing 4D Gaussians for Dynamic Scene Video from Single Landscape Images link GitHub
2025 arXiv WonderTurbo WonderTurbo: Generating Interactive 3D World in 0.72 Seconds link GitHub
2025 arXiv Bolt3D Bolt3D: Generating 3D Scenes in Seconds link
2025 arXiv SynCity SynCity: Training-Free Generation of 3D Worlds link GitHub

Video-based Generation

Two-stage Generation

Year Venue Acronym Paper Project Repo@GitHub
2024 SIGGRAPH Streetscapes Streetscapes Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion link
2024 NeurIPS 4Real 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models link GitHub
2024 arXiv VividDream VividDream: Generating 3D Scene with Ambient Dynamics
2024 arXiv DimensionX DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion link GitHub
2024 arXiv PaintScene4D PaintScene4D: Consistent 4D Scene Generation from Text Prompts link GitHub
2025 ICLR GenXD GenXD: Generating Any 3D and 4D Scenes link GitHub
2025 CVPR StarGen StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation link GitHub
2025 TMM DreamJourney DreamJourney: Perpetual View Generation with Video Diffusion Models link GitHub
2025 arXiv Free4D Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency link GitHub

One-stage Generation

Year Venue Acronym Paper Project Repo@GitHub
2023 arXiv GAIA-1 GAIA-1: A Generative World Model for Autonomous Driving link
2023 arXiv ADriver-I ADriver-I: A General World Model for Autonomous Driving
2024 ICLR MagicDrive MagicDrive: Street View Generation with Diverse 3D Geometry Control link GitHub
2024 CVPR Panacea Panacea: Panoramic and Controllable Video Generation for Autonomous Driving link GitHub
2024 CVPR Drive-WM Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving link GitHub
2024 CVPR 360DVD 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model link GitHub
2024 ECCV DriveDreamer DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving link GitHub
2024 ECCV DrivingDiffusion DrivingDiffusion: Layout-Guided Multi-View Driving Scenarios Video Generation with Latent Diffusion Model link GitHub
2024 ECCV WoVoGen WoVoGen: World Volume-Aware Diffusion for Controllable Multi-camera Driving Scene Generation GitHub
2024 NeurIPS Vista Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability link GitHub
2024 arXiv DIAMOND Diffusion for World Modeling: Visual Details Matter in Atari link GitHub
2024 arXiv MagicDrive3D MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes link GitHub
2024 arXiv Delphi Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation link GitHub
2024 arXiv BEVWorld BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space GitHub
2024 arXiv DriveArena DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving link GitHub
2024 arXiv DiVE DiVE: DiT-based Video Generation with Enhanced Control link GitHub
2024 arXiv DreamForge DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes link
2024 arXiv SyntheOcc SyntheOcc: Synthesize Geometric-Controlled Street View Images through 3D Semantic MPIs link GitHub
2024 arXiv MagicDrive-V2 MagicDrive-V2: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control link GitHub
2024 arXiv HoloDrive HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
2024 arXiv CogDriving Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention link
2024 arXiv Imagine360 Imagine360: Immersive 360 Video Generation from Perspective Anchor link GitHub
2024 arXiv InfiniCube InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models link
2024 arXiv DrivingWorld DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT link GitHub
2024 arXiv ViewCrafter ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis link GitHub
2024 arXiv ViewExtrapolator Novel View Extrapolation with Video Diffusion Priors link GitHub
2025 AAAI DriveDreamer-2 DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation link GitHub
2025 ICLR 4K4DGen 4K4DGen: Panoramic 4D Generation at 4K Resolution link
2025 ICLR GameGen-X GameGen-X: Interactive Open-world Game Video Generation link GitHub
2025 ICLR GameNGen Diffusion Models Are Real-Time Game Engines link
2025 ICLR Genex Generative World Explorer link GitHub
2025 ICLR GLAD Glad: A Streaming Scene Generator for Autonomous Driving
2025 CVPR DrivingSphere DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation link GitHub
2025 CVPR StreetCrafter StreetCrafter: Street View Synthesiswith Controllable Video Diffusion Models link GitHub
2025 CVPR DriveScape DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation link
2025 CVPR UniScene UniScene: Unified Occupancy-centric Driving Scene Generation link GitHub
2025 CVPR GEM GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control link GitHub
2025 CVPR UMGen Generating Multimodal Driving Scenes via Next-Scene Prediction link GitHub
2025 CVPR CAT4D CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models link
2025 CVPR Wonderland Wonderland: Navigating 3D Scenes from a Single Image link GitHub
2025 CVPR VideoScene VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step link GitHub
2025 CVPR Scene Splatter Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model link
2025 CVPR DynamicScaler DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes link
2025 ICML AdaWorld AdaWorld: Learning Adaptable World Models with Latent Actions link GitHub
2025 Nature WHAM World and Human Action Models towards gameplay ideation link
2025 arXiv DreamDrive DreamDrive: Generative 4D Scene Modeling from Street View Images link
2025 arXiv MaskGWM MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction link GitHub
2025 arXiv UniFuture Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception link GitHub
2025 arXiv DiST-4D DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation link GitHub
2025 arXiv GAIA-2 GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving link
2025 arXiv SteerX SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering link GitHub
2025 arXiv WonderVerse WonderVerse: Extendable 3D Scene Generation with Video Generative Models
2025 arXiv FlexWorld FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis link GitHub
2025 arXiv GaussVideoDreamer GaussVideoDreamer: 3D Scene Generation with Video Diffusion and Inconsistency-Aware Gaussian Splatting
2025 arXiv WORLDMEM WORLDMEM: Long-term Consistent World Simulation with Memory link GitHub
2025 arXiv HoloTime HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation link GitHub
2025 arXiv MineWorld MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft GitHub
2025 arXiv GameFactory GameFactory: Creating New Games with Generative Interactive Videos link GitHub
2025 arXiv Matrix-Game Matrix-Game: Interactive World Foundation Model link GitHub
2025 arXiv CoGen CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving link
2025 ICCV WonderPlay WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions link
2025 arXiv Dreamland Dreamland: Controllable World Creation with Simulator and Generative Models link
2025 arXiv Voyager Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation
2025 arXiv Matrix-Game Matrix-Game: Interactive World Foundation Model link GitHub
2025 arXiv Hunyuan-GameCraft Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition link
2025 arXiv CoCo4D CoCo4D: Comprehensive and Complex 4D Scene Generation link GitHub
2025 arXiv WonderFree WonderFree: Enhancing Novel View Quality and Cross-View Consistency for 3D Scene Exploration link GitHub
2025 ICCV DynamicVoyager Voyaging into Unbounded Dynamic Scenes from a Single View link GitHub

Datasets

Indoor Datasets

Year Type Source Acronym Paper Project
2012 Indoor, Nature Real SUN360 Recognizing scene viewpoint using panoramic place representation link
2012 Indoor Real NYUv2 Indoor Segmentation and Support Inference From RGBD Images link
2015 Indoor Real SunRGBD Sun RGB-D: A RGB-D scene understanding benchmark suite link
2016 Indoor Real SceneNN SceneNN: A Scene Meshes Dataset with aNNotations link
2017 Indoor Real 2D-3D-S Joint 2D-3D-Semantic Data for Indoor Scene Understanding link
2017 Indoor Real Matterport3D Matterport3D: Learning from RGB-D Data in Indoor Environments link
2017 Indoor Real ScanNet ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes link
2017 Indoor Real Laval Indoor Learning to Predict Indoor Illumination from a Single Image link
2018 Indoor, Urban Real RealEstate10K Stereo Magnification: Learning View Synthesis using Multiplane Images link
2019 Indoor Real Replica The Replica Dataset: A Digital Replica of Indoor Spaces link
2020 Indoor Real 3DSSG Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions link
2021 Indoor Real HM3D Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI link
2023 Indoor Real ScanNet++ ScanNet++: A high-fidelity dataset of 3D indoor scenes link
2023 Indoor, Nature, Urban Real DL3DV-10K DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision link
2012 Indoor Synthetic SceneSynth Example-based synthesis of 3D object arrangements link
2017 Indoor Synthetic SUNCG Semantic Scene Completion from a Single Depth Image link
2020 Indoor Synthetic Structured3D Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling link
2020 Indoor Synthetic HyperSim HyperSim: A photorealistic synthetic dataset for holistic indoor scene understanding link
2021 Indoor Synthetic 3D-FRONT 3D-FRONT: 3D Furnished Rooms with layOuts and semaNTics link
2021 Indoor Synthetic 3D-Future 3D-FUTURE: 3D Furniture shape with TextURE link
2023 Indoor Synthetic SG-FRONT CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion link
2025 Indoor Synthetic SE(3) Scene Steerable Scene Generation with Post Training and Inference-Time Search link

Natural Datasets

Year Type Source Acronym Paper Project
2017 Nature Real Laval Outdoor Deep Sky Modeling for Single Image Outdoor Lighting Estimation link
2019 Nature Real LHQ Aligning latent and image spaces to connect the unconnectable link
2021 Nature Real ACID Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image link

Urban Datasets

Year Type Source Acronym Paper Project
2012 Urban Real KITTI Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite link
2016 Urban Real Cityscapes The Cityscapes dataset for semantic urban scene understanding link
2019 Urban Real SemanticKITTI SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences link
2020 Urban Real Waymo Scalability in Perception for Autonomous Driving: Waymo Open Dataset link
2020 Urban Real nuScenes nuScenes: A multimodal dataset for autonomous driving link
2023 Urban Real KITTI-360 KITTI-360: A novel dataset and benchmarks for urban scene understanding in 2D and 3D. link
2020 Urban Real HoliCity HoliCity: A city-scale data platform for learning holistic 3D structures link
2022 Urban Real OmniCity OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images link
2024 Urban Real OSM CityDreamer: Compositional Generative Model of Unbounded 3D Cities link
2024 Urban Real GoogleEarth CityDreamer: Compositional Generative Model of Unbounded 3D Cities link
2017 Urban Synthetic CARLA CARLA: An Open Urban Driving Simulator link
2022 Urban Synthetic CarlaSC MotionSC: Data Set and Network for Real-Time Semantic Mapping in Dynamic Environments link
2020 Urban Synthetic Virtual-KITTI-2 Virtual KITTI 2 link
2025 Urban Synthetic CityTopia CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities link

Applications and Tasks

3D Scene Editing

Year Venue Acronym Paper Project Repo@GitHub
2022 CVPR StyleMesh StyleMesh: Style Transfer for Indoor 3D Scene Reconstructions link GitHub
2023 CVPR DisCoScene DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis link GitHub
2023 CVPR LEGO-Net LEGO-Net: Learning Regular Rearrangements of Objects in Rooms link GitHub
2023 CVPR Lift3D Lift3D: Synthesize 3D Training Data by Lifting 2D GAN to 3D Generative Radiance Field link GitHub
2023 CVPR Text2Scene Text2Scene: Text-driven Indoor Scene Stylization with Part-aware Details
2023 ICRA CabiNet CabiNet: Scaling Neural Collision Detection for Object Rearrangement with Procedural Scene Generation link GitHub
2023 MM RoomDreamer RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture
2024 CVPR SceneTex SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors link GitHub
2024 CVPR ControlRoom3D ControlRoom3D 🤖Room Generation using Semantic Proxy Rooms link
2024 ECCV StyleCity StyleCity: Large-Scale 3D Urban Scenes Stylization link GitHub
2024 ECCV RoomTex RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting link GitHub
2024 ECCV 3D-GOI 3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing link
2024 MM SceneExpander SceneExpander: Real-Time Scene Synthesis for Interactive Floor Plan Editing GitHub
2024 NeurIPS Neural Assets Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models link
2024 NeurIPS DeBaRA DeBaRA: Denoising-Based 3D Room Arrangement Generation
2024 SIGGRAPH Asia InstanceTex InstanceTex: Instance-level Controllable Texture Synthesis for 3D Scenes via Diffusion Priors link
2024 TVCG SceneDirector SceneDirector: Interactive Scene Synthesis by Simultaneously Editing Multiple Objects in Real-Time GitHub
2024 VR DreamSpace DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation link GitHub
2025 3DV Ctrl-Room Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints link GitHub
2025 CVPR RoomPainter RoomPainter: View-Integrated Diffusion for Consistent Indoor Scene Texturing

Human-Scene Interaction

Year Venue Acronym Paper Project Repo@GitHub
2022 CVPR Towards Diverse and Natural Scene-aware 3D Human Motion Synthesis
2022 ECCV COINS Compositional Human-Scene Interaction Synthesis with Semantic Control link GitHub
2023 CVPR SceneDiffuser Diffusion-based Generation, Optimization, and Planning in 3D Scenes link GitHub
2023 ICCV DIMOS Synthesizing Diverse Human Motions in 3D Indoor Scenes link GitHub
2023 SIGGRAPH InterPhys Synthesizing Physical Character-Scene Interactions link
2024 3DV InterScene Synthesizing Physically Plausible Human Motions in 3D Scenes link GitHub
2024 CVPR GenZI GenZI: Zero-Shot 3D Human-Scene Interaction Generation link
2024 ICLR UniHSI UniHSI: Unified Human-Scene Interaction via Prompted Chain-of-Contacts link GitHub
2024 arXiv SIMS SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation link GitHub
2025 CVPR TokenHSI TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization link GitHub

Embodied AI

Year Venue Acronym Paper Project Repo@GitHub
2022 NeurIPS ProcTHOR ProcTHOR: Large-Scale Embodied AI Using Procedural Generation link GitHub
2024 CVPR Holodeck Holodeck: Language Guided Generation of 3D Embodied AI Environments link GitHub
2024 CVPR PhyScene PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI link GitHub
2024 NeurIPS Architect Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting link GitHub
2024 arXiv GRUtopia GRUtopia: Dream General Robots in a City at Scale link GitHub
2024 arXiv EmbodiedCity EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment link GitHub
2024 arXiv InfiniteWorld InfiniteWorld: A Unified Scalable Simulation Framework for General Visual-Language Robot Interaction GitHub
2025 ICLR MetaUrban MetaUrban: An Embodied AI Simulation Platform for Urban Micromobility link GitHub

Robotics

Year Venue Acronym Paper Project Repo@GitHub
2023 NeurIPS UniPi Learning Universal Policies via Text-Guided Video Generation link
2023 NeurIPS HiP Compositional Foundation Models for Hierarchical Planning link GitHub
2024 CoRL Imagination Policy Imagination Policy: Using Generative Point Cloud Models for Learning Manipulation Policies link GitHub
2024 CoRL Eurekaverse Eurekaverse: Environment Curriculum Generation via Large Language Models link GitHub
2024 ICLR GR-1 Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation link GitHub
2024 ICML RoboGen RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation link GitHub
2024 ICML VLP Using Left and Right Brains Together: Towards Vision and Language Planning
2024 IROS ActNeRF Uncertainty-aware Active Learning of NeRF-based Object Models for Robot Manipulators using Visual and Re-orientation Actions link GitHub
2024 NeurIPS CLOVER Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation GitHub
2024 arXiv GR-2 GR-2: A Generative Video-Language-Action Model with Web-Scale Knowledge for Robot Manipulation link
2025 ICLR SlowFast-VGen SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation link GitHub
2025 ICML Video Prediction Policy Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations link GitHub
2025 arXiv VideoWorld VideoWorld: Exploring Knowledge Learning from Unlabeled Videos link GitHub
2025 arXiv Cosmos-Transfer1 Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control link GitHub
2025 arXiv TesserAct TesserAct: Learning 4D Embodied World Models link GitHub

Autonomous Driving

Year Venue Acronym Paper Project Repo@GitHub
2023 arXiv GAIA-1 GAIA-1: A Generative World Model for Autonomous Driving link
2023 arXiv Cam4DOcc Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications GitHub
2024 CVPR Drive-WM Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving link GitHub
2024 ECCV DriveDreamer DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving link GitHub
2024 ECCV OccWorld OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving link GitHub
2024 ECCV WoVoGen WoVoGen: World Volume-Aware Diffusion for Controllable Multi-camera Driving Scene Generation GitHub
2024 ICLR MagicDrive MagicDrive: Street View Generation with Diverse 3D Geometry Control link GitHub
2024 NeurIPS Vista Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability link GitHub
2024 arXiv OccSora OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving link GitHub
2024 arXiv Delphi Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation link GitHub
2024 arXiv DriveArena DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving link GitHub
2024 arXiv DiVE DiVE: DiT-based Video Generation with Enhanced Control link GitHub
2024 arXiv DreamForge DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes link
2024 arXiv DrivingWorld DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT link GitHub
2025 AAAI Drive-OccWorld Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving link GitHub
2025 CVPR DrivingSphere DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation link GitHub
2025 ICLR GLAD Glad: A Streaming Scene Generator for Autonomous Driving
2025 arXiv DreamDrive DreamDrive: Generative 4D Scene Modeling from Street View Images link
2025 arXiv Cosmos-Transfer1 Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control link GitHub

About

A curated list of awesome 3D scene generation papers. (arXiv 2505.05474)

Topics

Resources

Stars

Watchers

Forks

Contributors 5