EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence

EmbodiedGen is a generative engine to create diverse and interactive 3D worlds composed of high-quality 3D assets(mesh & 3DGS) with plausible physics, leveraging generative AI to address the challenges of generalization in embodied intelligence related research. It composed of six key modules: Image-to-3D, Text-to-3D, Texture Generation, Articulated Object Generation, Scene Generation and Layout Generation.

✨ Table of Contents of EmbodiedGen

🖼️ Image-to-3D
📝 Text-to-3D
🎨 Texture Generation
🌍 3D Scene Generation
⚙️ Articulated Object Generation
🏞️ Layout (Interactive 3D Worlds) Generation

🚀 Quick Start

✅ Setup Environment

git clone https://github.com/HorizonRobotics/EmbodiedGen.git
cd EmbodiedGen
git checkout v0.1.1
git submodule update --init --recursive --progress
conda create -n embodiedgen python=3.10.13 -y
conda activate embodiedgen
bash install.sh

✅ Setup GPT Agent

Update the API key in file: embodied_gen/utils/gpt_config.yaml.

You can choose between two backends for the GPT agent:

gpt-4o (Recommended) – Use this if you have access to Azure OpenAI.
qwen2.5-vl – An alternative with free usage via OpenRouter, apply a free key here and update api_key in embodied_gen/utils/gpt_config.yaml (50 free requests per day)

🖼️ Image-to-3D

Generate physically plausible 3D asset URDF from single input image, offering high-quality support for digital twin systems.

☁️ Service

Run the image-to-3D generation service locally. Models downloaded automatically on first run, please be patient.

# Run in foreground
python apps/image_to_3d.py
# Or run in the background
CUDA_VISIBLE_DEVICES=0 nohup python apps/image_to_3d.py > /dev/null 2>&1 &

⚡ API

Generate physically plausible 3D assets from image input via the command-line API.

img3d-cli --image_path apps/assets/example_image/sample_04.jpg apps/assets/example_image/sample_19.jpg \
    --n_retry 2 --output_root outputs/imageto3d

# See result(.urdf/mesh.obj/mesh.glb/gs.ply) in ${output_root}/sample_xx/result

📝 Text-to-3D

Create 3D assets from text descriptions for a wide range of geometry and styles.

☁️ Service

Deploy the text-to-3D generation service locally.

Text-to-image model based on the Kolors model, supporting Chinese and English prompts. Models downloaded automatically on first run, please be patient.

python apps/text_to_3d.py

⚡ API

Text-to-image model based on SD3.5 Medium, English prompts only. Usage requires agreement to the model license(click accept), models downloaded automatically. (ps: models with more permissive licenses found in embodied_gen/models/image_comm_model.py)

For large-scale 3D assets generation, set --n_pipe_retry=2 to ensure high end-to-end 3D asset usability through automatic quality check and retries. For more diverse results, do not set --seed_img.

text3d-cli --prompts "small bronze figurine of a lion" "A globe with wooden base" "wooden table with embroidery" \
    --n_image_retry 2 --n_asset_retry 2 --n_pipe_retry 1 --seed_img 0 \
    --output_root outputs/textto3d

Text-to-image model based on the Kolors model.

bash embodied_gen/scripts/textto3d.sh \
    --prompts "small bronze figurine of a lion" "A globe with wooden base and latitude and longitude lines" "橙色电动手钻，有磨损细节" \
    --output_root outputs/textto3d_k

🎨 Texture Generation

Generate visually rich textures for 3D mesh.

☁️ Service

Run the texture generation service locally. Models downloaded automatically on first run, see download_kolors_weights, geo_cond_mv.

python apps/texture_edit.py

⚡ API

Support Chinese and English prompts.

bash embodied_gen/scripts/texture_gen.sh \
    --mesh_path "apps/assets/example_texture/meshes/robot_text.obj" \
    --prompt "举着牌子的写实风格机器人，大眼睛，牌子上写着“Hello”的文字" \
    --output_root "outputs/texture_gen/robot_text"

bash embodied_gen/scripts/texture_gen.sh \
    --mesh_path "apps/assets/example_texture/meshes/horse.obj" \
    --prompt "A gray horse head with flying mane and brown eyes" \
    --output_root "outputs/texture_gen/gray_horse"

🌍 3D Scene Generation

🚧 Coming Soon

⚙️ Articulated Object Generation

🚧 Coming Soon

🏞️ Layout(Interactive 3D Worlds) Generation

💬 Generate Layout from task description

🚧 Coming Soon

🖼️ Real-to-Sim Digital Twin

For Developer

pip install .[dev] && pre-commit install
python -m pytest # Pass all unit-test are required.

📚 Citation

If you use EmbodiedGen in your research or projects, please cite:

@misc{wang2025embodiedgengenerative3dworld,
      title={EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence},
      author={Xinjie Wang and Liu Liu and Yu Cao and Ruiqi Wu and Wenkang Qin and Dehui Wang and Wei Sui and Zhizhong Su},
      year={2025},
      eprint={2506.10600},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2506.10600},
}

🙌 Acknowledgement

⚖️ License

This project is licensed under the Apache License 2.0. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
apps		apps
embodied_gen		embodied_gen
scripts		scripts
tests		tests
thirdparty		thirdparty
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
install.sh		install.sh
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements.txt		requirements.txt
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence

✨ Table of Contents of EmbodiedGen

🚀 Quick Start

✅ Setup Environment

✅ Setup GPT Agent

🖼️ Image-to-3D

☁️ Service

⚡ API

📝 Text-to-3D

☁️ Service

⚡ API

🎨 Texture Generation

☁️ Service

⚡ API

🌍 3D Scene Generation

⚙️ Articulated Object Generation

🏞️ Layout(Interactive 3D Worlds) Generation

💬 Generate Layout from task description

🖼️ Real-to-Sim Digital Twin

For Developer

📚 Citation

🙌 Acknowledgement

⚖️ License

About

Uh oh!

Releases

Packages

Languages

License

HorizonRobotics/EmbodiedGen

Folders and files

Latest commit

History

Repository files navigation

EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence

✨ Table of Contents of EmbodiedGen

🚀 Quick Start

✅ Setup Environment

✅ Setup GPT Agent

🖼️ Image-to-3D

☁️ Service

⚡ API

📝 Text-to-3D

☁️ Service

⚡ API

🎨 Texture Generation

☁️ Service

⚡ API

🌍 3D Scene Generation

⚙️ Articulated Object Generation

🏞️ Layout(Interactive 3D Worlds) Generation

💬 Generate Layout from task description

🖼️ Real-to-Sim Digital Twin

For Developer

📚 Citation

🙌 Acknowledgement

⚖️ License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages