
InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions

Sirui Xu, Hung Yu Ling, Yu-Xiong Wang, Liang-Yan Gui
University of Illinois Urbana-Champaign, Electronic Arts
CVPR 2025 Highlight 🏆

🏠 Overview

InterMimic teaser

InterMimic features a single unified policy that spans diverse full-body interactions with dynamic, heterogeneous objects, and it works out of the box for both SMPL-X and Unitree G1 humanoids.

📹 Demo

🔥 News

  • [2025-06-10] Released instructions for student policy inference.
  • [2025-06-03] Initial release of PSI and the processed data. Next release: teacher policy inference for dynamics-aware retargeting, and student policy inference.
  • [2025-05-26] It's been a while! The student policy training pipeline has been released! The PSI and other data construction pipelines will follow soon.
  • [2025-04-18] Released a checkpoint with high-fidelity physics and enhanced contact precision.
  • [2025-04-11] The training code for teacher policies is live—try training your own policy!
  • [2025-04-05] We're excited by the overwhelming interest in humanoid robot support and are ahead of schedule in open-sourcing our Unitree-G1 integration, starting with a small demo supporting the G1 and its original three-fingered dexterous hands. Join us in exploring whole-body loco-manipulation with humanoid robots!
  • [2025-04-04] InterMimic has been selected as a CVPR Highlight Paper 🏆. More exciting developments are on the way!
  • [2025-03-25] We’ve officially released the codebase and checkpoint for teacher policy inference demo — give it a try! ☕️

📖 Getting Started

Dependencies

Follow these instructions:

  1. Create a new conda environment and install PyTorch:

    conda create -n intermimic python=3.8
    conda install pytorch torchvision torchaudio pytorch-cuda=11.6 -c pytorch -c nvidia
    pip install -r requirement.txt

    You may also build the environment from environment.yml, which might contain some redundancies:

    conda env create -f environment.yml
  2. Download and set up Isaac Gym (a sketch of this step and the next follows the list).

  3. Download the dataset, unzip it, and move the extracted folder to InterAct/OMOMO_new/. This build contains minor fixes to the original release, so your results may deviate slightly from those reported in the paper.

  4. Activate the environment:

    conda activate intermimic
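
The two download steps above (2 and 3) ship no commands in this repository, so the lines below are only a minimal sketch of one way to carry them out. The Isaac Gym commands assume the standard layout of NVIDIA's Isaac Gym Preview package, and the dataset archive and folder names are placeholders; adjust them to whatever you actually downloaded.

# Step 2 (sketch): install Isaac Gym into the intermimic environment.
# Assumes the extracted NVIDIA Isaac Gym Preview package sits at ./isaacgym.
conda activate intermimic
cd isaacgym/python && pip install -e . && cd ../..
# Optional sanity check that PyTorch and the GPU are visible.
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"

# Step 3 (sketch): unzip the dataset and move it into place.
unzip OMOMO_new.zip              # archive name is a placeholder
mkdir -p InterAct
mv OMOMO_new InterAct/OMOMO_new  # extracted folder name is a placeholder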

Data Replay

To replay the ground-truth data, execute the following command:

sh scripts/data_replay.sh

Note: The output colors represent the ground-truth contact markers for the links.

Teacher Policy Training

To train a teacher policy, execute the following command:

sh scripts/train_teacher.sh

For a higher-fidelity simulation, sufficient for low-dynamics interactions (trading some efficiency for realism), run:

sh scripts/train_teacher_new.sh

How to enable PSI

Open the training config (for example, omomo_train_new.yaml) and set:

physicalBufferSize: <integer greater than 1>
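
As a minimal sketch (the config's directory is looked up by name, and the value 4 is just an example; any integer greater than 1 enables PSI), you could flip the setting from the shell and then launch teacher training as usual:

CFG=$(find . -name omomo_train_new.yaml | head -n 1)            # locate the training config
sed -i 's/physicalBufferSize:.*/physicalBufferSize: 4/' "$CFG"  # any integer > 1 enables PSI
sh scripts/train_teacher_new.sh                                 # or whichever training script uses this config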

Student Policy Training

To train a student policy with distillation, execute the following command:

sh scripts/train_student.sh

Teacher Policy Inference

We’ve released a checkpoint for one of the 17 teacher policies on OMOMO, along with some sample data. To get started:

  1. Download the checkpoints and place them in the current directory.

  2. Then, run the following command:

    sh scripts/test_teacher.sh
  3. To run with the high-fidelity modeling (trading some efficiency for realism):

    sh scripts/test_teacher_new.sh
  4. 🔥 To try it on the Unitree G1 with its three-fingered dexterous hands, learned directly from MoCap without any external retargeting:

    sh scripts/test_g1.sh

Student Policy Inference

After finishing student policy training, run inference with:

sh scripts/test_student.sh

📝 TODO List

  • Release inference demo for the teacher policy
  • Add support for Unitree-G1 with dexterous robot hands
  • Release training pipeline for the teacher policy
  • Release student policy distillation training
  • Release processed MoCap
  • Release inference pipeline for the student policy
  • Distilled reference data (physically correct HOI data❗️), and all related checkpoints
  • Release all data and processing scripts alongside the InterAct launch
  • Release physics-based text-to-HOI and interaction prediction demo

🔗 Citation

If you find our work helpful, please cite:

@inproceedings{xu2025intermimic,
  title = {{InterMimic}: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions},
  author = {Xu, Sirui and Ling, Hung Yu and Wang, Yu-Xiong and Gui, Liang-Yan},
  booktitle = {CVPR},
  year = {2025},
}

Our data is sourced from InterAct. Please consider citing:

@inproceedings{xu2025interact,
  title = {{InterAct}: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation},
  author = {Xu, Sirui and Li, Dongting and Zhang, Yucheng and Xu, Xiyan and Long, Qi and Wang, Ziyin and Lu, Yunzhi and Dong, Shuchang and Jiang, Hezi and Gupta, Akshat and Wang, Yu-Xiong and Gui, Liang-Yan},
  booktitle = {CVPR},
  year = {2025},
}

Please also consider citing the specific sub-dataset you used from InterAct.

Our integrated kinematic model builds upon InterDiff, HOI-Diff, and InterDreamer. Please consider citing the following if you find this component useful:

@inproceedings{xu2024interdreamer,
  title = {{InterDreamer}: Zero-Shot Text to 3D Dynamic Human-Object Interaction},
  author = {Xu, Sirui and Wang, Ziyin and Wang, Yu-Xiong and Gui, Liang-Yan},
  booktitle = {NeurIPS},
  year = {2024},
}

@inproceedings{xu2023interdiff,
  title = {{InterDiff}: Generating 3D Human-Object Interactions with Physics-Informed Diffusion},
  author = {Xu, Sirui and Li, Zhengyuan and Wang, Yu-Xiong and Gui, Liang-Yan},
  booktitle = {ICCV},
  year = {2023},
}

@article{peng2023hoi,
  title = {HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models},
  author = {Peng, Xiaogang and Xie, Yiming and Wu, Zizhao and Jampani, Varun and Sun, Deqing and Jiang, Huaizu},
  journal = {arXiv preprint arXiv:2312.06553},
  year = {2023}
}

Our SMPL-X-based humanoid model is adapted from PHC. Please consider citing:

@inproceedings{Luo2023PerpetualHC,
  author = {Zhengyi Luo and Jinkun Cao and Alexander W. Winkler and Kris Kitani and Weipeng Xu},
  title = {Perpetual Humanoid Control for Real-time Simulated Avatars},
  booktitle = {ICCV},
  year = {2023}
}

👏 Acknowledgements and 📚 License

This repository builds upon the following excellent open-source projects:

  • IsaacGymEnvs: Contributes to the environment code
  • rl_games: Serves as the core reinforcement learning framework
  • PHC: Used for data construction
  • PhysHOI: Contributes to the environment code
  • InterAct, OMOMO: Core resource for dataset construction
  • InterDiff: Supports kinematic generation
  • HOI-Diff: Supports kinematic generation

This codebase is released under the MIT License.
Please note that it also relies on external libraries and datasets, each of which may be subject to their own licenses and terms of use.

