Home

Awesome

<p align="center"> <h1 align="center"><a href="https://yanjieze.com/H-InDex/">H-InDex</a>: Visual Reinforcement Learning with <a href="https://yanjieze.com/H-InDex/">H</a>and<a href="https://yanjieze.com/H-InDex/">-In</a>formed Representations for <a href="https://yanjieze.com/H-InDex/">Dex</a>terous Manipulation</h1> <h2 align="center">NeurIPS 2023</h2> <p align="center"> <a><strong>Yanjie Ze</strong></a> · <a><strong>Yuyao Liu*</strong></a> · <a><strong>Ruizhe Shi*</strong></a> · <a><strong>Jiaxin Qin</strong></a> · <a><strong>Zhecheng Yuan</strong></a> · <a><strong>Jiashun Wang</strong></a> · <a><strong>Xiaolong Wang</strong></a> · <a><strong>Huazhe Xu</strong></a> </p> </p> <h3 align="center"> <a href="https://yanjieze.com/H-InDex/"><strong>Project Page</strong></a> | <a href="https://arxiv.org/abs/2310.01404"><strong>arXiv</strong></a> | <a href=""><strong>Twitter</strong></a> </h3> <div align="center"> <img src="teaser.png" alt="Logo" width="100%"> </div>

🧾 Introduction

H-InDex is a visual reinforcement learning framework that leverages hand-informed representations to learn dexterous manipulation skills with great efficiency. H-InDex consistes of three stages: pre-training, offline adaptation, and reinforcement learning. In this repo, all the stages are provided, together with the pre-trained checkpoint and the adapted checkpoints.

We also encourage the user to use our pre-trained representations directly for their own downstream tasks.

To benchmark our method, we also provide several strong baselines in this repo, including VC-1, MVP, R3M, and RRL.

Enjoy Dexterity!

💻 Installation

See INSTALL.md.

We also provide some error catching solutions in INSTALL.md.

Feel free to post an issue if you have any questions.

🛠️ Usage

We use wandb to log the training process. Remember to set your wandb account before training by wandb login. You could also disable wandb by use_wandb=0 in our script.

Given a task name task_name, you could run the following pipeline.

🦉 Tasks

We provide 12 dexterous manipulation Tasks in total:

🙏 Acknowledgement

Our work is based on many open-source projects. The algorithms are mainly built upon RRL and TTP. The simulation environments are from DAPG and DexMV. The pre-trained hand representation is from FrankMocap. Baselines are from RRL, MVP, R3M and VC-1. We thank all these authors for their nicely open sourced code and their great contributions to the community.

🏷️ License

H-InDex is licensed under the MIT license. See the LICENSE file for details.

📝 Citation

If you find our work useful, please consider citing:

@article{Ze2023HInDex,
  title={H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation},
  author={Yanjie Ze and Yuyao Liu and Ruizhe Shi and Jiaxin Qin and Zhecheng Yuan and Jiashun Wang and Xiaolong Wang and Huazhe Xu},
  journal={NeurIPS}, 
  year={2023},
}