🌀 ControlNeXt

📝 Project Page | 📚 Paper | 🗂️ Demo (SDXL)

ControlNeXt is our official implementation of controllable generation. It supports both images and videos and accepts diverse forms of control information. In this project, we propose a new method that reduces trainable parameters by up to 90% compared with ControlNet, while converging faster and running more efficiently. The method can be combined directly with LoRA techniques to alter style and produce more stable generation. Please refer to the examples for more details.

We provide an online demo of ControlNeXt-SDXL. Due to the high resource requirements of SVD, we are unable to offer an online demo for the SVD models.

This project is still undergoing iterative development. The code and model may be updated at any time. More information will be provided later.

Experiences

We share more training experiences there and in the Issues. We spent a lot of time finding these, and now we share them with all of you. We hope they help!

Model Zoo

🎥 Examples

For more examples, please refer to our Project Page.

ControlNeXt-SDXL

<p align="center"> <img src="ControlNeXt-SDXL/examples/demo/demo1.jpg" width="80%" alt="demo1"> <img src="ControlNeXt-SDXL/examples/demo/demo2.jpg" width="80%" alt="demo2"> <img src="ControlNeXt-SDXL/examples/demo/demo3.jpg" width="80%" alt="demo3"> <img src="ControlNeXt-SDXL/examples/demo/demo5.jpg" width="80%" alt="demo5"> </p>

ControlNeXt-SVD-v2

If the videos fail to load, you can download them directly from here and here, or view them on our Project Page or BiliBili.

<table style="margin: 0 auto; border-collapse: collapse;"> <tr> <td width="40%" style="border: none;"> <video width="100%" height="auto" style="display: block; margin: 0px auto;" controls autoplay loop src="https://github.com/user-attachments/assets/9d45a00a-d3cd-48e1-aa78-1d3158bfd4f4" muted="false"></video> </td> <td width="40%" style="border: none;"> <video width="100%" height="auto" style="display: block; margin: 0px auto;" controls autoplay loop src="https://github.com/user-attachments/assets/1004960a-82de-4f0d-a329-ba676b8cbd0d" muted="false"></video> </td> </tr> <tr> <td width="40%" style="border: none;"> <video width="100%" height="auto" style="display: block; margin: 0px auto;" controls autoplay loop src="https://github.com/user-attachments/assets/7db1acd1-0c61-4855-91bb-e4e8f8989393" muted="false"></video> </td> <td width="40%" style="border: none;"> <video width="100%" height="auto" style="display: block; margin: 0px auto;" controls autoplay loop src="https://github.com/user-attachments/assets/0f32df53-1827-404d-806a-23e65d357504" muted="false"></video> </td> </tr> </table>

<video width="80%" height="auto" style="display: block; margin: 0px auto;" controls autoplay loop src="https://github.com/user-attachments/assets/c69b4f34-0851-4637-a9ef-fb91beed5666" muted="false"></video>

<video width="80%" height="auto" style="display: block; margin: 0px auto;" controls autoplay loop src="https://github.com/user-attachments/assets/32a4d24b-bc39-4ea9-9fd4-ed78b4eec116" muted="false"></video>

ControlNeXt-SVD

If the videos fail to load, you can download them directly from here.

<video width="80%" height='auto' style="display: block; margin: 20px auto;" controls autoplay loop src="https://github.com/dvlab-research/ControlNeXt/assets/22709028/c0ec7591-2100-42d9-99dd-cc55c5fa006f" muted="false"></video>

<video width="80%" height='auto' style="display: block; margin: 20px auto;" controls autoplay loop src="https://github.com/dvlab-research/ControlNeXt/assets/22709028/5da1ba44-cb92-49c2-95f4-05b8e02ed6eb" muted="false"></video>

<table> <tr> <td width=50% style="border: none"> <video width="80%" height='auto' style="display: block; margin: 0px auto;" controls autoplay loop src="https://github.com/dvlab-research/ControlNeXt/assets/22709028/bc819ca7-81f3-4d63-901a-a1a4b4afc395" muted="false"></video> </td> <td width=50% style="border: none"> <video width="80%" height='auto' style="display: block; margin: 0px auto;" controls autoplay loop src="https://github.com/dvlab-research/ControlNeXt/assets/22709028/adc01423-fcec-467e-a7ab-87a6e1ef5f62" muted="false"></video> </td> </tr> </table>

ControlNeXt-SD1.5

<p align="center"> <img src="ControlNeXt-SD1.5/examples/deepfashion_multiview/eval_img/DreamShaper.jpg" width="90%" alt="DreamShaper"> </p> <p align="center"> <img src="ControlNeXt-SD1.5/examples/deepfashion_multiview/eval_img/Anythingv3_fischl.jpg" width="90%" alt="Anythingv3"> </p> <p align="center"> <img src="ControlNeXt-SD1.5/examples/deepfashion_caption/eval_img/chinese_style.jpg" width="90%" alt="Anythingv3"> </p>

If you find this work useful, please consider citing:

@article{peng2024controlnext,
  title={ControlNeXt: Powerful and Efficient Control for Image and Video Generation},
  author={Peng, Bohao and Wang, Jian and Zhang, Yuechen and Li, Wenbo and Yang, Ming-Chang and Jia, Jiaya},
  journal={arXiv preprint arXiv:2408.06070},
  year={2024}
}