Home

Awesome

<!-- markdownlint-disable MD033 MD041 --> <div align="center">

Mug Diffusion

<!-- prettier-ignore-start --> <!-- markdownlint-disable-next-line MD036 -->

🎶 A charting AI for rhythm games. 🤖

<!-- prettier-ignore-end --> </div> <p align="center"> <a href="https://raw.githubusercontent.com/Keytoyze/Mug-Diffusion/master/LICENSE"> <img src="https://img.shields.io/github/license/Keytoyze/Mug-Diffusion" alt="license"> </a> <a href="https://github.com/Keytoyze/Mug-Diffusion"> <img src="https://img.shields.io/github/stars/Keytoyze/Mug-Diffusion.svg" alt="stars"> </a> <a href="https://github.com/Keytoyze/Mug-Diffusion/issues"> <img src="https://img.shields.io/github/issues/Keytoyze/Mug-Diffusion" alt="issues"> </a> <a href="https://github.com/Keytoyze/Mug-Diffusion/forks"> <img src="https://img.shields.io/github/forks/Keytoyze/Mug-Diffusion" alt="forks"> </a> <a href="http://qm.qq.com/cgi-bin/qm/qr?_wv=1027&k=L1ZmRMFhYXk13TkNzepCq5dFXXAl_yR5&authKey=DihDdCE8PIVSPDyCz6fpYpyVvn%2BOQoVJbMaXyUafMfA4%2FPtULGXg4decNAEGSf8i&noverify=0&group_code=548470510"> <img src="https://img.shields.io/badge/QQ%E7%BE%A4-548470510-orange" alt="forks"> </a> </p> <p align="center"> <a href="https://github.com/Keytoyze/Mug-Diffusion">English</a> | <a href="https://github.com/Keytoyze/Mug-Diffusion/blob/master/README_CN.md">中文</a> </p>

MuG Diffusion is a charting AI for rhythm games based on Stable Diffusion (one of the most powerful AIGC models) with a large modification to incorporate audio waves. Given an audio file, MuG Diffusion is able to generate high-quality diverse charts, which is aligned with the music and highly controllable. Currently, it supports 4K vertical scroll rhythm game (VSRG) only, with the following control options:

MuG Diffusion aims to support other rhythm games in the future (osu!standard, 5-8K VSRG, maimai, etc), and hopes to provide a beneficial AIGC tool for all the charters and players.

Installation and Running

Bundled Executable

I packaged a bundled executable containing all the dependencies and model weights in the Windows platform, which is available at:

Unzip the file and double click "Mug Diffusion.exe", which will open a browser interface for controlling. It takes around 30 seconds on my computer (NVidia 3050Ti, 4GB memory) to generate four charts for a 3-minute-long audio.

Running from Source

If you use other platforms, other GPU types or want to run from source, here are the instructions.

pip install -r requirements.txt
python webui.py

Model Structure and Methodology

Acknowledgement

In order to ensure the fairness and transparency of training, the dataset list is published in here.

Thank all the Charters / Mappers in the community. It's you who endowed MuG Diffusion with intelligence. Besides, I would like to thank the Malody development teams (and many other supporters that cannot be listed due to space limit TAT) for the financial support.

Thank raber for webui development, RiceSS for logo design, and many testers for their support.

Special thanks:

Credits

Charts created through MuG Diffusion are fully open source, explicitly falling under the CC0 1.0 Universal Public Domain Dedication. The model weights and charts created are non-commercial.

Besides, all charts created by AI are tagged with AIMode: MuG Diffusion vx.x.x in the [Meta] section. Please keep its integrity or mark it explicitly unless you modify the most of the notes, otherwise you will be at risk of abusing AI.