Awesome
xtts-finetune-webui
This webui is a slightly modified copy of the official webui for finetune xtts.
If you are looking for an option for normal XTTS use look here https://github.com/daswer123/xtts-webui
TODO
- Add the ability to use via console
Key features:
Data processing
- Updated faster-whisper to 0.10.0 with the ability to select a larger-v3 model.
- Changed output folder to output folder inside the main folder.
- If there is already a dataset in the output folder and you want to add new data, you can do so by simply adding new audio, what was there will not be processed again and the new data will be automatically added
- Turn on VAD filter
- After the dataset is created, a file is created that specifies the language of the dataset. This file is read before training so that the language always matches. It is convenient when you restart the interface
Fine-tuning XTTS Encoder
- Added the ability to select the base model for XTTS, as well as when you re-training does not need to download the model again.
- Added ability to select custom model as base model during training, which will allow finetune already finetune model.
- Added possibility to get optimized version of the model for 1 click ( step 2.5, put optimized version in output folder).
- You can choose whether to delete training folders after you have optimized the model
- When you optimize the model, the example reference audio is moved to the output folder
- Checking for correctness of the specified language and dataset language
Inference
- Added possibility to customize infer settings during model checking.
Other
- If you accidentally restart the interface during one of the steps, you can load data to additional buttons
- Removed the display of logs as it was causing problems when restarted
- The finished result is copied to the ready folder, these are fully finished files, you can move them anywhere and use them as a standard model
- Added support for finetune Japanese
Changes in webui
1 - Data processing
2 - Fine-tuning XTTS Encoder
3 - Inference
Google colab
🐳 Run in Docker
docker run -it --gpus all --pull always -p 7860:7860 --platform=linux/amd64 athomasson2/fine_tune_xtts:huggingface python app.py
Install
- Make sure you have
Cuda
installed git clone https://github.com/daswer123/xtts-finetune-webui
cd xtts-finetune-webui
pip install torch==2.1.1+cu118 torchaudio==2.1.1+cu118 --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt
If you're using Windows
- First start
install.bat
- To start the server start
start.bat
- Go to the local address
127.0.0.1:5003
On Linux
- Run
bash install.sh
- To start the server start
start.sh
- Go to the local address
127.0.0.1:5003
On Apple Silicon Mac (python 3.10 env)
- Run
pip install --no-deps -r apple_silicon_requirements.txt
- To start the server
python xtts_demo.py
- Go to the local address
127.0.0.1:5003
~