<h1 align="center">TTS Generation WebUI / Harmonica</h1> <div align="center"> <h3 align="center">

Download Installer || Installation || Docker Setup || Feedback / Bug reports

</h3>


</div> <div align="center">

Videos

</div>
| Watch the video | Watch the video | Watch the video |
| :---: | :---: | :---: |
<div align="center">

Models

| Text-to-speech | Audio/Music Generation | Audio Conversion/Tools |
| --- | --- | --- |
| Bark | MusicGen | RVC |
| Tortoise | MAGNeT | Demucs |
| Maha TTS | Stable Audio | Vocos |
| MMS | (Extension) Riffusion | Whisper |
| Vall-E X | (Extension) AudioCraft Mac | |
| StyleTTS2 | (Extension) AudioCraft Plus | |
| SeamlessM4T | | |
| (Extension) XTTSv2 | | |
| (Extension) MARS5 | | |
| (Extension) F5-TTS | | |
| (Extension) Parler TTS | | |
</div> <div align="center">

Examples

</div>
<video src="https://github.com/user-attachments/assets/16ac948a-fe98-49ad-ad87-19c41fe7e65e" width="300"></video><video src="https://github.com/user-attachments/assets/55bde4f7-bbcc-4ecf-8f94-b315b9d22e74" width="300"></video><video src="https://github.com/user-attachments/assets/fcee8906-a101-400d-8499-4e72c7603042" width="300"></video>
<div align="center">

Screenshots

</div>
| react_1 | react_2 | react_3 |
| :---: | :---: | :---: |
| gradio_1 | gradio_2 | gradio_3 |

Changelog

Nov 23:

Nov 22:

Nov 15:

Nov 14:

Nov 11:

October 2024

Oct 28:

Oct 24:

Oct 22:

Oct 21:

Oct 19:

Oct 18:

Oct 16:

Oct 3:

September 2024

<details> <summary>Click to expand</summary>

Sep 23:

Sep 22:

Sep 21:

Sep 19:

Sep 2:

</details>

August 2024

<details> <summary>Click to expand</summary>

Aug 31:

Aug 20:

Aug 5:

Aug 4:

Aug 3:

Aug 2:

Aug 1:

</details>

July 2024

<details> <summary>Click to expand</summary>

July 31:

July 29:

July 26:

July 25:

July 21:

July 20:

July 15:

July 14:

July 13:

July 11:

July 9:

July 8:

July 6:

July 1:

</details>

June 2024

<details> <summary>Click to expand</summary>

June 22:

* Add Stable Audio to Gradio.

June 21:

May 17:

May 9:

May 4:

</details>

April 2024

<details> <summary>Click to expand</summary>

Apr 28:

* Add Maha TTS to React UI.
* Add GPU Info to React UI.

Apr 6:

Apr 5:

Apr 4:

</details>

March 2024

<details> <summary>Click to expand</summary>

Mar 28:

Mar 27:

Mar 26:

Mar 22:

Mar 16:

Mar 14:

Mar 13:

Mar 11:

Mar 10:

Mar 5:

Mar 3:

</details>

February 2024

<details> <summary>Click to expand</summary>

Feb 21:

Feb 8:

Feb 6:

</details>

January 2024

<details> <summary>Click to expand</summary>

Jan 21:

Jan 16:

Jan 15:

Jan 14:

Jan 13:

Jan 12:

Jan 11:

Jan 9:

Jan 8:

</details>

2023

<details> <summary>Click to expand</summary>

October 2023

Oct 26:

Oct 24:

September 2023

Sep 21:

Sep 9:

Sep 5:

August 2023

Aug 27:

Aug 26:

Aug 24:

Aug 21:

Aug 20:

Aug 18:

Aug 16:

Aug 15:

Aug 13:

Aug 11:

Aug 8:

Aug 7:

Aug 6:

Aug 4:

Aug 3:

Aug 2:

July 2023

July 26:

July 24:

July 23:

July 21:

July 19:

July 16:

July 10:

July 9:

July 5:

July 2:

July 1:

June 2023

Jun 29:

Jun 27:

Jun 20:

Jun 19:

June 18:

Jun 14:

June 5:

June 4:

June 3:

May 2023

May 21:

May 17:

May 16:

May 13:

May 10:

May 4:

May 3:

May 2:

Before:

</details>

Upgrading (For old installations)

In case of issues, feel free to contact the developers.

<details> <summary>Click to expand</summary>

Upgrading from v6 to new installer

Recommended: Fresh install

In-place upgrade, can delete some files, tweaks

Is there any more optimal way to do this?

Not exactly. The dependencies clash, especially between conda and python (and the dependencies are already in a critical state; moving them to conda is a long way off). Therefore, while it might be possible to just replace the old installer with the new one and run the update, the resulting problems are unpredictable and unfixable. Updating the installer requires a lot of testing, so it's not done lightly.

</details>

Installation

Manual installation (not recommended)

React UI

Docker Setup

tts-generation-webui can also be run inside a Docker container. To get started, pull the image from GitHub Container Registry:

docker pull ghcr.io/rsxdalv/tts-generation-webui:main

Once the image has been pulled, it can be started with Docker Compose:

docker compose up -d

The container will take some time to generate the first output while models are downloaded in the background. The status of this download can be verified by checking the container logs:

docker logs tts-generation-webui

Building the image yourself

If you wish to build your own docker container, you can use the included Dockerfile:

docker build -t tts-generation-webui .

Please note that the Docker Compose file needs to be edited to use the image you just built.
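As a rough illustration of that edit, the sketch below swaps the `image:` value in a compose file's text for the tag used in the build command above. It assumes the file declares the image on a single `image: ...` line; the sample service definition and the helper name `retarget_image` are hypothetical and not taken from the repository's actual compose file.

```python
import re

# Assumption: the compose file names the image on a line such as
#   image: ghcr.io/rsxdalv/tts-generation-webui:main
# The real file's contents are not shown in this README.
IMAGE_LINE = re.compile(r"^([ \t]*image:[ \t]*)\S+[ \t]*$", re.MULTILINE)


def retarget_image(compose_text: str, new_image: str) -> str:
    """Return compose_text with every `image:` value replaced by new_image."""
    # A function replacement avoids backslash handling in new_image.
    return IMAGE_LINE.sub(lambda m: m.group(1) + new_image, compose_text)


if __name__ == "__main__":
    # Hypothetical compose content, for demonstration only.
    sample = (
        "services:\n"
        "  tts-generation-webui:\n"
        "    image: ghcr.io/rsxdalv/tts-generation-webui:main\n"
    )
    print(retarget_image(sample, "tts-generation-webui"))
```

Editing the line by hand is just as good; the point is only that the `image:` entry should name the locally built tag instead of the registry one.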

Extra Voices for Bark, Prompt Samples

<div align="center">

PromptEcho

Bark Speaker Directory

</div>

Bark Readme

README_Bark.md

Info about managing models, caches and system space for AI projects

https://github.com/rsxdalv/tts-generation-webui/discussions/186#discussioncomment-7291274

Open Source Libraries

<details> <summary>This project utilizes the following open source libraries:</summary> </details>

Ethical and Responsible Use

This technology is intended for enablement and creativity, not for harm.

By engaging with this AI model, you acknowledge and agree to abide by these guidelines, employing the AI model in a responsible, ethical, and legal manner.

License

Codebase and Dependencies

The codebase is licensed under MIT. However, it's important to note that when installing the dependencies, you will also be subject to their respective licenses. Although most of these licenses are permissive, there may be some that are not. Therefore, it's essential to understand that the permissive license only applies to the codebase itself, not the entire project.

That being said, the goal is to maintain MIT compatibility throughout the project. If you come across a dependency that is not compatible with the MIT license, please feel free to open an issue and bring it to our attention.

Known non-permissive dependencies:

| Library | License | Notes |
| --- | --- | --- |
| encodec | CC BY-NC 4.0 | Newer versions are MIT, but need to be installed manually |
| diffq | CC BY-NC 4.0 | Optional in the future, not necessary to run, can be uninstalled, should be updated with demucs |
| lameenc | GPL License | Future versions will make it LGPL, but need to be installed manually |
| unidecode | GPL License | Not mission critical, can be replaced with another library, issue: https://github.com/neonbjb/tortoise-tts/issues/494 |
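To see which licenses actually ended up in an environment, one option is to read the metadata that installed packages declare about themselves. The following is a minimal sketch using only the standard library; the helper names are illustrative, and because the metadata is self-reported (and sometimes missing), it is a starting point for a review, not a legal audit.

```python
from importlib.metadata import distributions


def license_of(dist) -> str:
    """Best-effort license string taken from a distribution's metadata."""
    meta = dist.metadata
    # Newer packages may declare License-Expression; older ones use License.
    for key in ("License-Expression", "License"):
        value = meta.get(key)
        if value and value.strip() and value.strip().upper() != "UNKNOWN":
            return value.strip().splitlines()[0]
    # Fall back to Trove classifiers such as "License :: OSI Approved :: MIT License".
    for classifier in meta.get_all("Classifier") or []:
        if classifier.startswith("License ::"):
            return classifier.split("::")[-1].strip()
    return "unknown"


def installed_licenses() -> dict[str, str]:
    """Map each installed distribution name to its declared license."""
    result = {}
    for dist in distributions():
        name = dist.metadata.get("Name")
        if name:  # skip distributions with broken metadata
            result[name] = license_of(dist)
    return result


if __name__ == "__main__":
    for name, lic in sorted(installed_licenses().items()):
        print(f"{name}: {lic}")
```

Run it inside the same environment the Web UI uses, then compare the output against the table above.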

Model Weights

Model weights have different licenses, please pay attention to the license of the model you are using.

Most notably:

Compatibility / Errors

Audiocraft is currently only compatible with Linux and Windows. macOS support has not arrived yet, although a manual installation might be possible.

Torch being reinstalled

Due to limitations of the Python package manager (pip), torch can get reinstalled several times during installation. This is a wide-ranging issue with pip and torch.
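Since torch may be installed more than once along the way, it can be useful to confirm which version is present once the installer finishes. A small sketch, assuming it is run with the same Python interpreter the Web UI uses (the function name is illustrative):

```python
import sys
from importlib.metadata import PackageNotFoundError, version


def installed_version(package: str):
    """Return the installed version string of package, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Defaults to torch; pass other package names as arguments to check them too.
    for name in sys.argv[1:] or ["torch"]:
        print(f"{name}: {installed_version(name) or 'not installed'}")
```

This only reads package metadata and does not import torch, so it stays fast and works even if the install is partially broken.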

Red messages in console

These messages:

---- requires ----, but you have ---- which is incompatible.

These are completely normal. They appear both because of a limitation of pip and because this Web UI combines many different AI projects. Since the projects are not always compatible with each other, each one complains about the versions the others installed. This is expected, and despite the warnings/errors the projects work together. It's not clear whether this situation will ever be resolved, but that is the hope.
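For anyone who wants an overview of these warnings instead of scrolling through the console, the sketch below pulls them out of a saved install log. It assumes the messages follow the shape quoted above (`<package> <version> requires <requirement>, but you have <name> <version> which is incompatible.`); the package names in the sample are made up. Running `pip check` after installation reports similar conflicts in a slightly different wording.

```python
import re

# Matches the pip message shape quoted above.
CONFLICT = re.compile(
    r"(?P<package>\S+) (?P<package_version>\S+) requires (?P<requirement>.+?), "
    r"but you have (?P<installed>\S+ \S+) which is incompatible\."
)


def find_conflicts(log_text: str) -> list[dict[str, str]]:
    """Return one dict per dependency-conflict message found in log_text."""
    return [match.groupdict() for match in CONFLICT.finditer(log_text)]


if __name__ == "__main__":
    # Made-up log excerpt, for demonstration only.
    sample_log = (
        "examplepkg 1.0.0 requires torch==2.0.0, "
        "but you have torch 2.3.1 which is incompatible.\n"
        "Successfully installed otherpkg-0.1\n"
    )
    for conflict in find_conflicts(sample_log):
        print(
            f"{conflict['package']} wants {conflict['requirement']}, "
            f"found {conflict['installed']}"
        )
```

A long list here is not by itself a sign of a broken install; it mainly helps when tracking down a feature that fails at runtime.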