Home

Awesome

Thorsten-Voice logo

Motivation for Thorsten-Voice project :speaking_head: :speech_balloon:

A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.

Personal words by Thorsten Müller

I contribute my voice as a person believing in a world where all people are equal. No matter of gender, sexual orientation, religion, skin color and geocoordinates of birth location. A global world where everybody is warmly welcome on any place on this planet and open and free knowledge and education is available to everyone. :earth_africa: (Thorsten Müller)

Please keep in mind, that i am no professional voice talent. I'm just a normal guy sharing his voice with the world.

Social media

YouTube Channel Subscribers <a href="https://twitter.com/intent/follow?screen_name=ThorstenVoice"><img src="https://img.shields.io/twitter/follow/ThorstenVoice?style=social&logo=twitter" alt="follow on Twitter"></a> Web

Feel free to contact me on social media 🤗.

PlatformLink
YoutubeThorstenVoice on Youtube
LinkedInThorsten Müller on LinkedIn
TwitterThorstenVoice on Twitter
HuggingfaceThorstenVoice on Huggingface
InstagramThorstenVoice on Instagram

Voice-Datasets

All my "Thorsten-Voice" datasets are listed and downloadable on Zenodo. Qoutation is highly appreciated in case you use them in your projects, products or papers.

DatasetDOI Link
Thorsten-Voice Dataset 2021.02 (Neutral)DOI
Thorsten-Voice Dataset 2021.06 (Emotional)DOI
Thorsten-Voice Dataset 2022.10 (Neutral)DOI
Thorsten-Voice Dataset 2023.09 (Hessisch)DOI

Thorsten-Voice Dataset 2021.02 (Neutral)

DOI

@dataset{muller_2021_5525342,
  author       = {Müller, Thorsten and
                  Kreutz, Dominik},
  title        = {Thorsten-Voice Dataset 2021.02},
  month        = sep,
  year         = 2021,
  note         = {{Please use it to make the world a better place for 
                   whole humankind.}},
  publisher    = {Zenodo},
  version      = {3.0},
  doi          = {10.5281/zenodo.5525342},
  url          = {https://doi.org/10.5281/zenodo.5525342}
}

Dataset summary

Dataset evolution

As described in the PDF document (evolution of thorsten dataset) this dataset consists of three recording phases.

If you want to use a dataset subset you can see which files belong to which recording phase in recording quality csv file.

Thorsten-Voice Dataset 2021.06 (Emotional)

DOI

@dataset{muller_2021_5525023,
  author       = {Müller, Thorsten and
                  Kreutz, Dominik},
  title        = {Thorsten-Voice Dataset 2021.06 emotional},
  month        = sep,
  year         = 2021,
  note         = {{Please use it to make the world a better place for 
                   whole humankind.}},
  publisher    = {Zenodo},
  version      = {2.0},
  doi          = {10.5281/zenodo.5525023},
  url          = {https://doi.org/10.5281/zenodo.5525023}
}

All emotional recordings where recorded by myself and i tried to feel and pronounce that emotion even if the phrase context does not match that emotion. Example: I pronounced the sleepy recordings in the tone i have shortly before falling asleep.

Dataset summary

Thorsten-Voice Dataset 2022.10 (Neutral)

DOI

:speaking_head: Listen to some audio recordings from this dataset here.

@dataset{muller_2022_7265581,
  author       = {Müller, Thorsten and
                  Kreutz, Dominik},
  title        = {Thorsten-Voice Dataset 2022.10},
  month        = nov,
  year         = 2022,
  publisher    = {Zenodo},
  version      = {1.0},
  doi          = {10.5281/zenodo.7265581},
  url          = {https://doi.org/10.5281/zenodo.7265581}
}

Thorsten-Voice Dataset 2023.09 (Hessisch)

DOI

@dataset{muller_2024_10511260,
  author       = {Müller, Thorsten and
                  Kreutz, Dominik},
  title        = {Thorsten-Voice Dataset 2023.09 Hessisch},
  month        = jan,
  year         = 2024,
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.10511260},
  url          = {https://doi.org/10.5281/zenodo.10511260}
}

Thorsten-Voice Dataset FULL 44kHz

Celebrating 🎉 5 years of Thorsten-Voice project (est. october 2019) i released ALL recordings in FULL samplerate (44kHz) in an ALL-IN-ONE dataset on 🤗HuggingFace! Obviously again in CC0 license!

@misc {thorsten_müller_2024,
    author       = { {Thorsten Müller} },
    title        = { TV-44kHz-Full (Revision ff427ec) },
    year         = 2024,
    url          = { https://huggingface.co/datasets/Thorsten-Voice/TV-44kHz-Full },
    doi          = { 10.57967/hf/3290 },
    publisher    = { Hugging Face }
}

TTS Models

Based on these opensource voice datasets several TTS (text to speech) models have been trained using AI / machine learning technology.

There are multiple german models available trained and used by by the projects Coqui AI, Piper TTS and Home Assistant. You can find more information on how to use them, audio samples and video tutorials on the Thorsten-Voice project website.

Listen to audio samples and installation / usage instructions here (🇩🇪):

In addition Silero, Monatis and ZDisket used my voice datasets for model training too. More samples and details can be found on Silero Thorsten-Voice audio samples. See this colab notebook for more details.

ZDisket made a tool called TensorVox for setting up an TTS environment on Windows and included a german TTS model trained by monatis. Thanks for sharing that. See it in action on Youtube.

Support & Thanks

If you like my voice contribution and would like to support my effort for an opensource voice technology future, you can support me, if you like:

I want to say thank you to great people who supported me on this journey with nice words, support and compute power: Thanks El-Tocino, Eren Gölge, Gras64, Kris Gesling, Nmstoker, Othiele, Repodiac, SanjaESC, Synesthesiam.

Special thanks to my dear colleague, Sebastian Kraus, for supporting me with audio recording equipment and for being the creative mastermind behind the logo design and of course to the dear Dominik (@domcross) for him being so close by my side on this amazing journey.

"Thorsten-Voice" youtube channel

On my Thorsten-Voice youtube channel you can find step by step (cooking recipes) tutorial on opensource voice technology. If you're interested i'd be happy to welcome you as new subscriber on my wonderful youtube community.TS** on my little .

Conference speaker

I really like to talk about the importance of an opensource voice technology future. If you would like me to be a speaker on a conference or event i'd happy to be contacted using the Thorsten-Voice website contact form. See some of my speaker references on Thorsten-Voice website.