Awesome
Text-to-sound Synthesis
This is the open source code for our paper "Text-to-sound Synthesis". We will release all of the code, pre-trained models after the paper is accepted.
Overview
Pretrained Model
We release four text-to-sound pretrained model. Including VQVAE trained on Audioset, Vocoder trained on Audioset, generation model trained on Audiocaps and Audioset.