Home

Awesome

Text-to-sound Synthesis

This is the open source code for our paper "Text-to-sound Synthesis". We will release all of the code, pre-trained models after the paper is accepted.

Overview

Pretrained Model

We release four text-to-sound pretrained model. Including VQVAE trained on Audioset, Vocoder trained on Audioset, generation model trained on Audiocaps and Audioset.

Inference

Training

Cite