Home

Awesome

(Feel free to suggest changes)

Papers

Expansive Summaries

<details> <summary> End-to-End Adversarial Text-to-Speech: http://arxiv.org/abs/2006.03575 (Click to Expand)</summary> </details> <details> <summary> Fast Speech2: http://arxiv.org/abs/2006.04558 (Click to Expand)</summary> </details> <details> <summary> Glow-TTS: https://arxiv.org/pdf/2005.11129.pdf (Click to Expand)</summary> </details> <details> <summary> Non-Autoregressive Neural Text-to-Speech: http://arxiv.org/abs/1905.08459 (Click to Expand)</summary> </details> <details> <summary> Double Decoder Consistency: https://erogol.com/solving-attention-problems-of-tts-models-with-double-decoder-consistency (Click to Expand)</summary> </details> <details> <summary> Parallel Tacotron2: http://arxiv.org/abs/2103.14574 (Click to Expand)</summary> </details> <details> <summary> WaveGrad2: https://arxiv.org/pdf/2106.09660.pdf (Click to Expand)</summary> <img src="https://user-images.githubusercontent.com/1402048/122779556-447b3600-d2ae-11eb-8544-187ea5668966.png" height="450"/> </details>

Multi-Speaker Papers

Expansive Summaries

<details> <summary> Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation: http://arxiv.org/abs/2005.08024 </summary>

Demo page: https://ttaoretw.github.io/multispkr-semi-tts/demo.html <br> Code: https://github.com/ttaoREtw/semi-tts image

</details> <details> <summary> Attentron: Few-shot Text-to-Speech Exploiting Attention-based Variable Length Embedding: https://arxiv.org/abs/2005.08484 </summary>

Demo page: https://hyperconnect.github.io/Attentron/ <br> image image

</details> <details> <summary> Towards Universal Text-to-Speech: http://www.interspeech2020.org/uploadfile/pdf/Wed-3-4-3.pdf </summary>

image image

</details> <details> <summary> AdaSpeech: Adaptive Text to Speech for Custom Voice: https://openreview.net/pdf?id=Drynvt7gg4L </summary>

Demo page: https://speechresearch.github.io/adaspeech/ <br> image image image image image image

</details>

Attention


Vocoders

<details> <summary>WaveGrad: https://arxiv.org/pdf/2009.00713.pdf </summary>

Code: https://github.com/ivanvovk/WaveGrad image

</details>

From the Internet (Blogs, Videos etc)

Videos

Paper Discussion

Talks

General

Jupyter notebooks

Blogs