TheSequence

4 Tweets Jan 01, 2023

3 research papers to understand text-to-image synthesis models better.
1. The original latent diffusion paper @LMU_Muenchen
2. @GoogleAI’s Parti
3. The original VQGAN+CLIP paper
A thread 🧵👇

1. The original latent diffusion paper
Latent diffusion is one of the most popular techniques behind text-to-image synthesis models. It runs the diffusion process in a compressed latent space instead of directly on pixels.
Paper: arxiv.org
Code: github.com

2. Google AI’s Parti
Parti is Google’s alternative to diffusion-based text-to-image methods: it uses an autoregressive model that generates the image as a sequence of tokens.
Paper: arxiv.org
Code: github.com