How to get URL link on X (Twitter) App
The differences are more data, more training, and a less restrictive filtering of the dataset. The dataset for v2.0 was filtered aggressively LAION’s NSFW filter, making it a bit harder to get similar results generating people.
Text-to-Image
https://twitter.com/RiversHaveWings/status/1589724378492592128The upscaler is itself also a diffusion model. It was trained on a high-resolution subset of the LAION-2B dataset. Being a 2x upscaler, it can take the usual 512x512 images obtained from Stable Diffusion and upscale it to 1024x1024. (2/5)
What does "fine-tuned decoders" even mean? Well, basically, Stable Diffusion actually is a diffusion model that operates in a compressed space, which is then "decoded" into a full-resolution image. This decoder itself is also a trained neural network.