KREA AI Profile picture
Aug 24 23 tweets 6 min read
init images in #stablediffusion explained! ✨👇🏼 Image
the use of init images consists of a simple trick that allows us to start our generations from an image.
init images give us a lot of control over our generations.

we can choose where to place each element and what style, colors, or pose it should have. Image
while this technique might look similar to #dalle2 variations, this is a completely different thing.
DALL-E 2 can produce variations that respect the style and content of our images.

with init images, we aggregate generated content on top of our image, and we can even decide how much original content we want to preserve.
the technique behind init images is quite simple...

if we know a bit about how diffusion models work.
let’s get started 🤓
diffusion models are currently state-of-the-art for image generation.

they are at the core of AI models like DALL-E 2 (@OpenAI), Imagen (@GoogleAI), and Stable Diffusion. (@StabilityAI).
the process of generating images with diffusion consists of a sequence of denoising steps that recursively remove noise from a noisy image until it gets to something that is noise-free and looks realistic.
take a deep breath here 🧘
if you played with dream studio, you’ll recognize the “step” parameter.

this parameter controls the number of denoising steps that we want stable diffusion to perform.

the larger, the more noise it will remove (although after 50 steps most of the noise is already gone). Image
the following image depicts this process.

the denoising sequence goes from right (complete noise) to left (realistic cute cat), and how the output at each denoising step is the input of the one that proceeds. Image
the technique behind init images consists of:

1) adding noise to our real image
2) inserting it in the middle of the denoising process, and
3) making the diffusion model “think” that it needs to keep denoising from there. Image
if we add our init image too early in the diffusion sequence (to the right of the previous diagram), we will see a lot of change in the final result.

if we add it too late in the process (to the left of the diagram), the init image will be unchanged.
knowing where to place our init images is part of the prompt engineering process with this technique.
and folks are already getting GOOD at it!!
@blessedbck is using this technique with mastery to generate stunning vintage photographs 🤯 Image
init images are also great for transferring a specific style to our generations. Image
they also enable us to create videos; we just need to apply the technique to each frame.

here’s some ongoing experimentation using the movements of a jellyfish.

and here's a bonus tip: init images work great for memes!

this is “Snoop de Lucia” Image
hope this thread was useful, don’t hesitate to post questions if something is not clear ✨
Image

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with KREA AI

KREA AI Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @krea_ai

Aug 22
The Prompter #004 is out! ✨👇

theprompter.substack.com/p/the-prompter…
📰 AI news

@StabilityAI released the beta of Dream Studio.

@marco_ppasini created an amazing (and FAST) music generator.
🛠️ tools for prompting

@pharmapsychotic released a tool to preview animations with disco diffusion.

@Artbreeder is working on integrating their collage tool with Stable Diffusion.

@cut_pow shared a Colab Notebook with a method to apply inpainting to Disco Diffusion.

...
Read 8 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(