What fascinates me about generating images with VQGAN+CLIP is that it CAN generate depth and drama, but only if you know how to ask for them.

"A herd of sheep grazing on a lush green hillside" alone

vs with "amazing awesome and epic" added
ai-weirdness.ghost.io/the-art-of-ask… A flatly illuminated green clifftop with low shrubs and palmDeeply eroded green cliffs rise above some grass-topped mesa
Because CLIP is trained on internet images and text, it associates the "good" images with certain phrases.

"A herd of sheep grazing on a lush green hillside" before vs after adding "in the style of disney trending on artstation | unreal engine"
A flatly illuminated green clifftop with low shrubs and palmSteep cliffs encircle some grassy mesas, with clumps of gras
I experimented with different ways of asking CLIP+VQGAN for an attractive version of "a herd of sheep grazing on a lush green hillside"

"Award winning national geographic photography" produced impressive scenery but the sheep look like people crawling under green blankets. Near photorealistic red cliffs and jungle textures in the ba
Adding "by Bob Ross" to "a herd of sheep grazing on a lush green hillside" did get CLIP+VQGAN to improve the composition dramatically, but gave all the sheep Bob Ross hair. Steep domed mountains rise from among lakes and happy little
Adding "by Tim Burton" to "a herd of sheep grazing on a lush green hillside" got CLIP+VQGAN to generate this very cool looking image. Not sure what happened to the sheep though. Skeletal trees, dark picket fences, and swirling, brooding m
I hate that one of the most effective ways to prompt CLIP+VQGAN to generate a realistic and attractive landscape is to ask for this:

"A herd of sheep grazing on a lush green hillside | dramatic atmospheric ultra high definition free desktop wallpaper" Clumps of mist scatter a mountainous and dramatically lit la
Using the spammy "A herd of sheep grazing on a lush green hillside | dramatic atmospheric ultra high definition free desktop wallpaper" prompt as a starting point leads CLIP+VQGAN to some irritatingly gorgeous places.

Here, I added "cubist cezanne". Meadows rise in terraces into the misty distance, sharply ou
I had VQGAN+CLIP generate "A herd of sheep grazing on a lush green hillside | dramatic atmospheric ultra high definition free desktop wallpaper by lisa frank" and got this absolutely apocalyptic landscape.

I think those slippery purple things may be what's become of the sheep. A mostly purple sky is shot with lightning and rainbows, whi
This experiment illustrates an interesting aspect of generating stuff with big internet-trained models: it's seen a lot of crummy examples of what you're looking for, and those are just as valid to it as the good ones.

It CAN generate the good stuff. But how do you ask for it?
For more technical details on CLIP+VQGAN and other methods of steering CLIP, plus some gorgeous example images, I recommend this post by @sea_snell
You can generate CLIP+VQGAN images yourself for free! I used a version by @RiversHaveWings inspired by @advadnoun's Big Sleep notebook.

Tutorial linked here:
Here's an online @runwayml demo of a much earlier AI called AttnGAN. It tries.
No, you're absolutely right, the sheep are uniformly cursed.

Part of why I chose a herd of grazing sheep is image recognition algorithms have historically had trouble with distinguishing the sheep from the landscape: aiweirdness.com/post/171451900…

"a crummy image of a herd of sheep grazing on a lush green hillside"
Photo is of sheeplike blobs in front of lumpy hills. Everyth
"a herd of sheep grazing on a lush green hillside realistic please and not some horrible AI atrocity"

much better than i expected to be honest
The image has a feel of a low resolution photo filter with a

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Janelle Shane

Janelle Shane Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @JanelleCShane

5 Jan
So I was all set to dig into the DALL-E article and really think about how they got it to generate coherent pictures from descriptions

And then I saw they included giraffe

Currently obsessively flipping through all their giraffe remixes Text prompt is "A prof...
"giraffe octopus" is giving it more trouble but some of these are still pretty good. most of the generated image...
Giraffe-invertebrate hybrids are giving it more trouble.

Writing the word "giraffe" real small next to a drawing of a mantis does not make it a giraffe-mantis hybrid but I like the thought. text prompt: "A profes...
Read 13 tweets
21 Dec 20
Got GPT-3 to generate new Christmas carols.

Here is "Mild is Rudolph". Are you not cheered and comforted?
aiweirdness.com/post/638130829… Mild is Rudolph  Mild is Rudolph’s image in the snow He ha
Impressed at the humanlike structure of GPT-3's generated carols

yet disconcerted at what it thinks is a "joyful noise" O Come Rudolph, Come  O Come Rudolph, Come Ye Faithful Oh co
GPT-3 knows that what one does is praise Rudolph. It's interesting to see what powers it thinks Rudolph has. Oh look! There’s Santa and Parson Brown Defying the laws o
Read 8 tweets
11 Feb 20
people sometimes wonder why i am like this candy heart shaped cookies with messages: Bog Love, All Hover, Yak O Way, and Love 2000 Hogs Yea
nothing says love like candy hearts written by a confused neural net
aiweirdness.com/post/170685749… Team Bear Time Hug Fang Bog Love all hover sweat poo u hack stank love hole love 2000 hogs yea heart me time bear wink bear bear bear be my bear yak o way you are bare mage love oy in a fan cool cud you rear my hag or my bun book
This particular neural net didn't have any prior training - all it knows about english is what was in the existing candy heart messages I gave it.

It has sort of figured out which letter combos might be possible, but uses them weirdly. aiweirdness.com/post/170820844… am good love bun dear me heart me me love have nice heat team ooo time bat oh love cat bee love love bot my my tank love o saol bite me my my 13 you are by by by by <3
Read 8 tweets
7 Feb 20
WHAT HAS OCCURRED CANNOT BE UNDONE

I have trained a neural net on a crowdsourced set of vintage jello-centric recipes

I believe this to possibly be the worst recipe-generating algorithm in existence FAIR AND MOOSE   Ingredient...
The training data contained a lot of things.

It contained eel only once. For some reason the AI has decided to use eel a LOT.

It also invents ingredients
EELS IN SILENCE 1/2 lb. but...
Here is another AI eel recipe.

The title is misleading.

Unfortunately that's not a good thing. PINOES WITH VEGETABLE EELS ...
Read 14 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!

:(