Janelle Shane
Jul 2, 2021 · 15 tweets · 7 min read
What fascinates me about generating images with VQGAN+CLIP is that it CAN generate depth and drama, but only if you know how to ask for them.

"A herd of sheep grazing on a lush green hillside" alone

vs with "amazing awesome and epic" added
ai-weirdness.ghost.io/the-art-of-ask…
[Image: A flatly illuminated green clifftop with low shrubs and palm…]
[Image: Deeply eroded green cliffs rise above some grass-topped mesa…]
Because CLIP is trained on internet images and text, it associates the "good" images with certain phrases.

"A herd of sheep grazing on a lush green hillside" before vs after adding "in the style of disney trending on artstation | unreal engine"
[Image: A flatly illuminated green clifftop with low shrubs and palm…]
[Image: Steep cliffs encircle some grassy mesas, with clumps of gras…]
I experimented with different ways of asking CLIP+VQGAN for an attractive version of "a herd of sheep grazing on a lush green hillside"

"Award winning national geographic photography" produced impressive scenery but the sheep look like people crawling under green blankets.
[Image: Near photorealistic red cliffs and jungle textures in the ba…]
Adding "by Bob Ross" to "a herd of sheep grazing on a lush green hillside" did get CLIP+VQGAN to improve the composition dramatically, but gave all the sheep Bob Ross hair.
[Image: Steep domed mountains rise from among lakes and happy little…]
Adding "by Tim Burton" to "a herd of sheep grazing on a lush green hillside" got CLIP+VQGAN to generate this very cool looking image. Not sure what happened to the sheep though.
[Image: Skeletal trees, dark picket fences, and swirling, brooding m…]
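The comparisons above all follow one pattern: hold the base caption fixed and append one modifier phrase at a time. Below is a minimal Python sketch of that pattern, using only phrases quoted in the thread. `with_modifier` is a hypothetical helper, and joining with a single space is an assumption, since the thread does not show every full prompt string.

```python
# Base caption and modifier phrases, all quoted from the thread.
BASE = "A herd of sheep grazing on a lush green hillside"

MODIFIERS = [
    "amazing awesome and epic",
    "in the style of disney trending on artstation | unreal engine",
    "Award winning national geographic photography",
    "by Bob Ross",
    "by Tim Burton",
]


def with_modifier(base: str, modifier: str, sep: str = " ") -> str:
    """Append one modifier phrase to a base caption.

    Hypothetical helper: the separator is an assumption, not something
    the thread specifies.
    """
    return f"{base.strip()}{sep}{modifier.strip()}"


# The unmodified caption comes first so every variant has a baseline
# to compare against.
variants = [BASE] + [with_modifier(BASE, m) for m in MODIFIERS]

for prompt in variants:
    print(prompt)
```

Each printed line is one text prompt to paste into the notebook. Keeping the base caption identical across runs is what makes the before/after comparison meaningful.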
I hate that one of the most effective ways to prompt CLIP+VQGAN to generate a realistic and attractive landscape is to ask for this:

"A herd of sheep grazing on a lush green hillside | dramatic atmospheric ultra high definition free desktop wallpaper"
[Image: Clumps of mist scatter a mountainous and dramatically lit la…]
Using the spammy "A herd of sheep grazing on a lush green hillside | dramatic atmospheric ultra high definition free desktop wallpaper" prompt as a starting point leads CLIP+VQGAN to some irritatingly gorgeous places.

Here, I added "cubist cezanne".
[Image: Meadows rise in terraces into the misty distance, sharply ou…]
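The thread does not explain what the "|" in these prompts does. A plausible reading, and an assumption here, is that the notebook splits the text on "|" and treats each piece as a separate prompt for CLIP to score against. Here is a minimal Python sketch of that splitting step. `split_prompt` is a hypothetical name, and the optional `:weight` suffix is an assumption about how such notebooks commonly work, not something the thread shows.

```python
def split_prompt(text: str) -> list[tuple[str, float]]:
    """Split a prompt on "|" into (phrase, weight) pairs.

    Hypothetical helper. A trailing ":<number>" is read as a weight
    (an assumed convention); otherwise the weight defaults to 1.0.
    """
    parts = []
    for chunk in text.split("|"):
        chunk = chunk.strip()
        if not chunk:
            continue  # ignore empty pieces such as a trailing "|"
        phrase, sep, tail = chunk.rpartition(":")
        weight = 1.0
        if sep:
            try:
                weight = float(tail)
                chunk = phrase.strip()
            except ValueError:
                pass  # the colon was part of the text, not a weight
        parts.append((chunk, weight))
    return parts


prompt = (
    "A herd of sheep grazing on a lush green hillside | "
    "dramatic atmospheric ultra high definition free desktop wallpaper"
)
for phrase, weight in split_prompt(prompt):
    print(f"{weight:g}  {phrase}")
```

Under this reading, the "wallpaper" phrase is its own target, which would explain why it can reshape the whole scene rather than just decorate the sheep caption.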
I had VQGAN+CLIP generate "A herd of sheep grazing on a lush green hillside | dramatic atmospheric ultra high definition free desktop wallpaper by lisa frank" and got this absolutely apocalyptic landscape.

I think those slippery purple things may be what's become of the sheep.
[Image: A mostly purple sky is shot with lightning and rainbows, whi…]
This experiment illustrates an interesting aspect of generating stuff with big internet-trained models: they've seen a lot of crummy examples of what you're looking for, and those are just as valid to them as the good ones.

It CAN generate the good stuff. But how do you ask for it?
For more technical details on CLIP+VQGAN and other methods of steering CLIP, plus some gorgeous example images, I recommend this post by @sea_snell
You can generate CLIP+VQGAN images yourself for free! I used a version by @RiversHaveWings inspired by @advadnoun's Big Sleep notebook.

Tutorial linked here:
Here's an online @runwayml demo of a much earlier AI called AttnGAN. It tries.
No, you're absolutely right, the sheep are uniformly cursed.

Part of why I chose a herd of grazing sheep is that image recognition algorithms have historically had trouble distinguishing sheep from the landscape: aiweirdness.com/post/171451900…

"a crummy image of a herd of sheep grazing on a lush green hillside"
[Image: Photo is of sheeplike blobs in front of lumpy hills. Everyth…]
"a herd of sheep grazing on a lush green hillside realistic please and not some horrible AI atrocity"

much better than i expected to be honest
[Image: The image has a feel of a low resolution photo filter with a…]
