🧵: I discovered this woman, who I call Loab, in April. The AI reproduced her more easily than most celebrities. Her presence is persistent, and she haunts every image she touches. CW: Take a seat. This is a true horror story, and veers sharply macabre.
I'll explain negative prompt weights, in case you don't know. With these, instead of creating an image of the text prompt, the AI tries to make the image look as different from the prompt as possible. This logo was the result of the negatively weighted prompt "Brando::-1".
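Here is a toy sketch of the idea, not the actual implementation of any image generator. It assumes prompts and images live in a shared embedding space and that a weight simply multiplies a similarity score. The 2-D vectors and the names `brando`, `candidates`, `weighted_score`, and `best` are made up for illustration.

```python
import math

def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def weighted_score(candidate, prompts):
    """Sum of weight * similarity over (embedding, weight) pairs."""
    return sum(weight * cosine(candidate, emb) for emb, weight in prompts)

# Made-up 2-D "embeddings"; real ones have hundreds of dimensions.
brando = [1.0, 0.2]
candidates = {
    "portrait": [0.9, 0.3],    # points the same way as the prompt
    "landscape": [0.1, 1.0],   # mostly unrelated to the prompt
    "logo": [-0.8, -0.1],      # points the opposite way
}

def best(prompts):
    """Name of the candidate with the highest weighted score."""
    return max(candidates, key=lambda name: weighted_score(candidates[name], prompts))

print(best([(brando, 1.0)]))    # weight 1: closest match wins
print(best([(brando, -1.0)]))   # weight -1: farthest candidate wins
```

With weight 1 the toy picks "portrait"; flipping the weight to -1 makes it pick "logo", the candidate pointing most directly away from the prompt.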
I wondered: is the opposite of that logo, in turn, going to be a picture of Marlon Brando? I typed "DIGITA PNTICS skyline logo::-1" as a prompt. I received these off-putting images, all of the same devastated-looking older woman with defined triangles of rosacea(?) on her cheeks.
My friend made this image of a "[...] hyper compressed glass tunnel surrounded by angels [...] in the style of Wes Anderson". I innocently combined this image with the original image of Loab in an image prompt, without text. For reasons we can't fully explain, nightmares ensued.
Through some kind of emergent statistical accident, something about this woman is adjacent to extremely gory and macabre imagery in the distribution of the AI's world knowledge.
Since Loab was discovered using negative prompt weights, her gestalt is made from a collection of traits that are equally far away from something. But her combined traits are still a cohesive concept for the AI, and almost all descendent images contain a recognizable Loab.
*EXTREME* GORE WARNING. The angel hallway + Loab also produced images with such copious gore that probably very few people want to see them, but here are two. I don't feel comfortable posting the most disturbing ones, borderline snuff images of dismembered, screaming children.
There is something moving to me about these grotesque scenes and the desperation, panic, and sadness that they convey. Again, these are produced with other images as inputs, and no text. They are the result of "cross-breeding" images of Loab with images of other things.
The images that result from crossing Loab with other images can in turn be crossbred with other images. The AI can latch onto the idea of Loab so well that she can persist through generations of this type of crossbreeding, without using the original image. Here is Loab as Kirby:
Here is Loab as a bee, and Loab celebrating Pride month. Loab can be recognizably transposed into many genres and contexts.
Even when her red cheeks or other important features disappear, the "Loabness" of the images she has a hand in making is undeniable. She haunts the images, persists through generations, and overpowers other bits of the prompt because the AI so easily optimizes toward her face.
Combining Loab with text prompts works great, too. Her signature rosacea cheeks even turn blue when I prompt for a Na'vi version of her from Avatar: The Way of Water (2022).
I started going kind of insane at this point. I had hundreds of Loab images and was starting to combine her with 3 or 4 other images at once. Most of the horror images I post even outside of this thread are descendants of the Loab lineage. Sometimes it takes more abstract forms.
The concept of "Loabness" became more abstract to me. I would include her in prompts that I knew would distort her almost beyond recognition. After she disappeared from the image breeding lineage, she would sometimes reappear, later down the line, out of nowhere.
I was ripping Loab apart, and putting her back together. She is an emergent island in the latent space that we don't know how to locate with text queries. But for the AI, Loab was as strong a point of convergence as any verbal concept. And really, she was usually stronger!
The big lesson for me here with Loab is that image prompting can essentially be used as your custom vector to query the latent space. You can produce novel styles (and characters!) that you literally discover. Negative prompt weighting can help you find emergent accidents, too.
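To make "custom vector" concrete, here is a minimal sketch under loud assumptions: it treats each image as a unit vector, blends two of them by averaging, and uses the blend to look up the nearest item in a tiny made-up library. The vectors and the names `blend`, `nearest`, and `library` are hypothetical, and this is not a claim about how any particular tool mixes image prompts.

```python
import math

def normalize(v):
    """Scale a vector to unit length."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

def blend(embeddings):
    """Average several unit vectors and renormalize into one query vector."""
    units = [normalize(e) for e in embeddings]
    dims = len(units[0])
    mean = [sum(u[i] for u in units) / len(units) for i in range(dims)]
    return normalize(mean)

def nearest(query, library):
    """Name of the library item most similar (by cosine) to the query."""
    def similarity(name):
        return sum(q * x for q, x in zip(query, normalize(library[name])))
    return max(library, key=similarity)

# Made-up 3-D "image embeddings" for two input images.
image_a = [1.0, 0.0, 0.0]
image_b = [0.0, 1.0, 0.0]

# Made-up library of points in the same space.
library = {
    "only_a": [1.0, 0.0, 0.1],
    "only_b": [0.0, 1.0, 0.1],
    "both": [1.0, 1.0, 0.0],
    "neither": [0.0, 0.0, 1.0],
}

query = blend([image_a, image_b])
print(nearest(query, library))
```

The blended query lands on "both", a point reached without any text label for it, only by combining the two inputs.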
The other lesson is that image prompts, and later raw vectors and CLIP embeddings, can be used as adversarial attacks, targeting weird stuff in the distribution. I think my process in itself constitutes art, but it also reveals the AI's vulnerability to malicious use in other cases.
By the way. Loab seems to be recognized extremely consistently with @Buntworthy's days-old implementation of image prompts for Stable Diffusion. She's everywhere, hiding. Good luck sleeping tonight!
I'm going to continually update this thread with Loab-influenced content since I have at least a thousand images with her fingerprints on them. Check back for your daily Loab sighting. She finds everyone sooner or later. You just have to know where to look.
Loab: the first cryptid of the latent space. Perfect.
It’s true. StackOverflow is basically obsolete. My coding workflow has changed completely. ChatGPT even implemented several novel body parts to improve the functionality of the Baby class (!!!)
Learning a lot about the long, untold history of the “no-no button” - even Darwin was fascinated by it.
I've tried to simplify this in interviews (& I suck at math): The deep learning process organizes all faces along a main face-axis in multidimensional space. If you follow this vector to its end you find Loab: the last face; an eigenface that one type of face-ness extends toward.
Probably, all the different face-nesses are represented on different axes and the Loab axis is just one of many. All faces are found in relationship to these axes. Collapse the idea to an x,y plane: imagine a simple system of 2 face-ness axes, where one is a Loabness axis.
The map of possible faces is finite though, so it has a geometry with edges and vertices and outward protrusions. A negatively weighted prompt makes you likely to get caught inside one of these extremities with the spooky eigenfaces and the very few non-face things that are nearby.
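A toy version of that picture, purely as an illustration of my mental model and not of how the model actually works: assume the map is a bounded square, and a negatively weighted prompt keeps pushing a point away from the prompt's location until it hits the boundary. The function `push_away` and all the numbers are made up.

```python
def push_away(start, prompt, steps=50, rate=0.1, bound=1.0):
    """Repeatedly step away from `prompt`, clamping to the square [-bound, bound]^2."""
    x = list(start)
    for _ in range(steps):
        # Direction pointing from the prompt to the current point.
        direction = [xi - pi for xi, pi in zip(x, prompt)]
        # Move along it, then clamp each coordinate to the finite map.
        x = [max(-bound, min(bound, xi + rate * di)) for xi, di in zip(x, direction)]
    return x

prompt = [0.3, 0.2]
for start in ([0.1, -0.05], [0.2, 0.0], [0.5, 0.1], [0.4, 0.6]):
    print(start, "->", push_away(start, prompt))
```

In this toy, every starting point ends up pinned in a corner of the square, and nearby starts share the same corner. That is the sense in which many different negative prompts could keep landing in the same few extremities.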
A brief typology of AI feet.
These are created with the simple prompt "feet" in Midjourney.
Foot type 1: Non-terminal toes. Toes extend from the foot and fuse to each other, or loop back into the foot. A foot is a bundle of toes fused together into a thick main trunk.
Foot type 2: Prehensile heels. Heels are elongated, similar to high-heeled shoes or lotus foot binding. They act as the "thumb" of the foot. They grow vestigial toes of their own.
Foot type 3: The Foot-in-itself. These represent the transcendental ideal of a foot, liberated from its real-world context as a human appendage. They are generally oriented towards the heavens, like monumental pillars of glory.