Reid Southen Profile picture
Sep 16, 2024 18 tweets 8 min read Read on X
I'm tired of having to prove to people that generative AI systems infringe by default, so here's a megathread of images I've prompted you can use in your discussions.

Article by @GaryMarcus and myself at the end. 1/🧵 Image
We've show that very simple descriptions can result in images that are strikingly similar to the training data. 2/🧵 Image
This happens with countless properties, including multiple different Batman incarnations. 3/🧵 Image
The differences you often see from the training data are related to how Midjourney tries to stylize the lighting to be more filmic or photographic, which is why so many people like its output. 4/🧵 Image
This this is shockingly easy to do, getting output VERY close to the training data with very few words. 5/🧵 Image
Sometimes you stumble on a prompt that will produce nearly the same image EVERY time. 6/🧵 Image
This goes for games as well, these prompts produced almost the same Last of Us image every single generation, ad infinitum. 7/🧵 Image
The main thing people push back on is that, "You asked for those properties though!"

This is true, in order to show how the data can be extracted. But you don't need to name a property. This single 2x2 grid was generated from just "movie screencap," and has multiple IPs. 8/🧵 Image
And if you're patient, and do some digging, you'll realize it will give you frames that are VERY similar to existing images , with just 1-3 non-specific words. 9/🧵 Image
And even when they don't match closely to existing frames, just 1 word can give you multiple properties. These are all very obvious, well known properties, but what happens when it steals from one you don't recognize? 10/🧵 Image
Midjourney and others shouldn't be able to produce such obvious infringing content without asking for it. It shouldn't be able to do it WITH asking for it either, but this shows how egregious their company and model are. 11/🧵 Image
Continuing with more images and thoughts, this output in particular is proof they train on movie trailers, because this scene wasn't in the film. 12/🧵 Image
But it's also clear they're either training on full films and/or or screencap websites as well, because some shots I was only able to find by combing through the movies themselves, which was the case with some of the Top Gun stuff, and even Avengers. 13/🧵 Image
Here's a small portion of a sprawling PureRef working document I have where I was trying to match frames to films that Midjourney spat out. It's time consuming, but I've found a lot more than we've shared. 14/🧵 Image
Here are additional frames that were prompted in Midjourney simply with "popular movie screencap". Why is it giving so many perfect IP images with no indication that's what I wanted? That's not how these systems are supposed to work. 15/🧵


Image
Image
Image
Image
Here are more when simply prompting Midjourney with "popular movie screencap." 16/🧵


Image
Image
Image
Image
And are some from just the Midjourney prompt "movie screencap." 17/🧵


Image
Image
Image
Image
As Cap said, I can do this all day, but I'll leave it here for now. "Popular movie screencap."

Please read mine and @GaryMarcus' article.



18/🧵 spectrum.ieee.org/midjourney-cop…
Image

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Reid Southen

Reid Southen Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @Rahll

May 7, 2024
OpenAI's new opt-out announcement is a load of BS, just like their last one. Let me break it down and explain why it's nonsense, and how they're lying to you. 🧵

openai.com/index/approach…
Image
This is the first red flag. If OpenAI truly thinks training fair use, why offer opt-out at all? 🧵 Image
They are NOT pro writers, artists, or journalists, and claim to not be in those lines of business, yet here they are developing AI to disrupt those businesses.

And we know for a fact they don't listen to members of these communities.

Just straight lies.🧵 Image
Read 12 tweets
Apr 22, 2024
• A quarter of illustrators (26%) and over a third of translators (36%) have already lost work due to generative AI.

• Over a third of illustrators (37%) and over 4 in 10 translators (43%) say the income from their work has decreased in value because of generative AI. Image
Image
Image
Read 4 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(