anton 🏴‍☠️ Profile picture
Feb 5 22 tweets 4 min read
announcing stable attribution - a tool which lets anyone find the human creators behind a.i generated images.

stableattribution.com
given any image generated by stable diffusion 1.x or 2.x, stable attribution finds the images in the model's training set which most contributed to the generated image
we (chroma) have been in crash development on this since mid-december. once we saw the technical approach was feasible, all our effort went into making this easy to use, scalable, and highly available
the discourse around generative image models, especially the use of creative's work in their training sets, has grown ever more difficult.

i was often in error myself, but things built up so i kept digging
many of my assumptions were wrong
- the training data was not public domain, and often not licensed at all
- the models do replicate images in their training sets, and can be made to do so reliably

i started paying attention to what creatives were actually saying.
historically, creatives have been among the first to embrace new technologies. ever since the renaissance, artists have picked up every new tool as it's become available, and used it to make great things.

these people aren't 'luddites'
creatives want credit and compensation for their work. seeing generative image models in action shocked them (as it did everyone), but still more shocking was the fact that their work had contributed to these models without their knowledge or consent
this is completely normal, and should be very familiar to technologists. the first thing after the title of every scientific paper you read is the list of the contributing authors, details of their affiliations, in rough order of contribution.
the responses from technologists were, frankly, embarrassingly midwit - either attacking artists as 'luddites', claiming that the machine 'learns' like a human so artists have no claim to attribution on the derivative works (wrong), and a bunch of other pretty dumb things
here's a particularly embarrassing example, which manages to be simultaneously wrong about
- how image compression works
- how latent diffusion models work
- whether they can reproduce images in their training set
- what a derivative work is, legally
understanding what models are doing based on their training data is core chroma technology.

so instead of wading into the discourse, we decided to try to solve the actual problem - source attribution
i'll post some technical details in another thread, but the short version is we found an approach that would work for stable diffusion - the training set (LAION) is publicly available and the weights are open source, which allowed us to build & test effectively
what we have right now is the first cut of the algorithm - we wanted to get this out fast, while it still mattered.

i dreaded waking up one day in a horror hellscape of dmca 2.0, hence crash development rather than an extended research program.

it can be made to work better.
it *only works correctly* if you upload images from stable diffusion 1.x / 2.x

we cannot stop you uploading any image you want into it, because almost all available hosted versions of stable diffusion have the watermark in the oss repo removed

here's what we hope to achieve:

- show that attribution is possible. this lets creators get credit for their work, and opens the door to them being compensated for every generated image their work contributed to / influenced.
besides being the right thing to do, crediting and/or compensating creators for their contributions to generative models creates a virtuous cycle; cutting them out is eating the seed corn for future models
- get ahead of the legal situation before it goes really badly for everyone. generative image models are amazing technology; stable attribution demonstrates they can be made and used in a way which benefits everyone involved
i spoke to law profs about this - the analogy which kept coming up is the vcr. initially basically a piracy machine, it brought to life an enormous content market. had it been banned, creators would have been worse off in the long run.
here's how you can help.

- help us find links to artists' work. we have urls for the source images, but they're often served by a cdn, so we can't link directly to profiles, though we *really* want to.
- get the word out, especially among the creative community. there is a better way.
i’ll be doing a twitter space about this tonight including time for q&a twitter.com/i/spaces/1jMJg…
the purpose of technology is human flourishing.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with anton 🏴‍☠️

anton 🏴‍☠️ Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @atroyn

Feb 6
stable attribution has been out for about 24 hours now, it's certainly created a response.
as promised i'll go into a high level technical overview in this tread.
first, i want to repeat, we cannot detect whether images people upload to stable attribution are generated by stable diffusion or not.

our algorithm works on all input images. the results are only meaningful for images generated with the vanilla open source stable diffusion.
'is stable attribution just doing similarity search?' no.

however, similarity search is a reasonable starting point, because of how latent diffusion models (ldm) like stable diffusion (sd) are trained, though not all similarity search is created equal.
Read 26 tweets
Nov 26, 2022
i read maybe 10x faster than i can intelligibly listen to something, and i frequently skip around to remind myself of something or make sure i understood it

read faster nerds
i got over “listening” to “books” when my parents stopped reading to me as a kid because i’d learned to read for myself
i’m increasingly convinced most people “in tech” are functionally illiterate
Read 4 tweets
Jun 26, 2022
robotics, especially general purpose robotics, is a crucial technology on the evolutionary path of humanity, but practically no robotics company is an investable business.

this is a market failure and existential threat.
before i get started i want to say i admire almost every robotics founder i meet for daring to even try. particular favorite robotics companies include @agilityrobotics, starsky (now defunct), @SkydioHQ, @zipline and many others
in short, robotics companies almost all have terrible fundamentals from a venture capital perspective.

there are two reasons;
Read 19 tweets
May 23, 2022
we should teach the a.i to make guns, hear me out

this is the best path toward general assistive models for engineering design
it’s been demonstrated that improving attention based / generative models like transformers (e.g. for large language models) is mainly about scaling parameters.

in text, the bottleneck is compute, but that’s because we have a very large always available text corpus, the web
we’ve also seen pretty good performance for transformers / attention based models in some physical tasks like protein folding, drug discovery etc. in part because of the more limited state spaces and some ability to simulate

(e.g. alphafold) deepmind.com/publications/h…
Read 10 tweets
Apr 28, 2022
my most prog opinion is that nobody should have to work for a living and my most reactionary opinion is that most people aren’t good at anything and don’t want to be
i’m glad that these are completely consistent positions
i remember at a dinner @Aella_Girl was like “but what would you do if you were on a desert island” and my immediate reflexive response was “get off the island”, which is also entirely consistent with my worldview
Read 5 tweets
Apr 26, 2022
americans are not a free people until it’s legal to drink a beer walking down the street or with your friends in the park.

this is a right more fundamental than the second amendment because beer is far more ancient than firearms.
‘here in the united states we believe in freedom’ says the guy who has to wrap his pbr tallboy in a paper bag if he wants to drink it right away from the corner store
in this country it’s not even legal to drink a beer *as a passenger* in a vehicle

that’s fucked up and you people don’t even realize it
Read 5 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(