Jason Kint Profile picture
Dec 27, 2023 19 tweets 7 min read Read on X
ok, I've now read the full NYT complaint filed this morning vs OpenAI and Microsoft. I'm impressed - it's future-focused around fair value for work vital to democracy. It also contains 220k pages of exhibits although the pages of Ex J stood out to me. more on that in a minute. /1 Image
The complaint is a must-read imho, it's the only way to understand the alleged violations and the extent as to which the systems have been designed and tuned in order to generate certain output. It's filed in SDNY and it may well be a landmark case. /2 Image
It's rooted in copyright law and the US Constitution and that's very much where it begins. /3 Image
And as it notes there is a lot of money at stake. But it does well to look towards the future showing how violations used to create a substitute undermin existing (and future) biz models (including AI licensing) which fund the critical and costly journalism around the globe. /4 Image
The complaint makes it clear early on that the goal in negotiations to license its content is to receive fair value and to help shepherd a future world with responsible AI and a healthy news ecosystem. /5 Image
It cites a number of examples as to the human and financial cost that goes into journalism which can span multiple continents and require working through very challenging access limitations. /6 Image
That cost in 2023 includes mass shootings, wars, terrorist attacks, elections, financial infrastructure and natural disasters around the world. There can be no debate as to what it is at stake here. /7 Image
And with a clearly tied role for Microsoft, the complaint highlights abuses even in the most recent months. It shows this example of content lifted verbatim from a NYT report and then compares it to the approach taken by a search engine. /8 Image
Here is the search comparison using Microsoft's own search engine. The difference in handling of copy from the content is immediately obvious and impossible to debate. /9 Image
The complaint also steps through the preference and weighting used for sources with claims NYT-sourced content is more valued for training. And that undermining that real investment will undermine the entire market for journalism - including licensing it for future AI. /10 Image
There are a number of examples in the complaint around weighting showing not all brands and content are equal but I found the overweighting of WebText2 as a pretty good example of how "high-quality" content is given preference. /11 Image
And Google PageRank as one of the oldest approaches on the web to sorting authority across websites... here it's noted that nearly all of the few entries above NYT are social media so significantly less helpful to training a model. /12 Image
So back to Exhibit J. Unlike the other 220k+ pages of exhibits documenting registered works, this exhibit contains 100 examples of alleged copyright violations with nearly identical content being outputted by ChatGPT. Again, it's impossible to argue with this. /13 Image
Here are four examples. Again, the lawsuit includes one hundred of them. You get the point. I find this exhibit to be an incredibly powerful illustration for a lawsuit that will go before a jury of Americans. Again, it's impossible to argue with this. /14


Image
Image
Image
Image
And finally the lawsuit rips a gigantic hole into the presentation of OpenAI as a benevolent nonprofit. /15 Image
and on top of this, it also systematically walks through Microsoft's role in facilitating and contributing to the alleged copyright. Side note from me: Microsoft has gained one trillion dollars in market cap in 2023. /16 Image
Finally a quote from me on all of this that I supplied to news outlets earlier today. /17 Image
Here is a link to the full complaint and all of the exhibits. I would start with the 69 page complaint and then skip to Exhibit J if interested and I were you. Cheers. /18 courtlistener.com/docket/6811704…
Here is exhibit J. It’s an incredibly powerful exhibit for a jury trial once you explain copyright. /19 storage.courtlistener.com/recap/gov.usco…

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Jason Kint

Jason Kint Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @jason_kint

Jun 17
Woah. Exhibit list just posted for Facebook trial in DE starting in a few weeks. We finally have confirmation Sheryl Sandberg was deposed by the SEC - one week prior to Zuckerberg which also kept secret until a lawsuit unsealed it. Sandberg was also sanctioned in this case. /1 Image
This matters as it gets at Who Knew What When at FB ahead of the world finding out its platform was leaking personal data for years. Zuckerberg was dodgy at best under oath to Congress, FB responses to Parliaments focused on 2018 news. But exhibits include Jan 2017 MZ emails. /2 Image
The DE lawsuit claims Facebook's $5 billion record settlement was inflated in order to protect its CEO, Zuckerberg, and also includes (civil) insider trading claims. Zuckerberg was ordered to sit for multiple day depo this year, will have to testify live. /3 Image
Read 20 tweets
Jun 15
Scanning front pages across America this morning. Still today, the local A1 best captures the biggest story of the day. The majors from NY to LA to Detroit to even Arkansas. /1 Image
Image
Image
Image
From Washington DC all of the way up to the major newspapers in Alaska… the No Kings protest images are everywhere capturing the moment. /2 Image
Image
All of them capture peaceful protest, democracy in action, and what America is all about at a time when social media algorithms may distort what the day was all about. Illinois to Colorado. /3 Image
Image
Read 11 tweets
Jun 10
Incredible work being done by the press to keep facts building on facts. Grateful. This entire WSJ report overnight starting with this lede on how White House orders sparked LA crackdown is both chilling and informative. /1 Image
This statement. “We came to the United States for protection of what we encountered in Russia. It seems that we are encountering here what we fled.” /2 Image
WSJ separating out cases of targeting groups who have not committed crimes but even noting here incredible resources being used against what appears to be clear, First Amendment protected activity alerted the community. Here is the must-read report. /3 wsj.com/us-news/protes…Image
Read 6 tweets
May 28
Confession. Having watched Scott Pelley's outstanding work over nearly three decades, I almost didn't take the time to watch his W.F. commencement speech thinking the news reports told me enough of the facts. Frankly, that would have been a huge mistake on my part. Huge. 1/5
Disclosure: I'm a 60 Minutes fan. In fact, I read Don Hewitt's "Tell Me a Story" after nearly a decade in sports media and it likely tipped the scale in 2007 when I decided to jump to work at CBS. I find Pelley and team brilliant in telling stories in barely 15 min segments. 2/5
“If liberty means anything at all, it means telling someone something that they don’t want to hear. I fear there may be some people in the audience who don’t want to hear what I have to say today but I appreciate your forbearance in this small act of liberty.” - Scott Pelley 3/5
Read 6 tweets
May 16
wow, another order for Mark Zuckerberg to sit for another court deposition. This time in a case involving privacy violations with ingesting web-wide health data. Remember they paid billions in cases to try to avoid this. Data and privacy issues are especially sensitive. /1 Image
Zuckerberg depositions are interesting as they often go on for hours with highly informed attorneys driving for answers. And those answers may be put up against the often questioned veracity of his answers to Congress. Yes, as a CEO, he has testified to Congress A LOT. /2
I think his first real depo was SEC on very sensitive data scandal leading to $5B+ settlements with FTC+SEC. That scandal is still playing out in courts (did he overpay to protect himself?) It took 3yrs to get unsealed after I caught it in a footnote. /3
Read 7 tweets
May 13
The Verge comes in with a massive scoop on the backstory reporting it was Musk - and Sacks - behind the scenes trying to blow up IP to train AI on behalf of his allies. This wouldn't be a surprise to anyone. /1 Image
they have reports and details on the carnage and firing of the leadership and on the possible incorrect assumption that the new people in charge were running their playbook. /2 Image
It may be rare that @mrddmia is in agreement with Dems but in the world of accountability for big tech abuse whether over data, monetization, IP, censorship, privacy, you name it, these aren't partisan issues. appreciate the shared voice from advocates all around. /3 Image
Read 5 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(