Charles Rollet
Jul 23 · 6 tweets · 2 min read
Scoop: @BusinessInsider obtained an internal list of websites that could and couldn't be used for training Anthropic's latest AI models.

Anthropic's contractor Surge AI left the list fully public on Google Docs.

'Sites you can use' include Bloomberg, Harvard, & the Mayo Clinic
Check out the list here:

Many of the whitelisted sources copyright or otherwise restrict their content.

At least 3 - the Mayo Clinic, Cornell University, & Morningstar - told BI they didn't have any AI training agreements with Anthropic.
s3.documentcloud.org/documents/2602…
The spreadsheet also includes a blacklist of websites that Surge AI's gig workers were "now disallowed" from using.

The blacklist includes companies like the NYT & Reddit which have sued AI startups for scraping without permission.

Full doc here:
documentcloud.org/documents/2602…
The spreadsheet was used by Surge AI workers helping teach Anthropic's latest AI models to be "helpful, honest, & harmless."

This was for RLHF purposes, not pre-training.

But legally, it's "probably not going to make a material difference in terms of fair use," a law prof told BI.
Anthropic said it wasn't aware of the list and that Surge AI created it. (Surge declined to comment on this point.)

Surge locked down dozens of files for the project shortly after BI reached out & said it's "looking closely" into the security lapse.
For more, read our @BusinessInsider article: businessinsider.com/anthropic-surg…

• • •

More from @CharlesRollet1

Dec 18, 2023
BREAKING: Uyghur students in a major PRC city are *all* being tracked by a police "anti-terrorism" system which automatically flags "abnormal behaviors" such as "gathering at religious centers" via @ipvideo ipvm.com/reports/hangzh…
For only $23k a PRC AI company (chinaoly.com) has been contracted to build a "management and control platform" for "Uyghur students in colleges and universities"
The system tracks Uyghur students' purchases, online and offline behavior, VPN usage... and even alarms if they start gathering at "religious centers"
Nov 13, 2023
BREAKING: Hikvision won a 'Smart Campus' project in China that "automatically sends an alert" on ethnic minority students "suspected of fasting during Ramadan" based on "dining records of such students"
via @ipvideo ipvm.com/reports/hikvis…
Hikvision confirmed it won the project, but claims - without evidence - that it never developed/deployed the Ramadan warnings s.ipvm.com/uploads/987c/e…
Here's the key section of the project's tender describing these Ramadan warnings (page 260):

It's part of a $9m USD project for Minjiang University in Fujian Province. s.ipvm.com/uploads/embedd…
Nov 8, 2023
NEW: Hikvision is blaming an 'employee error' for offering "ethnic minority" recognition technology in its latest software...

But questions remain about whether Hikvision has ever actually canceled the technology, as it insists.🧵


via @ipvideo ipvm.com/reports/hikvis…
Last month @ipvideo showed live Hikvision software offering users tech that purports to detect "whether they are an ethnic minority" (是否少数民族)


This directly contradicts Hikvision's claims to have removed and cancelled this kind of tech in 2018. ipvm.com/reports/hikvis…
Two days after @ipvideo's report, Hikvision sent a letter blaming an employee who "failed to run the document through the required editing and screening process and posted the old version", insisting this was "phased out" & "prohibited" in 2018.
s.ipvm.com/uploads/embedd…
Jul 26, 2023
NEW: Hikvision's AI software can identify Uyghurs and is powered by @nvidia hardware, according to a recent PRC surveillance contract.

Hikvision claims it removed ethnic minority recognition tech five years ago.

via @ipvideo https://t.co/kYZAvxV21M ipvm.com/reports/hikvis…
The Hikvision AI software can detect "Whether ethnic minority: unknown, non-minority, Uyghur" (是否少数民族:未知、非少数民族、维族).

This software requires a server with "no less than eight" @NVIDIA T4 GPUs to run.

https://t.co/MGki8j989x s.ipvm.com/uploads/embedd…
NVIDIA says it has "no involvement" in this contract & is not "aware of any customer planning to provide NVIDIA" but "will review any information provided".

NVIDIA says the T4 is a 5-yr old GPU "sold mass market for many years, and we do not have visibility into resale".
May 31, 2023
BREAKING: Protest signs and protestors' faces are automatically reported to PRC police thanks to an AI system touted by Dahua, a huge Chinese video surveillance manufacturer. Dahua calls this solution a "banner_alarm" ipvm.com/reports/dahua-… via @ipvideo
"in the designated area, if a person holding a banner is detected and lasts for a certain period of time, an alarm will be generated"

指定区域内,检测到人举横幅且持续一定时间则产生报警 twitter.com/i/web/status/1…
This is intended for usage within China by PRC police and other PRC authorities, with Dahua describing it as a "social governance" (社会治理) and "social safety" (社会治安) solution
May 2, 2023
BREAKING: Chinese police in Shanghai are building a surveillance system that alerts them every time a foreign journalist tries to visit Xinjiang ipvm.com/reports/shangh… via @ipvideo
Here's the PRC police tender found by @ipvideo which details this system. It automatically notifies police every time "foreign journalists living in China" buy flight or train tickets to #Xinjiang.
s.ipvm.com/uploads/embedd…
This is only one feature of a sweeping surveillance system. Another feature: notifying PRC police of every Uyghur coming to Shanghai
