A hearing happened yesterday that could radically change how we use the internet. Depending on how the Supreme Court rules on Van Buren vs U.S., third-party scraping could become a criminal activity. #scrapingisnotacrime Image
Scraping of publicly available data makes a more equal & inclusive internet. By blocking third-party scrapers, we would lose monitoring by data journalists, watchdog orgs, and researchers.

Here’s some of the stuff that would disappear due to being labeled criminal activity:
❌ Researcher projects & civil society institutions that monitor big tech uses data like @LauraEdelson2’s NYU Ad Observatory that observes political ad targeting thehill.com/policy/technol…

—GONE
❌ Or studies that how disinfo spread on social media networks - like the recently released incredible study on Facebook echo chambers from @hamahmrd’s @Amir_Ghasemianwork @aaronclauset @markusmobius & @DavMicRot @duncanjwatts

—GONE
❌ Being able to find lost websites or older versions - such as @internetarchive's quiet work of documenting culture + knowledge archive.org

—GONE
❌ Consumer protection - such as @camelcamelcamel price trackers for consumers camelcamelcamel.com

Or @JuliaAngwin’s @suryamattu’s work on Amazon’s biased pricing algorithm
propublica.org/article/amazon…

—GONE
❌ Algorithmic monitoring of racist, sexist software. All the work that @YESHICAN’s @Data4BlackLives is encouraging data activists to do would be illegal d4bl.org/action.html

--GONE
❌ Election campaigning data and election analysis - such as @alloydotus or @alex_gaynor's 2020 election scraper made with a team of contributors on Github github.com/alex/nyt-2020-…

—GONE
❌ Even something as basic as collecting lots of headlines such as @importio's article assessing media coverage would be considered illegal import.io/post/trump-vs-…

—GONE
❌ COVID dashboards such as @alexismadrigal and @TheAtlantic's testing tracker covidtracking.com
Or @AviSchiffmann's @TheDanielConlon's very useful covid tracker - ncov2019.live

—GONE
❌ White hat hackers who help tech platforms find vulnerabilities in their stack - @rootkovska’s work on Intel, @0xcharlie’s work on Apple, @dakami’s work on DNS — the work would be considered criminal

—GONE
Scraping in the connective tissue of the internet!

Without scraping:
1.) interoperability between platforms & scripts that make the cloud seamless could disappear.
2.) Important services that keep our internet healthy would disappear.
"Extracting or scraping that data for analysis flips the power balance so that the audience decides what can be done with the data." says the @markup in a great thread from them that explains the problems of CFAA
If the SC gives a shitty misinterpretation (legalese calls it “broad interpretation”) of the CFAA, scraping could become a criminal activity because of an outdated law from 1986 when we didn’t even have the WWW or a commercialized internet!
And the crazy part is that as backwards as the CFAA is big tech LOVES the CFAA (@superwuster has dubbed it the Worst Law in tech for many good reasons documented here newyorker.com/news/news-desk… )
The CFAA gives Big Tech cover to not share any data and claim that they're protecting users despite the fact they gather data from users and keep it in their walled gardens. Big tech is manipulating what is considered personal data and privacy
PERSONAL DATA- The data we put up on sites with the purpose of public SHARING is our personal data. Eg when you post your data to Linkedin, you want others to see it. But who owns that data? Can Linkedin prevent others from scraping the data that you’ve already made public?
Linkedin says yes, they own your data. They tried to do that in Linkedin vs HiQ-
ere.net/data-privacy-t…

But Linkedin lost! Good summary from @binarybits on @arstechnica. arstechnica.com/tech-policy/20… and @EFF
eff.org/deeplinks/2019…
Companies like Linkedin and Facebook are arguing that they are protecting users' privacy when they prevent scraping or data sharing. They are collapsing the context of data use and clumping all types of data together.
e.g. they don’t differentiate between 1:1 DM messages from the university you list. We need to distinguish data use by type in order to understand that scraping should be allowed for publicly shareable data.
PRIVACY - Big tech is weaponizing privacy to deflect attention from how they are managing our data in practice. Facebook's blatant attempt at this: @EFF’s @mitchstoltz’s eff.org/deeplinks/2020…
Right now, privacy advocates like @EPICprivacy are buying it. They filed several amicus briefs in defense of the CFAA in the name of privacy - seriously WTF
epic.org/amicus/cfaa/li…
epic.org/amicus/cfaa/va…
Part of the issue is that talking about scraping solely through privacy is incredibly limiting and narrows our understanding of what is at stake. We need to expand our language to personhood, which is about self-determination and agency over our own livelihoods and lives.
With personhood, we could ask, does criminalizing scraping compromise people’s personhood? And you see all the ways it would prevent monitoring of abuses & therefore impede one’s personhood or a group’s communityhood.
More about personhood in this essay: deepdives.in/you-are-not-yo…
Correction to tweets above - I meant to tag @themarkup ! Thanks you for all that you’re doing !
❌ federal gov would lose ability to respond to federal disasters if third party scraping were a crime says @denicewross -

@disastahguy that would affect @disastertechinc services!
Gahhh @binarybits’s latest article on CFAA has me feeling so nervous like election week again ... arstechnica.com/tech-policy/20…

Thanks @vvas for the tip
EVEN my dog is alarmed about the impending Supreme Court ruling on CFAA this week.

She is proudly wearing @themarkup shirt #scrapingisnotacrime & talking all dogs to the cause

instagram.com/p/CIUJ2H3pNRP/… Image

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Tricia Wang

Tricia Wang Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @triciawang

8 Oct
I found out that my mother is being radicalized by Chinese Wechat & local Chinese American groups in California to either not vote at all or vote for Trump. During this quarantine she has a lot of time at home and she's been seeing a lot of misinfo
Everything she has said is the almost WORD FOR WORD the same things that I have heard other Chinese Americans say in my networks.
Their main issue is that the “next generation” won’t benefit from their hard work and that their children or grandchildren will fall behind if Democrats are in control cuz they’ll implement things like affirmative action & equality measures.
Read 22 tweets
20 Mar
The lack of context in which @CarolYujiaYin’s video of using health codes in China is being shared on twitter & linked to @techreview’s really good social distancing article seems to be creating confusion about how it’s actually being implemented.
My researchers in Wuhan & I (virtually) have been studying how people are using tech during the Quarantine. So here are some notes to clarify how tracking is actually done.
As @CarolYujiaYin said, every city has their health code rating sys. 健康码 for #COVID19 to determine if one is “safe” or not to enter a specific building or area.

BUT it’s WAY more confusing in practice! There are many competing systems w/ their own definition of “safe”.
Read 19 tweets
13 Jun 19
1. I’m so grateful to @tristanharris @aza being vulnerable enough to suggest a new field to study human + tech. It just goes to show there’s tons of confusion around the role of research & insights in industry - which is the topic of my upcoming talk at @MindtheProduct.
2. @zeynep is right about the willful blindness of industry tech & @BostonJoan have pointed to entire field of STS - BUT i have sympathy for @tristanharris @aza cuz i see tons of confusion on several fronts between industry & academia that can possibly make even well meaning...
3. ... tech folks feel like a new area of study is needed. One of the big problems is how research is manifested in industry under confusing fields of UX, CX, design research -as @lifewinning noted there is def a trend with throwing an X in industry to make a field sound TOUGH
Read 26 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!