We’ve spent the last ten months building #CitizenBrowser, a project that aims to peek inside the Black Box of social media algorithms, by building a nationwide panel to share data with us. Today, we are publishing our first story from the project. /1
.@corintxt crunched the numbers and found that after Facebook flipped the switch for political ads, partisan content elbowed out reputable news outlets in our panelists’ news feeds. themarkup.org/citizen-browse… /2
You can learn more in our methodology, where we describe how we did this and what steps we took to ensure that we preserved the panelists' privacy. themarkup.org/citizen-browse… /3
Personally, this project is the culmination of years of experiments trying to figure out how to collect data from social media platforms in a way that can lead to meaningful reporting. I’ve described a couple of highlights below 👇 /4
My first attempt was in 2016 at ProPublica, when I was working with @JuliaAngwin. We were interested in seeing if there was a difference between the ad interests FB disclosed to users in their settings and the interests they showed to marketers. /5
We collected the data set of groups FB showed marketers through the FB Ad portal quite easily, but collecting what they showed to users was a non-trivial problem. /6
In order to collect this data I built a browser extension that let readers share their data with us. We released this as a part of @propublica’s Breaking The Black Box series. propublica.org/article/breaki… /7
Sadly, this extension no longer works, but the 52k categories we collected from volunteers are still available to download. propublica.org/datastore/data… /8
It was during this investigation that we first learned about the now infamous ‘ethnic affinity’ ad categories on Facebook. propublica.org/article/facebo… /9
The second attempt was with @kashhill at Gizmodo investigating how Facebook's People You May Know algorithm suggested people to befriend. /10
Since FB doesn’t have an API for this service, I built a tool that allowed people to collect all the friend suggestions Facebook made to them on a daily basis. If they found something interesting they could share it with us. gizmodo.com/keep-track-of-… /11
Facebook wasn’t happy about our tool and asked us to take it down. To this day there is no easy way for users to get their friend suggestions data from Facebook. gizmodo.com/facebook-wante… /12
#CitizenBrowser is the next step in this journey. We built an app that we paid panelists to install, which collects data from their Facebook feeds at regular intervals. /13
It’s an ambitious experiment. But given that there is no other way to see what is happening on this important platform, we gave it our best shot. Grateful for @JuliaAngwin and @nabihasyed's fearless leadership that allowed us to push the limits. /14
Also grateful for our team, who just ran a marathon at sprint speed to get this project done! @angiewaller @leonyin @corintxt @jonkeegan @sammorrisdesign @mynameisfiber @magiccia @cortesi @jeffcrouse @iaaaan @rinapalta @simplysecureorg /eof

More from @suryamattu

22 Sep 20
Today we’re releasing Blacklight, a real-time website privacy inspector that illuminates the hidden tracking technologies on any website (data detectives, assemble!). You can try it out yourself at: themarkup.org/blacklight
In our companion investigation, ‘The high privacy cost of building a “Free” website’, we show how common, free website-building tools offered by ad-tech companies lead to trackers loading on users’ browsers, often without the website operators’ knowledge or disclosure to users.
7 Aug 18
Facebook wasn't happy that we built a tool to let users collect their own 'People You May Know Data'
gizmodo.com/facebook-wante…
We carried out an investigation into the PYMK algorithm and found some pretty wild stuff. You can read the stories we wrote here: gizmodo.com/tag/people-you…
It's worth reiterating that Facebook doesn't even think of this as your data, so you couldn't ask for it even under GDPR rules.