Parag Agrawal Profile picture
May 16, 2022 15 tweets 3 min read Read on X
Let’s talk about spam. And let’s do so with the benefit of data, facts, and context…
First, let me state the obvious: spam harms the experience for real people on Twitter, and therefore can harm our business. As such, we are strongly incentivized to detect and remove as much spam as we possibly can, every single day. Anyone who suggests otherwise is just wrong.
Next, spam isn’t just ‘binary’ (human / not human). The most advanced spam campaigns use combinations of coordinated humans + automation. They also compromise real accounts, and then use them to advance their campaign. So – they are sophisticated and hard to catch.
Some final context: fighting spam is incredibly *dynamic*. The adversaries, their goals, and tactics evolve constantly – often in response to our work! You can’t build a set of rules to detect spam today, and hope they will still work tomorrow. They will not.
We suspend over half a million spam accounts every day, usually before any of you even see them on Twitter. We also lock millions of accounts each week that we suspect may be spam – if they can’t pass human verification challenges (captchas, phone verification, etc).
The hard challenge is that many accounts which look fake superficially – are actually real people. And some of the spam accounts which are actually the most dangerous – and cause the most harm to our users – can look totally legitimate on the surface.
Our team updates our systems and rules constantly to remove as much spam as possible, without inadvertently suspending real people or adding unnecessary friction for real people when they use Twitter: none of us want to solve a captcha every time we use Twitter.
Now, we know we aren’t perfect at catching spam. And so this is why, after all the spam removal I talked about above, we know some still slips through. We measure this internally. And every quarter, we have estimated that <5% of reported mDAU for the quarter are spam accounts.
Our estimate is based on multiple human reviews (in replicate) of thousands of accounts, that are sampled at random, consistently over time, from *accounts we count as mDAUs*. We do this every quarter, and we have been doing this for many years.
Each human review is based on Twitter rules that define spam and platform manipulation, and uses both public and private data (eg, IP address, phone number, geolocation, client/browser signatures, what the account does when it’s active…) to make a determination on each account.
The use of private data is particularly important to avoid misclassifying users who are actually real. FirstnameBunchOfNumbers with no profile pic and odd tweets might seem like a bot or spam to you, but behind the scenes we often see multiple indicators that it’s a real person.
Our actual internal estimates for the last four quarters were all well under 5% – based on the methodology outlined above. The error margins on our estimates give us confidence in our public statements each quarter.
Unfortunately, we don’t believe that this specific estimation can be performed externally, given the critical need to use both public and private information (which we can’t share). Externally, it’s not even possible to know which accounts are counted as mDAUs on any given day.
There are LOTS of details that are very important underneath this high-level description. We shared an overview of the estimation process with Elon a week ago and look forward to continuing the conversation with him, and all of you.
For those of you who want to learn more in the meantime: blog.twitter.com/en_us/topics/c…
blog.twitter.com/content/blog-t…

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Parag Agrawal

Parag Agrawal Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @paraga

May 13, 2022
A lot has happened over the past several weeks. I’ve been focused on the company and haven't said much publicly during this time, but I will now.
We announced changes to our leadership team and operations yesterday. Changes impacting people are always hard. And some have been asking why a “lame-duck” CEO would make these changes if we’re getting acquired anyway. The short answer is very simple:
While I expect the deal to close, we need to be prepared for all scenarios and always do what’s right for Twitter. I’m accountable for leading and operating Twitter, and our job is to build a stronger Twitter every day.
Read 9 tweets
Dec 11, 2019
I’m incredibly excited for Twitter to kick off @bluesky, a new independent effort to develop a decentralized standard for social media. Please see @jack’s thread for more context. I have the privilege of finding a lead for this team.
The ideal candidate is someone who is hands-on technically, curious across disciplines, has experience working in the open on the blockchain, an opinionated vision for this effort, and is thrilled by the idea of shaping the future of social media.
The path ahead for @bluesky is full of uncertainty and challenges, which will be difficult but energizing for the right team. Some of the hurdles we can predict include:
Read 7 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(