Safety
Mar 29, 2023 · 5 tweets · 1 min read
Sharing information surrounding media that we have determined will not be allowed based on our policies. This thread is intended to cover the most common questions being asked.
We don't immediately detect every single violating image on our platform the minute it is posted. It may be detected proactively using models/algorithms or through user reports. Various events and new information can result in more severe treatment of content.
Once we determine media will not be allowed, we run automated processes that find and restrict tweets containing it, which is the only way we can remove it fast and at scale. Rules are applied to everyone, and considering context is not possible at a volume of thousands of tweets per hour.
If the media is being shared mainly for awareness, we will not apply a strike or even a timeout for posting it. If you continue to repost the violating content, that will incur a strike and will result in escalating timeouts.
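As a rough illustration of the rule above, here is a minimal Python sketch. It assumes the first post of restricted media is treated as awareness sharing (tweet restricted, no penalty) and every later repost earns a strike with a longer timeout. The `Account` class, the `handle_restricted_post` function, and the timeout durations are all hypothetical; the thread does not state how reposts are counted or how long timeouts last.

```python
from dataclasses import dataclass

# Hypothetical escalation ladder in hours; the thread gives no durations.
TIMEOUT_HOURS = [12, 24, 72]


@dataclass
class Account:
    strikes: int = 0
    restricted_posts: int = 0  # times this account posted the restricted media


def handle_restricted_post(account: Account) -> tuple[bool, int]:
    """Return (strike_applied, timeout_hours) for one post of restricted media."""
    account.restricted_posts += 1
    if account.restricted_posts == 1:
        # Assumed: the first post is treated as awareness sharing, so the
        # tweet is restricted but no strike or timeout is applied.
        return (False, 0)
    # Reposting after the first restriction incurs a strike, and each strike
    # moves one step up the ladder (capped at the last step).
    account.strikes += 1
    step = min(account.strikes, len(TIMEOUT_HOURS)) - 1
    return (True, TIMEOUT_HOURS[step])
```

Calling `handle_restricted_post` repeatedly on the same `Account` walks it from no penalty through each timeout step.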
There is a valid debate about whether we should/shouldn't remove incitement related media. Many different users believe their tweet should be allowed, including the original creator of the media. We have biased towards not allowing Twitter to be used to incite violence.


More from @Safety

Apr 22, 2023
Posting a reminder on how to appeal a suspended account along with a few tips. We are working appeals in less than 48 hours in most cases, but we do require that a user appeals their suspension directly (you can't appeal on someone's behalf). Some tips for a successful appeal… twitter.com/i/web/status/1…
Explain the reason for the policy violation that led to the suspension or if not sure, ask for the reason your account was suspended. Soon the suspension reason will be displayed on the account page, but we will continue to send email notifications as well.
Let us know if you believe the suspension reason is an error or if you are acknowledging the policy violation and taking steps to avoid similar violations in the future. Appeals that do not have this information are likely to be denied.
Apr 17, 2023
We’re adding more transparency to the enforcement actions we take on Tweets. As a first step, soon you’ll start to see labels on some Tweets identified as potentially violating our rules around Hateful Conduct letting you know that we’ve limited their visibility. 🧵… twitter.com/i/web/status/1…
These actions will be taken at a tweet level only and will not affect a user’s account. Restricting the reach of Tweets helps reduce binary “leave up versus take down” content moderation decisions and supports our freedom of speech vs freedom of reach approach.
We may get it wrong occasionally, so authors will be able to submit feedback on the label if they think we incorrectly limited their content’s visibility. In the future, we plan to allow authors to appeal our decision to limit a Tweet’s visibility.
Mar 21, 2023
We recently partnered with @Sprinklr for an independent assessment of hate speech on Twitter, which we’ve been sharing data on publicly for several months.

Sprinklr’s AI-powered model found that the reach of hate speech on Twitter is even lower than our own model quantified 🧵
What’s driving the difference? The context of conversation and how we determine toxicity.

Sprinklr defines hate speech more narrowly by evaluating slurs in the nuanced context of their use. Twitter has, to this point, taken a broader view of the potential toxicity of slur usage.
To quantify hate speech, Twitter & Sprinklr start with 300 of the most common English-language slurs. We count not only how often they’re tweeted but how often they’re seen (impressions).

Our models score slur Tweets on “toxicity,” the likelihood that they constitute hate speech.
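The counting approach described above can be sketched roughly as follows. This is an illustrative Python example only, not Twitter's or Sprinklr's actual pipeline: the `Tweet` fields, the placeholder `SLURS` set, the 0.5 threshold, and the `summarize` function are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class Tweet:
    text: str
    impressions: int  # how many times the tweet was seen
    toxicity: float   # hypothetical model score in [0, 1]


# Placeholder tokens standing in for the list of common slurs.
SLURS = {"slur_a", "slur_b"}
TOXICITY_THRESHOLD = 0.5  # hypothetical cut-off, not a published value


def contains_slur(tweet: Tweet) -> bool:
    """True if any word in the tweet, stripped of punctuation, is on the list."""
    words = {w.strip(".,!?\"'").lower() for w in tweet.text.split()}
    return not words.isdisjoint(SLURS)


def summarize(tweets: list[Tweet], threshold: float = TOXICITY_THRESHOLD) -> dict[str, int]:
    """Count slur tweets and their impressions, overall and above the threshold."""
    slur_tweets = [t for t in tweets if contains_slur(t)]
    toxic = [t for t in slur_tweets if t.toxicity >= threshold]
    return {
        "slur_tweets": len(slur_tweets),
        "slur_impressions": sum(t.impressions for t in slur_tweets),
        "toxic_tweets": len(toxic),
        "toxic_impressions": sum(t.impressions for t in toxic),
    }
```

Reporting impressions alongside tweet counts reflects the point that reach (how often content is seen) matters, not just how often it is posted. Where the threshold sits decides how many slur tweets count as hate speech, which is one place a broader and a narrower definition can diverge.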
Feb 1, 2023
We’re moving faster than ever to make Twitter safer and keep child sexual exploitation (CSE) material off our platform. Here’s an update on our work:
Our recent approach is more aggressive in that we’re proactively and severely limiting the reach of any content that we detect may contain CSE material. This includes moving swiftly to remove the content and suspend the bad actor(s) involved.
In January, we suspended ~404k accounts that created, distributed, or engaged with this content, which represents a 112% increase in CSE suspensions since November.
Jan 28, 2023
As we shared earlier, we have been proactively reinstating previously suspended accounts. Starting February 1, anyone can appeal an account suspension and be evaluated under our new criteria for reinstatement.
We did not reinstate accounts that engaged in illegal activity, threats of harm or violence, large-scale spam and platform manipulation, or when there was no recent appeal to have the account reinstated.
Going forward, we will take less severe actions, such as limiting the reach of policy-violating Tweets or asking you to remove Tweets before you can continue using your account. Account suspension will be reserved for severe or ongoing, repeat violations of our policies.
Jan 27, 2023
We’ve heard from some of you that it’s not always clear what behaviors can come across as spam to other users and potentially lead to enforcement against your account for platform manipulation. Here are some things to avoid:
Don’t post the same (or almost the same) content or links over and over again, especially in threads where the topic is not directly related to the content or links you are posting.
Don’t mention accounts an excessive number of times, especially with the same type of content and/or links. This could also lead to users blocking you and/or reporting your behavior as targeted harassment, particularly when it includes hateful conduct.
