Kareem Carr, Statistics Person Profile picture
Oct 4, 2020 11 tweets 2 min read Read on X
Don't know what a P-VALUE is?

Don't know why P-VALUES work?

Don't know why sometimes P-VALUES don't work?

THIS IS THE THREAD FOR YOU. 🧵
DEFINITION OF A P-VALUE. Assume your theory is false. The P-VALUE is the probability of getting an outcome as extreme or even more extreme than what you got in your experiment.
THE LOGIC OF THE P-VALUE. Assume my theory is false. The probability of getting extreme results should be very small but I got an extreme result in my experiment. Therefore, I conclude that this is strong evidence that my theory is true. That's the logic of the p-value.
THE P-VALUE IS REASONABLE IN THEORY BUT TRICKY IN PRACTICE. In my opinion, the p-value is just a mathematical version of the way humans think. If we see something that seems unlikely given our beliefs, we often doubt those beliefs. In practice, the p-value can be tricky to use.
THE P-VALUE REQUIRES A GOOD DEFINITION OF WHEN YOUR THEORY IS FALSE. There are usually an infinite number of ways to define a world where your theory is false. P-values often fail when people use overly simplistic mathematical models of the processes that created their data.
If the mismatch between their mathematical models of the world and the actual world is too large then the probabilities we compute can become completely disconnected from reality.
THE P-VALUE MAY REQUIRE AN ACCURATE MODEL OF YOU (THE OBSERVER). The probability of getting the result you got depends on many things. If you sometimes do things like throw out data or repeat measurements then you're part of the system.
Your behavior affects the probability of getting your experimental results. Therefore, to be completely realistic, you need to have an ACCURATE model of your own behavior when you gather and analyze data. This is hard and a big part of why the p-value often fails as a tool.
BY DEFINITION, P-VALUES MUST SOMETIMES BE WRONG. When using p-values, we're working off of probabilities. By logic of the p-value itself, even with perfect use, some of your decisions will be wrong. You have to embrace this if you're going to use the p-values.
Badly defining what it means for your model to be false. Inaccurately modeling the chances of getting your data including your own behaviors. Not treating a p-value as a decision rule that can sometimes be wrong. These factors all contribute to misuse of the p-value in practice.
Hope this cleared some things up for you. Thanks for coming to my p-value TED talk!

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Kareem Carr, Statistics Person

Kareem Carr, Statistics Person Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @kareem_carr

Apr 25
If you think about how statistics works it’s extremely obvious why a model built on purely statistical patterns would “hallucinate”. Explanation in next tweet. Image
Very simply, statistics is about taking two points you know exist and drawing a line between them, basically completing patterns.

Sometimes that middle point is something that exists in the physical world, sometimes it’s something that could potentially exist, but doesn’t. Image
Imagine an algorithm that could predict what a couple’s kids might look like. How’s the algorithm supposed to know if one of those kids it predicted actually exists or not?

The child’s existence has no logical relationship to the genomics data the algorithm has available.
Read 4 tweets
Apr 19
"Why are US taxpayers funding Harvard?"

These grants aren't charity. They're highly competitive contracts where the US government determines Harvard is the best institution for conducting specific research, and then pays Harvard for services rendered to US taxpayers.
Each grant represents a fair contract that a group at Harvard won after being in competition with hundreds or even thousands of other groups. These are not handouts.
The US government pays Harvard and other universities to provide answers to questions that aren't directly profitable in themselves, but which provide a foundation for private sector innovation, and help maintain American dominance over geopolitical rivals like China.
Read 5 tweets
Apr 6
As a someone who translates ideas into math for a living, I noticed something weird about the tariff formula that I haven't seen anybody else talk about. 🧵 Image
The formula defines the tariff rate as exactly the percent you need to charge on imports to make up for the trade deficit. Basically,

trade deficit = tariff rate x imports

It's constructed as if tariffs are a kind of compensation for trade deficits but this raises a question.
If tariffs are something foreign countries owe to the American people for having a trade deficit, then forcing US businesses to make up for the difference, by paying extra money to the US government, is kind of a weird solution.
Read 9 tweets
Apr 1
Whenever I see students with good grades but lots of college rejections, my first thought is a bad personal essay. As predicted, this guy's essay was kind of a disaster.

Since I did get into Harvard, I'll give my two cents on the essay:
Maybe this will help some people. Here's the essay: Image
Image
Red flag 1:
A guy pausing his rapidly growing $30M business for 4 years to attend college makes zero financial and professional sense.
Read 12 tweets
Mar 8
In honor of international women's day, let's take a moment to remember the most famous statistician in history.

You've definitely heard of her, but you probably have no idea she was a statistician.

It's Florence Nightingale. Image
Image
Nightingale was first female member of the Royal Statistical Society and a pioneer in using statistical analysis to guide medical decisions and public health policy.
Florence Nightingale's most famous statistical analysis was her investigation into the mortality rates of soldiers during the Crimean War. She demonstrated that the majority of deaths among soldiers were due to preventable diseases rather than battlefield injuries! Image
Read 5 tweets
Feb 18
Took one for the team and made a histogram of the Elon social security data. Not sure why his data scientists are just giving him raw tables like that. Image
Image
It’s also weird that they keep tweeting out these extremely strong claims without taking a few days to do some basic follow up work.
It doesn’t come off like they even:
- plotted the data
- talked to any of the data collectors
- considered any alternative explanations
Read 6 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(