Post

How to get URL link on X (Twitter) App

On the Twitter thread, click on or icon on the bottom
Click again on or Share Via icon
Click on Copy Link to Tweet
Paste it above and click "Unroll Thread"!
More info at Twitter Help

Dr Kareem Carr

@kareem_carr

Oct 3, 2020 • 8 tweets • 2 min read • Read on X

Scrolly

Statisticians like me say CORRELATION ISN'T CAUSATION but that's not the whole story.

There are at least FOUR different scenarios!

A thread. 🧵

1. CORRELATED BY CHANCE. There's always a possibility that variables will correlate by chance. If you have a lot of data, you're almost certain to get a few high correlations. You will know you're in this situation if the same variables are much less correlated in new data.

2. CORRELATED DUE TO STRUCTURE. Clocks are correlated with each other but there's nothing about Clock A that can be changed in order to cause a change in Clock B or vice versa. There is no third thing you can change that will cause both clocks to change. There is no causation.

You might be tempted to say that the clocks have the common cause of being created by humans. Imagine two random stars that have a cyclical change in brightness every 24 hours. They will be correlated as well. It's not about who created them. It's about their similar structure.

3. MURKY CAUSATION. In the simplest case, if A and B are correlated and there is some causation then this could mean that A causes B, B causes A or some third thing C causes both A and B. In the most complex case, there could be complicated feedback loops between A and B.

In these cases, when we say "correlation isn't causation", what we mean is that we can't identify exactly what kind of causation there is but there is some.

4. EVEN MURKIER CAUSATION. A and B might not be related at all in the real world but something about your data collection may have caused data about A to be related to data about B. Technically, you could say you or your data collection are the cause of the correlation.

However, in the context of the original variables themselves and the real world, A is not causally related to B.

Hope this was educational! 🧵

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @kareem_carr

Dr Kareem Carr

@kareem_carr

Nov 21, 2025

My personal list of hidden gems. For every 10 likes, I’ll explain why one of these made the list.

Most statistics books are bad. Reading them is like chewing dry cardboard.

So when you find one that's good, it's a big deal.

I'm giving you seven.

This one is for the coders. All the concepts are expressed clearly, often from first principles, in code.

If this is how your brain works, you won't find many books like this, so it's worth checking out.

Read 8 tweets

Dr Kareem Carr

@kareem_carr

Jun 5, 2025

You may have heard hallucinations are a big problem in AI, that they make stuff up that sounds very convincing, but isn't real.

Hallucinations aren't the real issue. The real issue is Exact vs Approximate, and it's a much, much bigger problem.

When you fit a curve to data, you have choices.

You can force it to pass through every point, or you can approximate the overall shape of the points without hitting any single point exactly.

When it comes to AI, there's a similar choice.

These models are built to match the shape of language. In any given context, the model can either produce exactly the text it was trained on, or it can produce text that's close but not identical

Read 10 tweets

Dr Kareem Carr

@kareem_carr

Jun 2, 2025

I’m deeply skeptical of the AI hype because I’ve seen this all before. I’ve watched Silicon Valley chase the dream of easy money from data over and over again, and they always hit a wall.

Story time.

First it was big data. The claim was that if you just piled up enough data, the answers would be so obvious that even the dumbest algorithm or biggest idiot could see them.

Models were an afterthought. People laughed at you if you said the details mattered.

Unsurprisingly, it didn't work out.

Next came data scientists. The idea was simple: hire smart science PhDs, point them at your pile of data, wait for the monetizable insights to roll in.

Read 13 tweets

Dr Kareem Carr

@kareem_carr

Jun 1, 2025

As a statistician, this is extremely alarming. I’ve spent years thinking about the ethical principles that guide data analysis. Here are a few that feel most urgent:

RESPECT AUTONOMY

Collect data only with meaningful consent. People deserve control over how their information is used.

Example: If you're studying mobile app behavior, don’t log GPS location unless users explicitly opt in and understand the implications.

DO NO HARM

Anticipate and prevent harm, including breaches of privacy and stigmatization.

Example: If 100% of a small town tests positive for HIV, reporting that stat would violate privacy. Aggregating to the county level protects individuals while keeping the data useful.

Read 9 tweets

Dr Kareem Carr

@kareem_carr

May 8, 2025

The kids using ChatGPT to cheat are massively fumbling the ball.

I would give almost anything to experience learning something like calculus for the first time with an AI assistant.

I have wasted an ungodly amount of time on poorly written math textbooks.

Confusing notation. Poorly worded statements that I puzzled over for hours. Typos that had me questioning my sanity for days.

These kids won't ever have to go through that.

They'll take a picture of the page, ask ChatGPT what it means, and instantly get an explanation tailored to exactly their level.

Read 7 tweets

Dr Kareem Carr

@kareem_carr

May 7, 2025

Hot take: Students using chatgpt to cheat are just following the system’s logic to its natural conclusion, a system that treats learning as a series of hoops to jump through, not a path to becoming more fully oneself.

The tragedy is that teachers and students actually want the same thing, for the student to grow in capability and agency, but school pits them against each other, turning learning into compliance and grading into surveillance.

Properly understood, passing up a real chance to learn is like skipping out on great sex or premium ice cream. One could but why would one want to?

Read 6 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Enter URL or ID to Unroll

Dr Kareem Carr

Try unrolling a thread yourself!

More from @kareem_carr

Dr Kareem Carr

Dr Kareem Carr

Dr Kareem Carr

Dr Kareem Carr

Dr Kareem Carr

Dr Kareem Carr

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!