Crémieux Profile picture
Aug 18 • 13 tweets • 4 min read • Read on X
The study they're reporting on is not causally informative and does not indicate actual reasons to be concerned.

The reasons for this are very simple, as I'll show below.

🧵
First, sampling.

If I want to do a study on Holocaust survivors and I go and seek out people who survived it, I am looking for a select sample.

If, instead, I look in datasets that were sampled without respect to Holocaust survival and find survivors, my sample is nonselect.
Why does this matter?

Select respondents differ from nonselect ones because they elect to be sampled or because I was able to find them by virtue of something that differentiates them from the population.

For example, my Holocaust survivors might be part of a support group.
It might be that only the ones who suffer the most go to support groups. If I sample from the support group, I could contaminate my results because I'm sampling super-sufferers, not the norm.

That's actually a problem and I've pointed to it before.

The study CBS is referring to was based on a moderate-sized (n = 315 parents) convenience sample.

What happens when you use a large, population-representative sample to look at this screen time/tablets question?

Well, usually nothing more than chance:
Second, causality.

The study in question at best merely associated tablet use with outbursts of anger, it didn't provide meaningful causal evidence to explain this association.
When the relationship screen time has with internalizing problems and attention problems has been looked at with genetic data, the whole relationship has been found to be due to genetic confounding.

Accordingly, to conclude anything about the causality of the relationship between table use or screen time or whatever and kids' behavioral problems, we need to do things differently: we need large, representative samples with causal designs.
But wait, you might say: They had a random-intercepts cross-lagged panel model, so they could make causal inferences!

True, but the result is embarrassing, and especially so given the stronger literature on this finds nothing. They had two focal p-values: 0.045 and 0.040.
p-values this marginal were almost-certainly not found by mistake. They represent p-hacking, and if you're willing to talk to the press about p-hacked findings, maybe we should question your other work too.
Here's the study:

And an article on marginal p-values: jamanetwork.com/journals/jamap…
cremieux.xyz/p/ranking-fiel…
To make the p-hacking clearer, consider what the authors discussed in the abstract versus the results.

They generally found nonsignificant paths, but only discussed the two significant ones. Unless they're proposing age alters the nature of the causal relationship, that's no good. The fact that they only talked about the p < 0.05 results even though the others weren't distinguishable is just odd. It would be a weak result either way (especially for a reason I'm about to say), but the way they decided to write about this suggests they really wanted to get published, not to deliver the null they really found.

Also, unmentioned in the abstract: They had lots of attrition. 16% of the sample was lost by T2, and by T3, they had lost a third. Without imputation, that p = 0.04 is almost-certainly going >0.05, and it might even be >0.05 just using a different imputation method than FIML.Image
After re-estimating the model from the correlation matrix, I can confidently say that the highlighted significant results (p = 0.04/0.045) are down to the use of FIML.

The method of p-hacking was easily identifiable.

Good job accepting this, @JAMAPediatrics!

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Crémieux

Crémieux Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @cremieuxrecueil

Aug 19
There was a point in time when London shut down 70% of its police stations as part of a series of austerity cuts.

That was a bad idea🧵 Image
Background:

A 2010 report from the British government led to a 29% budget cut for London's police.

In response, the mayor figured cutting down police stations and redistributing the frontline officers across the remainder could save money while achieving similar results. Image
The police stations the mayor's office decided to shut down were fairly geographically equally distributed in London, and they respected local crime trends.

It's therefore plausible that the remaining stations could make up for the absence of the ones that were shut down. Image
Read 11 tweets
Aug 15
These are the Baths of Caracalla. Or at least, this is what remains of them today.

These ruins might not look impressive now, but when they were constructed they might have been one of the finest examples of Roman architecture.

But then Europe forgot how to build them🧵 Image
To get an idea of what the Baths looked like in their heyday, look at this rendering.

This palatial compound must have been a sight to behold since the baths rivaled medieval cathedrals like Laon, Notre-Dame, and Salisbury in scale. Image
To put numbers on it, the bath building itself was 228 meters long, 116 meters wide, and 38.5 meters tall, with capacity for an estimate 1,600 bathers in a complex with 13 hectares of sumptuous decoration.
Read 8 tweets
Aug 15
New survey: Over half of researchers in Denmark and an international sample from Britain, America, Croatia, and Austria anonymously admitted that they:

- Cite papers they don't read
- Cite irrelevant papers
- Don't put in effort in peer review
- Misreport nonsignificant findingsImage
In another set of questions, the Danish researchers were asked to report their prevalence of engaging in questionable research practices in recent publications and what they estimated it was across the field.

Unless they undersold their prevalence, they were pessimistic. Image
When the international sample was asked to do the same thing, they were also pessimistic. Image
Read 7 tweets
Aug 13
Here is the geographic distribution of Adolf Hitler's Y-chromosomal haplogroup.

That Hitler had this Y-haplogroup that's relatively uncommon for Germans has prompted some to claim Hitler had Jewish paternal ancestry.

But did he?🧵 Image
Firstly, how do we know this is Hitler's Y-haplogroup?

Illicit journalistic methods. Less politely: stalking.

A pair of Dutch journalists stalked Hitler's living relatives, gathered their DNA without their consent, and sequenced it to figure this out. Here's one example:Image
What they discovered was that most of the male relatives were E1b1b.

As reported by the company FamilyTreeDNA in 2010, 9% of Germans have this haplogroup, of whom 20% are Jewish.

So given this is Hitler's Y-haplogroup, we're sitting at maybe 20% with no other information. Image
Read 13 tweets
Aug 10
I'm delighted to see my charts have been posted in response to this multiple times.

If you look at American PISA performance, you see a consistent picture: Americans utterly dominate once you split them by race.

🧵
Image
Some people argue 'Ah, but you didn't split Europe by race. We would see the same picture!'

Nope. We have some European countries split by race (e.g., U.K.) and that's not the case, and notice the subtitle means most non-Europeans are already removed from the equation.
However, this is the case in some other settler nations like New Zealand. Their Whites perform as well as American Whites!

Read 8 tweets
Aug 9
Stereotype threat is a proposed explanation for the Black-White IQ gap.

But is it real?🧵

In the original study, the scores for Blacks and Whites in conditions of stereotype threat and without that threat looked like this: Image
Most people read about stereotype threat and think it fully explains the Black-White gap in scores, as in this hypothetical model: Image
But look back at the first figure and you'll notice the words "Adjusted by SAT."

That's right: in its original study, stereotype threat was something *added on top of* the existing Black-White gap, not something that explains it.

The reality of the test was more like this: Image
Read 5 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(