Nicolas Sommet 🇺🇦

Sep 8, 2022 • 15 tweets • 7 min read

Power analysis for #interactions can be tough!

📢 Our new preprint features:
𝟭 An intuitive taxonomy of 12 types of interaction
...with the 𝘕s to reach power = .80/.90
𝟮 A 😭 meta-study
𝟯 Simulations testing 3 ways to ↗️ power
𝟰 A cool web app!

🧵

osf.io/xhe3u/

𝟭𝗮 As we know from popular blogs/papers, power analyses differ b/w main effects & interactions because:

👉a main effect corresponds to a difference b/w means

👉a two-way interaction corresponds to a difference b/w mean subdifferences

(using simple b/w-Ss designs as examples)
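
To make the "difference b/w mean subdifferences" idea concrete, here is a minimal Python sketch for a 2×2 between-participant design. The cell means are made-up numbers in SD units (not taken from the preprint), and the function names are hypothetical.

```python
# Hypothetical cell means for a 2x2 between-participant design
# (illustrative values in standard-deviation units, not from the preprint).
means = {
    ("a1", "b1"): 0.35, ("a1", "b2"): 0.00,
    ("a2", "b1"): 0.00, ("a2", "b2"): 0.00,
}

def main_effect_of_b(m):
    """Main effect: difference between the two marginal means of factor B."""
    b1 = (m[("a1", "b1")] + m[("a2", "b1")]) / 2
    b2 = (m[("a1", "b2")] + m[("a2", "b2")]) / 2
    return b1 - b2

def interaction(m):
    """Interaction: difference between the two simple effects of B."""
    simple_at_a1 = m[("a1", "b1")] - m[("a1", "b2")]
    simple_at_a2 = m[("a2", "b1")] - m[("a2", "b2")]
    return simple_at_a1 - simple_at_a2

print(main_effect_of_b(means))  # difference between means
print(interaction(means))       # difference between subdifferences
```

With these made-up means, one simple effect is 0.35 and the other is 0, so the interaction is a difference of two subdifferences rather than a single difference between means.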

𝟭𝗯 Thus, when running a power analysis…

✅ It is OK to use a generic value to define the expected effect size of a main effect (e.g., a medium-sized difference of 𝘥 = 0.35)

❌ But it is NOT OK to use a generic value to define the expected effect size of an interaction

𝟭𝗰 To determine the type of interaction you expect, we argue that you must answer two Qs:

𝗤𝟭 What is the expected shape of my interaction?
➡️Reversed? Fully attenuated? Partially attenuated?

𝗤𝟮 What are the expected sizes of my simple slopes?
➡️Small? Medium? Large?

𝟭𝗱 This results in 12 basic types of interactions.

👇 see Table 👇

E.g., a “0.35 | 0.00 fully attenuated interaction” (in red) involves a medium-sized simple slope & a null simple slope. If such an interaction is true, 𝘕 = 1,024 gives you an 80% probability of detecting it.
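
The 𝘕 = 1,024 figure can be sanity-checked with a rough normal-approximation sketch. This is not the preprint's simulation code: it assumes a 2×2 between-participant design, equal cell sizes, unit variance, and a two-tailed α = .05, and the function name is hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def interaction_power(d1, d2, n_total, alpha=0.05):
    """Approximate two-tailed power for a 2x2 between-participant interaction.

    d1, d2: simple slopes in SD units; n_total: total N split over 4 equal cells.
    Uses a normal approximation (an assumption of this sketch).
    """
    z = NormalDist()
    contrast = d1 - d2              # difference between the simple slopes
    se = 4 / sqrt(n_total)          # SE of that contrast with n_total / 4 per cell
    z_crit = z.inv_cdf(1 - alpha / 2)
    ncp = abs(contrast) / se
    # Probability of landing in either rejection region
    return z.cdf(ncp - z_crit) + z.cdf(-ncp - z_crit)

power = interaction_power(0.35, 0.00, 1024)
print(round(power, 2))  # prints 0.8
```

Under those assumptions, the approximation lands on the same power = .80 as the table entry for the 0.35 | 0.00 interaction.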

𝟮𝗮 From there, we wanted to know how researchers handle power analysis when having an interaction hypothesis.

We ran a prereg meta-study & built a sample of 159 studies testing interactions published in 10 influential psychology journals.

Three (kinda depressing) conclusions.

𝟮𝗯 Conclusion #1 🙁

The majority of the studies in the lit test partially attenuated interactions (the most difficult to detect)

𝟮𝗰 Conclusion #2 ☹️

Less than 5% of the studies report an adequate power analysis (many use an inadequate generic value to define the expected effect size of the interaction)

𝟮𝗱 Conclusion #3 😢

The overall median power to detect a medium-sized interaction of a given shape is .18.

𝟯𝗮 From there, we wanted to find solutions to the problem of power when testing interactions.

We ran zillions of simulations to generate power curves for our 12 types of interaction & tested ways to increase power without increasing 𝘕.

Three (rather comforting) strategies.

𝟯𝗯 Strategy #1 🙂

🟦If preregistering a one-tailed test (rather than using a two-tailed test), 21% fewer participants are needed to reach a power of .80 (blue curves)
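
A quick way to see where a saving of roughly 21% can come from: under a normal approximation (an assumption of this sketch, not the preprint's simulations), the required 𝘕 is proportional to (z_α + z_power)², and only z_α changes between a one-tailed and a two-tailed test. The function name is hypothetical.

```python
from statistics import NormalDist

def required_n_factor(power=0.80, alpha=0.05, one_tailed=False):
    """Quantity the required N is proportional to, under a normal approximation."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha if one_tailed else 1 - alpha / 2)
    z_power = z.inv_cdf(power)
    return (z_alpha + z_power) ** 2

# Relative reduction in N when switching from two-tailed to one-tailed
saving = 1 - required_n_factor(one_tailed=True) / required_n_factor()
print(f"{saving:.0%}")  # prints 21%
```

The effect size and design cancel out of the ratio, so in this approximation the saving does not depend on the type of interaction.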

𝟯𝗰 Strategy #2 😀

🟩If using a mixed design* (rather than a between-participant design), 75% fewer participants are needed to reach a power of .80 (green curves)

*assuming a conservative between-measurements correlation of ρ = .50
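
The 75% figure is consistent with a simple variance argument. The sketch below assumes a 2×2 design with one between-participant factor and one within-participant factor, equal group sizes, and unit variance; the thread does not spell out the exact design, so treat this as an illustration rather than the preprint's method. The function name is hypothetical.

```python
def mixed_to_between_n_ratio(rho):
    """Ratio of required N (mixed / between) for the same interaction contrast.

    Variance of the interaction contrast with total sample size N:
      fully between design: 16 / N            (4 cells of N / 4 participants)
      mixed design:         8 * (1 - rho) / N (2 groups of N / 2, difference scores)
    Required N scales with that variance, so the ratio is 8 * (1 - rho) / 16.
    """
    return 8 * (1 - rho) / 16

saving = 1 - mixed_to_between_n_ratio(0.50)
print(f"{saving:.0%}")  # prints 75%
```

In this approximation, a higher between-measurements correlation gives a larger saving, which is why ρ = .50 is described as a conservative choice.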

𝟯𝗱 Strategy #3 😃

🟨 If using a planned contrast analysis* (rather than the orthodox factorial approach), 60% fewer participants are needed to reach a power of .80 (yellow curves)

*only applies to fully attenuated interactions

𝟰 Finally, we developed INT×Power, a user-friendly web application that enables researchers to draw their interaction & determine the sample size needed to reach a power of .80 with & without using these three strategies.

The beta version of the app:
👉intxpower.com

THANKS for reading this long thread

The preprint (osf.io/xhe3u/) is not submitted yet, so comments, suggestions, & criticisms are welcome and will be considered (feel free to email me).

I mean, let's be honest, there's probably at least ONE mistake in this appendix 🙃

