Tweet

Noah Haber

20 Mar, 24 tweets, 5 min read

Systematic reviews and meta-analyses are like plywood. While they often have a pretty veneer, they are only as useful as the layers of materials they are made of and how it's all put together.

In this essay I will

Hm, might do this, since plywood is so, so much cooler than people give it credit for, and there's some good analogy making with how cross-grain layers are complementary and hold things in check.

I have nerd sniped myself.

Ugh, fine, I'll do it.

Systematic reviews and meta-analyses are like plywood, and plywood is crazy cool stuff that I bet you never even thought about.

But first we have to talk about wood.

Thread!

Wood is not the inert solid block stuff you might think it is.

Solid wood is sneaky evil stuff. It is constantly changing shape, twisting, expanding, contracting, and trying to ruin what you have made. But there are some rules to how it "moves."

The ways it expands and twists have mostly to do with what kind of wood it is, and the direction of the grain This is worthy of many a thread rant by itself, but alas.

thewoodwhisperer.com/articles/wood-…

Think about these twists and movements as something like "bias" in a study.

Like in studies, you can often deal with these biases, but it takes planning and work and skill, but often we just need something that stays put.

Enter: plywood, because plywood is a goddamn miracle.

Plywood is made of sheets of wood (or wood-like stuff) glued together.

And much like a meta-analysis, the strength and stability of the wood depends entirely on what those layers are made of, and how they are put together, and its WAY cooler than you think.

Obviously, plywood made of junk wood is junk (but may be good enough to get the job done)

So it's always good to know how good "good enough" is.

But the construction of ply really cool; it isn't just glued together in sheets; it's glued together in sheets of CHANGING DIRECTIONS.

By alternating the directions, the layers hold each other's movement in check.

A whole bunch of bendy wood that is identical and just layered on top of each other and glued would be just as bendy and unstable.

The results of a meta-analysis of a bunch of studies which all share the same biases will share the same biases of its component parts.

Meta-analysis is strong and stable only when it's made of strong and compatible/complementary study material. If it has gaps ("voids" in woodwork parlance), it'll be weak.

But ply also does another thing: it hides what's in the middle via its outer veneer.

If you just look at it from the outside, you can't tell what's up. A lot of nice looking ply is just very very thin veneer of nice looking stuff containing sandwiched junk.

A whole lot of systematic reviews look much stronger than they are, and you have to look inside.

But not all ply is the same! Different kinds of plys work better for different tasks, and what its made of determines what it's good for.

For example; MDF core ply has two outer wood veneers, and the center is a solid layer of MDF. MDF is cheap, but it has some cool properties.

MDF is cheap, heavy, and stays VERY stable / flat (unless it comes in contact with water), but has poor shear strength and is very brittle.

So, MDF core plywood is a great choice for stuff needs to stay very flat, stable, and void-free, but is well supported and well protected.

A meta-analysis for clinical practice might select more "pragmatic" studies, rather than typical phase-III type RCTs.

Pragmatic trials may or may not have hyper control over adherence, blinding, etc, but can be the right tool for the task at hand.

It's not all just good v bad.

@jamesheathers

Don't you worry @jamesheathers, I'm getting there.

https://twitter.com/jamesheathers/status/1373310366361485320

Brief story that just about everyone who does statistical consulting will understand.

I was working in a public workshop, and someone came in and wanted to trim a sheet of wood so he could make it into a cutting board.

What he had in his hands was a piece of big box store ply.

Ply in general isn't gonna made for a good cutting board for all sorts of reasons, but one HUGE one is this: industrial ply glues are made of nasty stuff you REALLY don't want in your food, like formaldehyde.

I had to regret to inform him that I couldn't help.

There are plenty of food-safe glues out there, and that's what your wooden cutting board is put together with.

But in general, you have to trust the manufacturer to have used the appropriate stuff to put it together, since you can't really tell.

The glue here is the analytical/statistical methods. It's the code that holds it all together. If it's made of the wrong stuff, applied badly, etc, it all falls apart, and may be dangerous.

But it's often really hard examine the details of the stats and code.

Plywood gets a bad rep in part because it's often cheaply made crap (and some serious no true Scotsman-ing from traditional woodworkers).

If you get (and pay for) the right stuff, you'll get great results.

Meta-analysis is the same way. You get what you pay for.

So, whether you're going out to build something w/ ply or reading a meta-analysis, carefully examine what it's made of.

Bring a wood nerd ("expert") along to check it out too, since they'll likely be better able to critically evaluate what you have.

/thread

Thread revived to mention something else that's super cool about ply and glue. Wood glue is insanely strong, provided that you are gluing faces/edges, and aren't trying to glue end grain.

(image credit: thepowertoolwebsite.com/best-glue-for-…)

Gluing end-grain is a no-go; it'll be super weak and break.

But ply has alternating directions. Its "edges" are always part end grain, part edge grain.

That means that if you glue ply by the edge, it'll always be a medium strength joint (at least on that side of the joint.

No analogy here, it's just a super cool plywood fact that people don't think about.

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @NoahHaber

Noah Haber

@NoahHaber

19 Mar

https://twitter.com/NeilLewisJr/status/1372756741335654404

FWIW, having "grown up" in econ (and now spending 90% of my time in a different field entirely), this statement strikes me as a pretty accurate description of economists as a whole, and a major source of inter-field friction.

https://twitter.com/NeilLewisJr/status/1372756741335654404

I do think that there is something to the fungibility of a lot of econ-style frameworks and ways of approaching problems, BUT in combination with hyperconfidence it gets econs (including me) in trouble.

I've had to learn to unlearn a lot of that hyperconfidence.

Note: fungibility is NOT AT ALL the same thing as superiority, and I think that particular line may be where the error is.

I (clearly) think there is a HUGE amount of untapped value in bridging disciplinary gaps, as indicated on the fact that I've bet the farm on it.

Read 6 tweets

Noah Haber

@NoahHaber

19 Mar

Now that everyone is (justifiably) up in arms about CurateScience, may I turn your attention to SciScore™, claimed to be "the best methods review tool for scientific articles." (direct quote, plastered all over their website)

sciscore.com

For starters, for a "tool" all about transparency, reproducibility, etc., it's pretty damn hard to find what it is they are actually reviewing for.

There is no page explaining it, nothing in the FAQ, and you have to dig around a bit.

To find that, there is a link on the "RTI" tab that goes to the preprint:

"Rigor and Transparency Index, a new metric of quality for assessing biological and medical science methods"

doi.org/10.1101/2020.0…

FWIW; having a paper for their metric is a good thing, at least.

Read 9 tweets

Noah Haber

@NoahHaber

8 Mar

At the risk of getting involved in a discussion I really don't want to be involved in:

Excepting extreme circumstances, even very effective or very damaging policies won't produce discernable "spikes" or "cliffs" in COVID-19 outcomes over time.

That includes school policies.

"There was no spike after schools opened" doesn't mean that school opening didn't cause (ultimately) large increases in COVID cases.

Similarly "There was no cliff after schools closed" doesn't really mean that the school closure didn't substantially slow spread.

That's one of the things that makes measurement of this extremely tricky; the effects of school policies would be expected to appear slowly over time, and interact with the local conditions over that period of time.

Infectious diseases are super sneaky that way.

Read 7 tweets

Noah Haber

@NoahHaber

7 Mar

https://twitter.com/ass_deans/status/1367824665146761225

I am in this picture, and honestly it's the most important thing.

https://twitter.com/ass_deans/status/1367824665146761225

Poorly designed quality assessment metrics are arguably even more objectionable than poorly designed primary methods.

Two examples of VERY poorly designed quality metrics are the Newcastle-Ottowa scale and SciScore.

Compare to something like ROBINS-I (which is really good).

One of these days I should really make a quality assessment metric assessment metric (or at least write about what makes the bad ones bad)

Read 5 tweets

Noah Haber

@NoahHaber

16 Feb

https://twitter.com/KateGrabowski/status/1361487134545367041

YES!!!

Most days in metascience, it feels like the odds are impossible, it's hard to believe that we'll ever make any progress at all.

And then every so often, something great happens.

This is a big deal for the future of science and publication, and I am STOKED!

https://twitter.com/KateGrabowski/status/1361487134545367041

@KateGrabowski

Full disclosure: I contribute every so often to the NCRC team under the fantastic leadership of @KateGrabowski and many others, and have been a fan of both NCRC and eLife since they started (well before I started helping).

At some point I'll do a long thread about why this small thing is a WAY bigger deal than it sounds, but to tease: this heralds active exploration of a fundamental and long overdue rethinking and reorganizing of how science is assessed and distributed.

Read 4 tweets

Noah Haber

@NoahHaber

14 Feb

Folks: There are serious statistical, design, language, and ethics concerns with that vitamin-D RCT.

AT BEST, it's completely meaningless due to negligent statistical treatment and design, but there's more "questions"

Help us out: avoid sharing until the critics have had time.

Seriously, we (people who are frankly pretty good at this) are having a very hard time figuring out what the hell is happening in that trial.

Please give us time to do our work.

https://twitter.com/fperrywilson/status/1360944814271979523

This thread is a good place to start if you want a taste.

But "super sus" is right; there is just so much here that doesn't make any sense at all, and this thread only scratches the surface.

It's gonna be a while before we figure this out.

https://twitter.com/fperrywilson/status/1360944814271979523

Read 46 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Share this page!

Noah Haber

Try unrolling a thread yourself!

More from @NoahHaber

Noah Haber

Noah Haber

Noah Haber

Noah Haber

Noah Haber

Noah Haber

Did Thread Reader help you today?

Like this author's thread?