What can you do when an experiment yields null results?
I developed a theory to explain why (some) people derive pleasure from political engagement
14 out of 15 predictions failed
▶️A step-wise process to maximize the informational value of null results doi.org/10.1017/pls.20…
The need-based theory of political motivation in a nutshell:
Building on the ‘Pleasure Principle’, I suspect that people will seek behaviors that made them feel good in the past.
Citizens will therefore seek political engagement when it previously made them feel good.
To understand what actions provide pleasure, I built on the concept of basic psychological needs.
Differences in intrinsic political motivation are theorized to reflect whether political engagement was previously experienced as satisfying basic psychological needs.
Popper taught us that we cannot confirm scientific theories.
After having conducted a pre-registered, well-powered experiment I now know what other philosophers in response to Popper had already said decades ago:
Falsifying a theory is also hard to impossible.
14 out of 15 empirical results were not consistent with my theoretical predictions.
But does this prove the theory is wrong? No, because so many explanations could account for the hypotheses-inconsistent findings.
In contrast to the usual strategy of strengthening a finding to avoid a false-positive, the goal in the case of null results is to avoid a false negative.
Hence, I conducted various posthoc analyses to test rivaling explanations for the absence of an observed effect.
▶️Analyses on reliability and treatment-induced attrition suggested that measurement issues likely have not hidden true effects
▶️Also, I used machine-learning approaches by @Susan_Athey et al. to assess whether effects might be hidden in subgroups
▶️I used equivalence tests by @lakens, @annemscheel, @peder_isager to ensure not to miss any potentially meaningful but insignificant effects
▶️Finally, I checked various design features, showing that indeed only one experimental arm can be understood as a valid test
In hindsight, I would now likely also add Bayesian analyses to the list of measures for reacting to or preparing for null results. We are learning more and more about how to make null findings informative. That's great.
All of these tests increased my confidence in the conclusion that, indeed, there is something wrong with the theory itself in its current form. Hence, it likely requires refutation or revision.
Unfortunately, the study is therefore not the big leap for research on intrinsic political motivation that I once hoped it would be.
I discuss what we still can learn for how to study why some people find pleasure in politics and others do not. /END
• • •
Missing some Tweet in this thread? You can try to
force a refresh
After Hansen&Treul reported evidence for a substantive finding in a JOP article, Saraceno reproduced the analysis and found inconsistent evidence for different model specifications.
The journal then invited all three authors to team up and work together to check the robustness.
The newly minted team decided that multiple analytical choices are reasonable for six variables, leading in combination to 288 possible models with reasonable analytical specifications.
WOW 😍
Boba appears a huge leap towards transparent sensitivity analyses in empirical research
▶️easy multiverse analyses
▶️generates code for all reasonable analytical paths
▶️software-independent (R, Python...)
▶️amazing #dataviz arxiv.org/abs/2007.05551
You will write your syntax and specify where two or more analytical choices are reasonable.
Boba will generate the code for all combinations of reasonable choices for you.
Boba is a domain-specific language (and a visual analysis system) so it works software-independent for #Rstats, #Python etc...
Regression analysis with observational data remains the primary analytical method in political science.
However, we will never know whether we controlled for all unobserved confounders.
New sensitivity tools help to assess how likely results are robust to missing controls.
The robustness value (@analisereal,@chadhazlett) applies to multivar. regressions: It is the minimum strength of association that unobserved confounding would need to have, both with the treatment and with the outcome, to change the research conclusions.
V-DEM, potentially the most comprehensive measure on the state of democracy, has published a new report, and the results are staggering.
▶️Hungary is no longer considered a democracy
▶️India is close, too
▶️Now, the majority of people worldwide live in autocracies
(thread)
It is a prime example of how to communicate complex data on important questions:
Accessible language combined with intuitive graphs that help to convey clear messages.
I'll copy key graphs and phrases while reading along.
Assessing "the state of democracy" is hard because the concept is complex and controversial.
But even if conceptual questions were settled, collecting all necessary measures to assess how democratic a country is is a huge undertaking.
People again discuss that certain countries can no longer be considered democracies (Hungary? US?)
Under which conditions do countries qualify as a 'democracy' and what's the alternative category called?
This thread links to a seminal piece on 'electoral authoritarianism'
One line of disagreement is whether to distinguish countries by the *degree* of democratic quality.
While these fine-grained distinctions have value, in his piece Schedler makes the point that "[some] regimes are not less democratic than democracies, but plainly undemocratic."
This raises the question about the essence of what a democracy is.
According to Schedler, having a choice is at the heart of the democratic idea.
Choosing is institutionalized via elections.
However, both plainly democratic and plainly undemocratic countries conduct elections.
In a new study at #APSR, we show how established operationalization strategies of multi-dimensional concepts can systematically lead to wrong conclusions.
Focusing on populist attitudes, we demonstrate simple methods to align theory and measurement.
👇Summary+Preprint+Shiny App
Our (H. Schoen+@ChristianSchim) argument refers to a specific but common type of multi-dimensional concepts which are sometimes called ‘non-compensatory’. Multi-dimensional concepts are non-compensatory when higher values on one component cannot offset lower values on another.
Think of democracy: If we believe that a country only counts as a democracy if it provides both rule of law and free elections then no valid measure of democracy will assign high democracy scores to a country with low 'rule of law' scores even with the election are very,very fair