Same question was also asked by a reviewer. This is where peer review improves a paper, IMO.
So we did two types of analyses (in section 5.3): 1. We estimated what the power is in these experiments (spoiler: not so low). 2. We asked what the FDR would be with 100% power.
>>
Jan 1, 2022 • 14 tweets • 4 min read
How are effects of online A/B tests distributed? How often are they not significant? Does achieving significance guarantee meaningful business impact?
We answer these questions in our new paper, “False Discovery in A/B Testing”, recently out in Management Science >>
The paper is co-authored with Christophe Van den Bulte and analyzes over 2,700 online A/B tests that were run on the @Optimizely platform by more than 1,300 experimenters.