重读的功率和样本量计算

论文标题

重读的功率和样本量计算

Power and Sample Size Calculations for Rerandomization

论文作者

Branson, Zach, Li, Xinran, Ding, Peng

论文摘要

功率分析是实验设计的重要方面，因为它们有助于确定实践中如何实施实验。通常，指定所需的功率水平并计算获得该功率所需的样本量。此类计算对于完全随机的实验而言是众所周知的，但是使用其他实验设计可能会有很多好处。例如，最近已经确定，重读对象是随机分组直到获得协变量平衡的，这提高了因果效应估计量的精度。这项工作确立了重新指定的治疗控制实验的力量，从而允许样本量计算器。我们发现令人惊讶的结果是，虽然在恢复性下的功率通常比完全随机化更大，但对于非常小的治疗效果可能会发生相反的情况。原因是，重读下的推论可能相对更加保守，因为它可以在相同的名义意义水平下具有较低的I型错误，并且这种额外的保守性对权力产生了不利影响。这一令人惊讶的结果是由于治疗效应异质性引起的，在功率分析中通常忽略了数量。我们发现异质性增加了大效应大小的功率，但降低了小效应大小的功率。

Power analyses are an important aspect of experimental design, because they help determine how experiments are implemented in practice. It is common to specify a desired level of power and compute the sample size necessary to obtain that power. Such calculations are well-known for completely randomized experiments, but there can be many benefits to using other experimental designs. For example, it has recently been established that rerandomization, where subjects are randomized until covariate balance is obtained, increases the precision of causal effect estimators. This work establishes the power of rerandomized treatment-control experiments, thereby allowing for sample size calculators. We find the surprising result that, while power is often greater under rerandomization than complete randomization, the opposite can occur for very small treatment effects. The reason is that inference under rerandomization can be relatively more conservative, in the sense that it can have a lower type-I error at the same nominal significance level, and this additional conservativeness adversely affects power. This surprising result is due to treatment effect heterogeneity, a quantity often ignored in power analyses. We find that heterogeneity increases power for large effect sizes but decreases power for small effect sizes.

下载PDF全文

下载文献需遵守相关版权规定

论文标题